Big data analytics on large-scale socio-technical software engineering archives
Title | Big data analytics on large-scale socio-technical software engineering archives |
Publication Type | Conference Paper |
Year of Publication | 2015 |
Authors | Bayati, S, Parsons, D, Susnjak, T, Heidary, M |
Secondary Title | 2015 3rd International Conference on Information and Communication Technology (ICoICT )2015 3rd International Conference on Information and Communication Technology (ICoICT) |
Pagination | 65 - 69 |
Publisher | IEEE |
Place Published | Nusa Dua, Bali, Indonesia |
Abstract | Given the fast growing nature of software engineering data in online software repositories and open source communities, it would be helpful to analyse these assets to discover valuable information about the software engineering development process and other related data. Big Data Analytics (BDA) techniques and frameworks can be applied on these data resources to achieve a high-performance and relevant data collection and analysis. Software engineering is a socio-technical process which needs development team collaboration and technical knowledge to develop a high-quality application. GitHub, as an online social coding foundation, contains valuable information about the software engineers' communications and project life cycles. In this paper, unsupervised data mining techniques are applied on the data collected by general Big Data approaches to analyse GitHub projects, source codes and interactions. Source codes and projects are clustered using features and metrics derived from historical data in repositories, object oriented programming metrics and the influences of developers on source codes. |
DOI | 10.1109/ICoICT.2015.7231398 |
Full Text |
- Log in or register to post comments
- Google Scholar
- DOI
- BibTeX
- Tagged
- EndNote XML