Towards base rates in software analytics

TitleTowards base rates in software analytics
Publication TypeJournal Article
Year of Publication2013
AuthorsBruntink, M
Refereed DesignationRefereed
Secondary TitleScience of Computer Programming
Date Published11/2013
ISSN Number01676423
Keywordsohloh
Abstract

Nowadays a vast and growing body of open source software (OSS) project data is publicly available on the internet. Despite this public body of project data, the field of software analytics has not yet settled on a solid quantitative base for basic properties such as code size, growth, team size, activity, and project failure. What is missing is a quantification of the base rates of such properties, where other fields (such as medicine) commonly rely on base rates for decision making and the evaluation of experimental results. The lack of knowledge in this area impairs both research activities in the field of software analytics and decision making on software projects in general. This paper contributes initial results of our research towards obtaining base rates using the data available at Ohloh (a large-scale index of OSS projects). Zooming in on the venerable ‘lines of code’ metric for code size and growth, we present and discuss summary statistics and identify further research challenges.

URLhttp://www.sciencedirect.com/science/article/pii/S0167642313003079
DOI10.1016/j.scico.2013.11.023
Short TitleScience of Computer Programming
Full Text
Taxonomy upgrade extras: