Estimating Commit Sizes Efficiently

Title: Estimating Commit Sizes Efficiently
Publication Type: Conference Paper
Year of Publication: 2009
Authors: Hofmann, P.; Riehle, D.
Secondary Title: OSS2009: Open Source Ecosystems: Diverse Communities Interacting (IFIP 2.13)
Pagination: 105-115
Date Published: 2009
ISBN: 978-3-642-02031-5

The quantitative analysis of software projects can provide insights that let us better understand open source and other software development projects. An important variable used in the analysis of software projects is the amount of work being contributed, the commit size. Unfortunately, post-facto, the commit size can only be estimated, not measured. This paper presents several algorithms for estimating the commit size. Our performance evaluation shows that simple, straightforward heuristics are superior to the more complex text-analysis-based algorithms. Not only are the heuristics significantly faster to compute, they also deliver more accurate results when estimating commit sizes. Based on this experience, we design and present an algorithm that improves on the heuristics, can be computed equally fast, and is more accurate than any of the prior approaches.

Full Text
Estimating Commit Sizes Efficiently (PDF, 433.37 KB)