Survival analysis on the duration of open source projects

TitleSurvival analysis on the duration of open source projects
Publication TypeJournal Article
Year of Publication2010
AuthorsSamoladas, Ioannis, Angelis Lefteris, and Stamelos Ioannis
Secondary TitleInformation and Software Technology
Volume52
Issue9
Pagination902 - 922
Date Published9/2010
ISSN Number09505849
Keywordsflossmetrics, prediction, source code, survival analysis
Abstract

Context

Open source (FLOSS) project survivability is an important piece of information for many open source stakeholders. Coordinators of open source projects would like to know the chances for the survival of the projects they coordinate. Companies are also interested in knowing how viable a project is in order to either participate or invest in it, and volunteers want to contribute to vivid projects.
Objective

The purpose of this article is the application of survival analysis techniques for estimating the future development of a FLOSS project.
Method

In order to apply such approach, duration data regarding FLOSS projects from the FLOSSMETRICS (This work was partially supported by the European Community’s Sixth Framework Program under the Contract FP6-033982) database were collected. Such database contains metadata for thousands of FLOSS projects, derived from various forges. Subsequently, survival analysis methods were employed to predict the survivability of the projects, i.e. their probability of continuation in the future, by examining their duration, combined with other project characteristics such as their application domain and number of committers.
Results

It was shown how probability of termination or continuation may be calculated and how a prediction model may be built to upraise project future. In addition, the benefit of adding more committers to FLOSS projects was quantified.
Conclusion

Analysis results demonstrate the usefulness of the proposed framework for assessing the survival probability of a FLOSS project.

Notes

"For the purposes of our study we used data coming only from source code repositories."

DOI10.1016/j.infsof.2010.05.001
Short TitleInformation and Software Technology