Warehousing and Studying Open Source Versioning Metadata

TitleWarehousing and Studying Open Source Versioning Metadata
Publication TypeBook
Year of Publication2010
Authorsvan Antwerp, M., and Madey G.
Secondary AuthorsÅgerfalk, Pär, Boldyreff Cornelia, González-Barahona Jesús M., Madey Gregory R., and Noll John
Secondary TitleIFIP Advances in Information and Communication Technology Open Source Software: New Horizons (OSS 2010)
Volume319
Pagination413 - 418
PublisherSpringer Berlin Heidelberg
Place PublishedBerlin, Heidelberg
ISSN Number1861-2288
ISBN Number978-3-642-13244-5
Keywordsberlios, cvs, savannah, scm, sourceforge, srda, subversion, svn
Abstract

In this paper, we describe the downloading and warehousing of Open Source Software (OSS) versioning metadata from SourceForge, BerliOS Developer, and GNU Savannah. This data enables and supports research in areas such as software engineering, open source phenomena, social network analysis, data mining, and project management. This newly-formed database containing Concurrent Versions System (CVS) and Subversion (SVN) metadata offers new research opportunities for large-scale OSS development analysis. The CVS and SVN data is juxtaposed with the SourceForge.net Research Data Archive [5] for the purpose of performing more powerful and interesting queries. We also present an initial statistical analysis of some of the most active projects.

DOI10.1007/978-3-642-13244-5_40