Studying Production Phase SourceForge Projects: An Exploratory Analysis Using cvs2mysql and SFRA

TitleStudying Production Phase SourceForge Projects: An Exploratory Analysis Using cvs2mysql and SFRA
Publication TypeConference Paper
Year of Publication2007
AuthorsDelorey, DP, Knutson, CD, MacLean, AC
Secondary Title2nd Workshop on Public Data about Software Development (WoPDaSD 2007)
Date Published2007
KeywordsData Collection, forge, repositories, sourceforge
Abstract

A wealth of data can be extracted from the natural by-products of software development processes and used in empirical studies of software engineering. However, the size and accuracy of such studies depend in large part on the availability of tools that facilitate the collection of data from individual projects and the combination of data from multiple projects. To demonstrate this point, we present our experience gathering and analyzing data from nearly 10,000 open source projects hosted on SourceForge. We describe the tools we developed to collect the data and the ways in which these tools and data may be used by other researchers. We also provide examples of statistics that we have calculated from these data to describe interesting author- and project-level behaviors of the SourceForge community.

Full Text
AttachmentSize
PDF icon Delorey2007c.pdf139.58 KB