Programming Language Trends in Open Source Development: An Evaluation Using Data from All Production Phase SourceForge Projects
|Title||Programming Language Trends in Open Source Development: An Evaluation Using Data from All Production Phase SourceForge Projects|
|Publication Type||Conference Paper|
|Year of Publication||2007|
|Authors||Delorey, Daniel P., Knutson Charles D., and Giraud-Carrier C.|
|Secondary Title||2nd Workshop on Public Data about Software Development (WoPDaSD 2007)|
|Keywords||cvs, cvs2mysql, programming languages, sfra, sourceforge, srda|
In this work, we analyze data collected from the CVS repos- itories of 9,997 Open Source projects hosted on SourceForge in an effort to understand trends in programming language usage in the Open Source community between 2000 and 2005. The trends we consider include: 1) the relative popularity of the ten most popular programming languages over time, 2) the use of multiple programming languages by individual programmers and by individual projects, and 3) the programming languages most often used in combination.
"Our data were gathered from the SourceForge Research Ar- chive (SFRA)  and the CVS repositories Open Source projects hosted on SourceForge. We used cvs2mysql and SFRA+ to collect the data. cvs2mysql gathers data from CVS repositories and writes them to SQL scripts for im- port into a MySQL 5.0 database. The data collected by cvs2mysql are the name of the file, the location of the file in the repository, the type and state of the file, as well as the author, date, number of lines added and removed, and the author’s message for each revision to the file."