WoPDaSD 2009: 4th Workshop on Public Data about Software Development 4th edition of the WoPDaSD, co-located with the OSS 2009 Conference Motivation In the latest years, and specially thanks to the huge availability of data about software development that can be obtained from libre (free, open source) projects, the research community is starting to produce, use and exchange large data sets of information. These data sets have to be retrieved, purged, described, and can be published for public consumption by other groups. Their availability allows for the decoupling of research activities (some groups can focus on data retrieval and preliminary analysis, which others can devote to more in-depth analysis without bothering with data retrieval), the reproducibility of research results, and even the collaboration (and competition) in the analysis of data. All this activity is being presented in several workshops and conferences, but since they are not focused specifically on it, exchange of experiences and discussions are not as deep and fruitful as they could be. This workshop is once again (for the third year in a row) being such a place, hosting researchers in the field to discuss specifically about this kind of data sets, how they are retrieved, how can they be analyzed and mined, how they can be exchanged and complemented, etc. Main goals The goal of this workshop is to foster the analysis of public available data sources about software development, and the exchange of data between different research groups. The workshop is aimed at three different kinds of studies (although other related studies could also be considered): 1. Analysis of specific projects (provided by the organizers, see below). 2. Retrieval process and exchange formats of public available data collections about software development. 3. Data mining activities and new retrieval tools. Detailed description of data sources Following the goals described above, the workshop will accept papers about two specific issues (not taking into account the development of new data mining tools): 1. Analysis of two data collections about libre software development: FLOSSMole and FLOSSMetrics FLOSSMole FLOSSMole (formerly OSSmole) is a set of tools for gathering data (metrics) about the development of free/libre/open source projects. The FLOSSMole project also publishes the resulting analysis about FLOSS projects, and accepts data donations from other research groups. It offers this workshop a complete set of data gathered from the SourceForge development platform and the Freshmeat announcement systems. More information can be obtained from http://ossmole.sourceforge.net FLOSSMetrics At the end of FLOSSMetrics a huge database with data from thousands of projects. Nowadays, the project is already working on the retrieval of the data, with information already available for about 2,000 projects (mainly retrieved from CVS and SVN repositories, but also mailing lists and issue tracking systems). These results are publicly available at http://melquiades.flossmetrics.org More information can be obtained from http://flossmetrics.org Target audience The target audience is composed by the research groups interested in empirical software engineering and quantitative studies of the software development processes and methods. This includes not only software engineers, but also researchers from other fields that might use the data for economic, social and other studies. Submissions We solicit short position papers (3 pages) and research papers (6 pages). Short papers will be expected to discuss controversial issues in the field, or describe interesting or thought-provoking ideas that are not yet fully developed, while full papers will be expected to describe new research results, and have a higher degree of technical rigor than short papers. The papers must be in ACM 2-column format. Authors may indicate their intent to submit a contribution (the title of the paper and abstract will need to be submitted online using an EasyChair acccount at the EasyChair WoPDaSD area. The full paper should be sent using the aforementioned process by the deadline in PDF format. See deadlines in the corresponding section, below. If there is any problem, please, send the abstract and full paper via e-mail to megan @ elon.edu, jgb @ gsyc.es, and dizquierdo @ gsyc.es. Accepted papers will be published in agreement with the OSS requirements, as open access publications if possible. Place (see details in the OSS 2009 website): * First Hotel Billingehus Important Dates * Intent to submit: 30th January 2009 (not mandatory, only for organizational purposes) Organizing Committee * Jesus M. Gonzalez-Barahona (Universidad Rey Juan Carlos, Spain) Program Committee (tentative) * Kevin Crowston (Syracuse University, USA) Sponsoring projects * FLOSSMole Some of these projects are funded in part by the European Commission, under the Information Society Technologies (IST) research programme of the Sixth Framework Program. A list of the IST projects in the area of Software Technologies is available from http://cordis.europa.eu/ist/st/projects.htm. |
|||
