You are here

Data and analyses sharing to support research on free/libre open source software

Submitted by superadmin on Thu, 2009-09-24 23:34

Title	Data and analyses sharing to support research on free/libre open source software
Publication Type	Conference Paper
Year of Publication	2007
Secondary Title	OSS2007: Open Source Development, Adoption and Innovation (IFIP 2.13)
Volume	234/2007
Pagination	355 - 356
Date Published	2007///
Publisher	Springer
ISSN Number	978-0-387-72485-0
Abstract	Research on FLOSS has relied on several different kinds of scientific evidence, such as the archives created by the FLOSS developers, versioned code repositories, mailing list messages and bug and issue tracking repositories [1]. FLOSS teams retain and make public archives of many of their activities as by-products of their open technology-supported collaboration. However, the easy availability of primary data provides a misleading picture of ease of conducting research on FLOSS. Precisely because these data are by-products, they are generally not in a form that is useful for researchers. Instead potentially useful data is locked up in HTML pages, CVS log files, text-only mailing list archives or dumps of website databases. FLOSS research projects, therefore, expend significant energy collecting and re-structuring these archives for their research, which is repetitive and wasteful [2]. Furthermore, different researchers will extract different data at different points in time, take different approaches to processing and cleaning data and make different decisions about analyses, but without all of these decisions being visible, auditable or reproducible. In principle, these latter problems can be addressed by individual researchers better documenting what they have done. However, research publications typically have restrictions on publication lengths that make complete discussion impossible. Furthermore, published papers are just the tip of the iceberg, and knowing what others have done does not necessarily make it any easier to replicate the results.
DOI	10.1007/978-0-387-72486-7_42
Full Text

Attachment	Size
Data and analyses sharing	96.04 KB