%0 Conference Paper %B Proceedings of the 17th ACM Conference on Computer Supported Cooperative Work &\#38; Social Computing %D 2014 %T How Social Q&A Sites Are Changing Knowledge Sharing in Open Source Software Communities %A Vasilescu, Bogdan %A Serebrenik, Alexander %A Devanbu, Prem %A Filkov, Vladimir %K a %K crowdsourced knowledge %K gamification. %K mailing lists %K open source %K social q&\#38 %X Historically, mailing lists have been the preferred means for coordinating development and user support activities. With the emergence and popularity growth of social Q&A sites such as the StackExchange network (e.g., StackOverflow), this is beginning to change. Such sites offer different socio-technical incentives to their participants than mailing lists do, e.g., rich web environments to store and manage content collaboratively, or a place to showcase their knowledge and expertise more vividly to peers or potential recruiters. A key difference between StackExchange and mailing lists is gamification, i.e., StackExchange participants compete to obtain reputation points and badges. In this paper, we use a case study of R (a widely-used tool for data analysis) to investigate how mailing list participation has evolved since the launch of StackExchange. Our main contribution is the assembly of a joint data set from the two sources, in which participants in both the texttt{r-help} mailing list and StackExchange are identifiable. This permits their activities to be linked across the two resources and also over time. With this data set we found that user support activities show a strong shift away from texttt{r-help}. In particular, mailing list experts are migrating to StackExchange, where their behaviour is different. First, participants active both on texttt{r-help} and on StackExchange are more active than those who focus exclusively on only one of the two. Second, they provide faster answers on StackExchange than on texttt{r-help}, suggesting they are motivated by the emph{gamified} environment. To our knowledge, our study is the first to directly chart the changes in behaviour of specific contributors as they migrate into gamified environments, and has important implications for knowledge management in software engineering. %B Proceedings of the 17th ACM Conference on Computer Supported Cooperative Work &\#38; Social Computing %S CSCW '14 %I ACM %C New York, NY, USA %P 342–354 %@ 978-1-4503-2540-0 %U http://doi.acm.org/10.1145/2531602.2531659 %R 10.1145/2531602.2531659 %> https://flosshub.org/sites/flosshub.org/files/cscw14.pdf %0 Conference Paper %B Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - ICSE '10 %D 2010 %T Linking e-mails and source code artifacts %A Bacchelli, Alberto %A Lanza, Michele %A Robbes, Romain %Y Kramer, Jeff %Y Bishop, Judith %Y Devanbu, Prem %Y Uchitel, Sebastian %X E-mails concerning the development issues of a system constitute an important source of information about high-level design decisions, low-level implementation concerns, and the social structure of developers. Establishing links between e-mails and the software artifacts they discuss is a non-trivial problem, due to the inherently informal nature of human communication. Different approaches can be brought into play to tackle this traceability issue, but the question of how they can be evaluated remains unaddressed, as there is no recognized benchmark against which they can be compared. In this article we present such a benchmark, which we created through the manual inspection of a statistically significant number of e-mails pertaining to six unrelated software systems. We then use our benchmark to measure the effectiveness of a number of approaches, ranging from lightweight approaches based on regular expressions to full-fledged information retrieval approaches. %B Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - ICSE '10 %I ACM Press %C Cape Town, South Africa %V 1 %P 375-384 %8 05/2010 %@ 9781605587196 %U http://www.inf.usi.ch/phd/bacchelli/publications.php %! ICSE '10 %R 10.1145/1806799.1806855 %0 Conference Paper %B Fourth International Workshop on Mining Software Repositories (MSR'07:ICSE Workshops 2007) %D 2007 %T Detecting Patch Submission and Acceptance in OSS Projects %A Christian Bird %A Gourley, Alex %A Devanbu, Prem %K apache %K contributions %K mysql %K patches %K postgresql %K python %K scm %K source code %X The success of open source software (OSS) is completely dependent on the work of volunteers who contribute their time and talents. The submission of patches is the major way that participants outside of the core group of developers make contributions. We argue that the process of patch submission and acceptance into the codebase is an important piece of the open source puzzle and that the use of patch-related data can be helpful in understanding how OSS projects work. We present our methods in identifying the submission and acceptance of patches and give results and evaluation in applying these methods to the Apache webserver, Python interpreter, Postgres SQL database, and (with limitations) MySQL database projects. In addition, we present valuable ways in which this data has been and can be used. %B Fourth International Workshop on Mining Software Repositories (MSR'07:ICSE Workshops 2007) %I IEEE %C Minneapolis, MN, USA %P 26 - 26 %@ 0-7695-2950-X %R 10.1109/MSR.2007.6 %> https://flosshub.org/sites/flosshub.org/files/28300026.pdf %0 Conference Paper %B Fourth International Workshop on Mining Software RepositoriesFourth International Workshop on Mining Software Repositories (MSR'07:ICSE Workshops 2007) %D 2007 %T Open Borders? Immigration in Open Source Projects %A Christian Bird %A Gourley, Alex %A Devanbu, Prem %A Swaminathan, Anand %A Hsu, Greta %K apache %K core %K joining %K postgresql %K python %K team %X Open source software is built by teams of volunteers. Each project has a core team of developers, who have the authority to commit changes to the repository; this team is the elite, committed foundation of the project, selected through a meritocratic process from a larger number of people who participate on the mailing list. Most projects carefully regulate admission of outsiders to full developer privileges; some projects even have formal descriptions of this process. Understanding the factors that influence the "who, how and when" of this process is critical, both for the sustainability of FLOSS projects, and for outside stakeholders who want to gain entry and succeed. In this paper we mount a quantitative case study of the process by which people join FLOSS projects, using data mined from the Apache web server, Postgres, and Python. We develop a theory of open source project joining, and evaluate this theory based on our data. %B Fourth International Workshop on Mining Software RepositoriesFourth International Workshop on Mining Software Repositories (MSR'07:ICSE Workshops 2007) %I IEEE %C Minneapolis, MN, USA %P 6 - 6 %@ 0-7695-2950-X %R 10.1109/MSR.2007.23 %> https://flosshub.org/sites/flosshub.org/files/28300006.pdf %0 Conference Paper %B Proceedings of the 2006 international workshop on Mining software repositories %D 2006 %T Mining email social networks %A Christian Bird %A Gourley, Alex %A Devanbu, Prem %A Gertz, Michael %A Swaminathan, Anand %K communication %K contributions %K developers %K email %K email archives %K mailing lists %K open source %K social networks %X Communication & Co-ordination activities are central to large software projects, but are difficult to observe and study in traditional (closed-source, commercial) settings because of the prevalence of informal, direct communication modes. OSS projects, on the other hand, use the internet as the communication medium,and typically conduct discussions in an open, public manner. As a result, the email archives of OSS projects provide a useful trace of the communication and co-ordination activities of the participants. However, there are various challenges that must be addressed before this data can be effectively mined. Once this is done, we can construct social networks of email correspondents, and begin to address some interesting questions. These include questions relating to participation in the email; the social status of different types of OSS participants; the relationship of email activity and commit activity (in the CVS repositories) and the relationship of social status with commit activity. In this paper, we begin with a discussion of our infrastructure (including a novel use of Scientific Workflow software) and then discuss our approach to mining the email archives; and finally we present some preliminary results from our data analysis. %B Proceedings of the 2006 international workshop on Mining software repositories %S MSR '06 %I ACM %C New York, NY, USA %P 137–143 %@ 1-59593-397-2 %U http://doi.acm.org/10.1145/1137983.1138016 %R http://doi.acm.org/10.1145/1137983.1138016 %> https://flosshub.org/sites/flosshub.org/files/137MiningEmail.pdf %0 Conference Paper %B Proceedings of the 2006 international workshop on Mining software repositories %D 2006 %T Mining email social networks in Postgres %A Christian Bird %A Gourley, Alex %A Devanbu, Prem %A Gertz, Michael %A Swaminathan, Anand %K developers %K email %K email archives %K open source %K postgresql %K scm %K social network analysis %K social networks %K source code %K status %X Open Source Software (OSS) projects provide a unique opportunity to gather and analyze publicly available historical data. The Postgres SQL server, for example, has over seven years of recorded development and communication activity. We mined data from both the source code repository and the mailing list archives to examine the relationship between communication and development in Postgres. Along the way, we had to deal with the difficult challenge of resolving email aliases. We used a number of social network analysis measures and statistical techniques to analyze this data. We present our findings in this paper. %B Proceedings of the 2006 international workshop on Mining software repositories %S MSR '06 %I ACM %C New York, NY, USA %P 185–186 %@ 1-59593-397-2 %U http://doi.acm.org/10.1145/1137983.1138033 %R http://doi.acm.org/10.1145/1137983.1138033 %> https://flosshub.org/sites/flosshub.org/files/185MiningEmail.pdf