%0 Conference Proceedings %B International Conference on Open Source Systems 2016 %D 2016 %T Women in Free/Libre/Open Source Software: The Situation in the 2010s %A Gregorio Robles %A Reina, Laura Arjona %A González-Barahona, Jesús M. %A Dueñas Dominguez, Santiago %X Women are underrepresented in the IT sector. But the situation in FLOSS (free, libre, open source software) development is really extreme in this respect: past publications and studies show a female participation of around 2% to 5% and have shed some light into this problem. In this paper, we give an update the state of knowledge tot he current situation of gender in FLOSS, by analyzing the results of surveying more than 2,000 contributors to FLOSS projects in 2013, of which more than 200 were women. Our findings confirm that women enter the FLOSS community later than men, do primarily other tasks than coding, participate less if they have children, and have slightly different reasons to enter (and to stay in) the development communities they join. However, we also find evidence that women are joiningFLOSS projects in higher numbers in recent years, and that the share of women devoting few hours per week to FLOSS and full-time dedication is higher than for men. All in all, comparing our results with the ones from the 2000s, the context of participation of women in FLOSS has not changed much. %B International Conference on Open Source Systems 2016 %S Open Source Systems: Integrating Communities %I Springer %P 163-173 %8 05/2016 %> https://flosshub.org/sites/flosshub.org/files/paper-pre.pdf %0 Book Section %B Open Source Systems: Integrating Communities: 12th IFIP WG 2.13 International Conference, OSS 2016, Gothenburg, Sweden, May 30 - June 2, 2016, Proceedings %D 2016 %T Women in Free/Libre/Open Source Software: The Situation in the 2010s %A Gregorio Robles %A Reina, Laura Arjona %A González-Barahona, Jesús M. %A Domínguez, Santiago Dueñas %E Kevin Crowston %E Hammouda, Imed %E Lundell, Björn %E Gregorio Robles %E Gamalielsson, Jonas %E Juho Lindman %X Women are underrepresented in the IT sector. But the situation in FLOSS (free, libre, open source software) development is really extreme in this respect: past publications and studies show a female participation of around 2 % to 5 % and have shed some light into this problem. In this paper, we give an update the state of knowledge to the current situation of gender in FLOSS, by analyzing the results of surveying more than 2,000 contributors to FLOSS projects in 2013, of which more than 200 were women. Our findings confirm that women enter the FLOSS community later than men, do primarily other tasks than coding, participate less if they have children, and have slightly different reasons to enter (and to stay in) the development communities they join. However, we also find evidence that women are joining FLOSS projects in higher numbers in recent years, and that the share of women devoting few hours per week to FLOSS and full-time dedication is higher than for men. All in all, comparing our results with the ones from the 2000s, the context of participation of women in FLOSS has not changed much. %B Open Source Systems: Integrating Communities: 12th IFIP WG 2.13 International Conference, OSS 2016, Gothenburg, Sweden, May 30 - June 2, 2016, Proceedings %I Springer International Publishing %C Cham %P 163–173 %@ 978-3-319-39225-7 %U http://dx.doi.org/10.1007/978-3-319-39225-7_13 %& Women in Free/Libre/Open Source Software: The Situation in the 2010s %R 10.1007/978-3-319-39225-7_13 %0 Book %B IFIP Advances in Information and Communication TechnologyOpen Source Software: Mobile Open Source Technologies. 10th IFIP WG 2.13 International Conference on Open Source Systems, OSS 2014, San José, Costa Rica, May 6-9, 2014. Proceedings %D 2014 %T Code Review Analytics: WebKit as Case Study %A González-Barahona, Jesús M. %A Izquierdo-Cortázar, Daniel %A Gregorio Robles %A Mario Gallegos %E Corral, Luis %E Sillitti, Alberto %E Succi, Giancarlo %E Vlasenko, Jelena %E Wasserman, Anthony I. %X During the last years, most of the large free / open source software projects have included code review as an usual, or even mandatory practice for changes to their code. In many cases it is implemented as a process in which a developer proposing some change needs to ask for a review by another developer before it can enter the code base. Code reviews, therefore, become a critical process for the project, which could cause delays in contributions being accepted, and risk to become a bottleneck if not enough reviewers are available. In this paper we present a methodology designed to analyze the code review process, to determine its main characteristics and parameters, and to detect potential problems with it. We also present how we have applied this methodology to the WebKit project, learning about the main characteristics of how code review works in their case. %B IFIP Advances in Information and Communication TechnologyOpen Source Software: Mobile Open Source Technologies. 10th IFIP WG 2.13 International Conference on Open Source Systems, OSS 2014, San José, Costa Rica, May 6-9, 2014. Proceedings %I Springer Berlin Heidelberg %C Berlin, Heidelberg %V 427 %P 1 - 10 %@ 978-3-642-55128-4 %R 10.1007/978-3-642-55128-4_1 %0 Conference Paper %B Proceedings of the 11th Working Conference on Mining Software Repositories %D 2014 %T Estimating Development Effort in Free/Open Source Software Projects by Mining Software Repositories: A Case Study of OpenStack %A Gregorio Robles %A González-Barahona, Jesús M. %A Cervigón, Carlos %A Capiluppi, Andrea %A Izquierdo-Cortázar, Daniel %K effort estimation %K free software %K mining software repositories %K open source %K openstack %X Because of the distributed and collaborative nature of free / open source software (FOSS) projects, the development effort invested in a project is usually unknown, even after the software has been released. However, this information is becoming of major interest, especially ---but not only--- because of the growth in the number of companies for which FOSS has become relevant for their business strategy. In this paper we present a novel approach to estimate effort by considering data from source code management repositories. We apply our model to the OpenStack project, a FOSS project with more than 1,000 authors, in which several tens of companies cooperate. Based on data from its repositories and together with the input from a survey answered by more than 100 developers, we show that the model offers a simple, but sound way of obtaining software development estimations with bounded margins of error. %B Proceedings of the 11th Working Conference on Mining Software Repositories %S MSR 2014 %I ACM %C New York, NY, USA %P 222–231 %@ 978-1-4503-2863-0 %U http://doi.acm.org/10.1145/2597073.2597107 %R 10.1145/2597073.2597107 %> https://flosshub.org/sites/flosshub.org/files/robles_0.pdf %0 Conference Paper %B Proceedings of the 11th Working Conference on Mining Software Repositories %D 2014 %T FLOSS 2013: A Survey Dataset About Free Software Contributors: Challenges for Curating, Sharing, and Combining %A Gregorio Robles %A Reina, Laura Arjona %A Serebrenik, Alexander %A Vasilescu, Bogdan %A González-Barahona, Jesús M. %K anonymization %K data combining %K data sharing %K ethics %K free software %K microdata %K msr data showcase %K open data %K open source %K privacy %K Survey %X In this data paper we describe a data set obtained by means of performing an on-line survey to over 2,000 Free Libre Open Source Software (FLOSS) contributors. The survey includes questions related to personal characteristics (gender, age, civil status, nationality, etc.), education and level of English, professional status, dedication to FLOSS projects, reasons and motivations, involvement and goals. We describe as well the possibilities and challenges of using private information from the survey when linked with other, publicly available data sources. In this regard, an example of data sharing will be presented and legal, ethical and technical issues will be discussed. %B Proceedings of the 11th Working Conference on Mining Software Repositories %S MSR 2014 %I ACM %C New York, NY, USA %P 396–399 %@ 978-1-4503-2863-0 %U http://doi.acm.org/10.1145/2597073.2597129 %R 10.1145/2597073.2597129 %> https://flosshub.org/sites/flosshub.org/files/msr14gregorio.pdf %0 Journal Article %J Revista Eletrônica de Sistemas de Informação %D 2014 %T SENTIMENT ANALYSIS OF FREE/OPEN SOURCE DEVELOPERS: PRELIMINARY FINDINGS FROM A CASE STUDY %A Rousinopoulos, Athanasios-Ilias %A Gregorio Robles %A González-Barahona, Jesús M. %K developer productivity %K FLOSS %K mailing lists %K natural language processing %K openSUSE %K sentiment analysis %K software development; software repository mining %X Software development is a human intensive activity. And as such, how developers face their tasks is of major importance. In an environment such as the one that is common in FOSS (free/open source software) projects where professionals (i.e., paid developers) share the development effort with volunteers, the morale of the development and user community is of major importance. In this paper, we present a preliminary analysis using sentiment analysis techniques to a FOSS project. We therefore mine the mailing list of a project and apply these techniques to the most relevant participants. Although the application is at this time limited, we hope that this experience can be of benefit in the future to determine situations that may affect the developers or the project, such as low productivity, developer abandonment, project forking, etc. %B Revista Eletrônica de Sistemas de Informação %V 13 %8 08/2014 %U http://189.16.45.2/ojs/index.php/reinfo/article/view/1677 %N 2 %! RESI %R 10.5329/RESI.2014.1302006 %> https://flosshub.org/sites/flosshub.org/files/1677-6732-1-PB.pdf %0 Conference Proceedings %B IFIP Advances in Information and Communication Technology (OSS 2012) %D 2012 %T A Comprehensive Study of Software Forks: Dates, Reasons and Outcomes %A Gregorio Robles %A González-Barahona, Jesús M. %K forking %K forks %K free software %K Legal %K open source %K social %K software evolution %K sustainability %X Summary. In general it is assumed that a software product evolves within the authoring company or group of developers that develop the project. However, in some cases different groups of developers make the software evolve in different directions, a situation which is commonly known as a fork. In the case of free software, although forking is a practice that is considered as a last resort, it is inherent to the four freedoms. This paper tries to shed some light on the practice of forking. Therefore, we have identified significant forks, several hundreds in total, and have studied them in depth. Among the issues that have been analyzed for each fork is the date when the forking occurred, the reason of the fork, and the outcome of the fork, i.e., if the original or the forking project are still developed. Our investigation shows, among other results, that forks occur in every software domain, that they have become more frequent in recent years, and that very few forks merge with the original project. %B IFIP Advances in Information and Communication Technology (OSS 2012) %I IFIP AICT %V 378 %P 1-14 %> https://flosshub.org/sites/flosshub.org/files/paper_0.pdf %0 Conference Proceedings %B IFIP Advances in Information and Communication Technology 378 (OSS 2012) %D 2012 %T Do More Experienced Developers Introduce Fewer Bugs? %A Izquierdo-Cortázar, Daniel %A Gregorio Robles %A González-Barahona, Jesús M. %K mercurial %K mozilla %K scm %K source code analysis %X Developer experience is a common matter of study in the software maintenance and evolution research literature. However it is still not well understood if less experienced developers are more prone to introduce errors in the source code than their more experienced colleagues. This paper aims to study the relationships between experience and the bug introduction ratio using the Mozilla community as case of study. As results, statistical differences among developers with different levels of experience has not been observed, when the expected result would have been the opposite1. %B IFIP Advances in Information and Communication Technology 378 (OSS 2012) %I IFIP AICT, Springer %V 378 %P 268-273 %8 09/2012 %0 Book %B IFIP Advances in Information and Communication TechnologyOpen Source Software: New Horizons %D 2010 %T Open Source Software Developer and Project Networks %A Madey, G. %A van Antwerp, M. %E Ågerfalk, Pär %E Boldyreff, Cornelia %E González-Barahona, Jesús M. %E Madey, Gregory R. %E Noll, John %K berlios %K savannah %K sourceforge %X This paper outlines complex network concepts and how social networks are built from Open Source Software (OSS) data. We present an initial study of the social networks of three different OSS forges, BerliOS Developer, GNU Savannah, and SourceForge. Much research has been done on snapshot or conflated views of these networks, especially SourceForge, due to the size of the SourceForge community. The degree distribution, connectedness, centrality, and scale-free nature of SourceForge has been presented for the network at particular points in time. However, very little research has been done on how the network grows, how connections were made, especially during its infancy, and how these metrics evolve over time. %B IFIP Advances in Information and Communication TechnologyOpen Source Software: New Horizons %I Springer Berlin Heidelberg %C Berlin, Heidelberg %V 319 %P 407 - 412 %@ 978-3-642-13244-5 %R 10.1007/978-3-642-13244-5_39 %0 Book %B IFIP Advances in Information and Communication Technology Open Source Software: New Horizons (OSS 2010) %D 2010 %T Warehousing and Studying Open Source Versioning Metadata %A van Antwerp, M. %A Madey, G. %E Ågerfalk, Pär %E Boldyreff, Cornelia %E González-Barahona, Jesús M. %E Madey, Gregory R. %E Noll, John %K berlios %K cvs %K savannah %K scm %K sourceforge %K srda %K subversion %K svn %X In this paper, we describe the downloading and warehousing of Open Source Software (OSS) versioning metadata from SourceForge, BerliOS Developer, and GNU Savannah. This data enables and supports research in areas such as software engineering, open source phenomena, social network analysis, data mining, and project management. This newly-formed database containing Concurrent Versions System (CVS) and Subversion (SVN) metadata offers new research opportunities for large-scale OSS development analysis. The CVS and SVN data is juxtaposed with the SourceForge.net Research Data Archive [5] for the purpose of performing more powerful and interesting queries. We also present an initial statistical analysis of some of the most active projects. %B IFIP Advances in Information and Communication Technology Open Source Software: New Horizons (OSS 2010) %I Springer Berlin Heidelberg %C Berlin, Heidelberg %V 319 %P 413 - 418 %@ 978-3-642-13244-5 %R 10.1007/978-3-642-13244-5_40 %0 Conference Paper %B 2009 42nd Hawaii International Conference on System Sciences %D 2009 %T On the Analysis of Contributions from Privileged Users in Virtual Open Communities %A Ortega, Felipe %A Izquierdo-Cortázar, Daniel %A González-Barahona, Jesús M. %A Gregorio Robles %X Collaborative projects built around virtual communities on the Internet have gained momentum over the last decade. Nevertheless, their rapid growth rate rises some questions:which is the most effective approach to manage and organize their content creation process? Can these communities scale, controlling their projects as their size continues to grow over time? To answer these questions, we undertake a quantitative analysis of privileged users in FLOSS development projects and in Wikipedia. From our results, we conclude that the inequality level of user contributions in both types of initiatives is remarkably distinct, even though both communities present almost identical patterns regard-ing the number of distinct contributors per file (in FLOSS projects) or per article (in Wikipedia). As a result, totally open projects like Wikipedia can effectively deal with faster growing rates, while FLOSS projects may be affected by bottlenecks on committers who play critical roles. %B 2009 42nd Hawaii International Conference on System Sciences %I IEEE %C Waikoloa, Hawaii, USA %P 1 - 10 %@ 978-0-7695-3450-3 %R 10.1109/HICSS.2009.328 %0 Journal Article %J International Journal of Open Source Software and Processes %D 2009 %T Tools for the Study of the Usual Data Sources found in Libre Software Projects %A Gregorio Robles %A González-Barahona, Jesús M. %A Izquierdo-Cortazar, Daniel %A Herraiz, Israel %K bug tracking systems %K data sources %K mailing lists %K scm %K tools %X Due to the open nature of Free/Libre/Open Source software projects, researchers have gained access to a rich set of development-related information. Although this information is publicly available on the Internet, obtaining and analyzing it in a convenient way is not an easy task and many considerations have to be taken into account. In this paper we present the most important data sources that can be found in libre software projects and that are studied by the research community: source code, source code management systems, mailing lists and bug tracking systems. We will give advice for the problems that can be found when retrieving and preparing the data sources for a posterior analysis, as well as provide information about the tools that support these tasks. %B International Journal of Open Source Software and Processes %V 1 %P 24 - 45 %8 31/2009 %N 1 %R 10.4018/jossp.2009010102 %> https://flosshub.org/sites/flosshub.org/files/robles.pdf %0 Conference Paper %B Proceedings of the 2008 international workshop on Mining software repositories - MSR '08 %D 2008 %T Determinism and evolution %A González-Barahona, Jesús M. %A Gregorio Robles %A Herraiz, Israel %Y Hassan, Ahmed E. %Y Lanza, Michele %Y Godfrey, Michael W. %K changes %K evolution %K source code %K sourceforge %X It has been proposed that software evolution follows a Self-Organized Criticality (SOC) dynamics. This fact is supported by the presence of long range correlations in the time series of the number of changes made to the source code over time. Those long range correlations imply that the current state of the project was determined time ago. In other words, the evolution of the software project is governed by a sort of determinism. But this idea seems to contradict intuition. To explore this apparent contradiction, we have performed an empirical study on a sample of 3,821 libre (free, open source) software projects, finding that their evolution projects is short range correlated. This suggests that the dynamics of software evolution may not be SOC, and therefore that the past of a project does not determine its future except for relatively short periods of time, at least for libre software. %B Proceedings of the 2008 international workshop on Mining software repositories - MSR '08 %I ACM Press %C New York, New York, USA %P 1-9 %8 05/2008 %@ 9781605580241 %! MSR '08 %R 10.1145/1370750.1370752 %> https://flosshub.org/sites/flosshub.org/files/p1-herraiz.pdf %0 Book %B Emerging Free and Open Source Software Practices %D 2007 %T Emerging Free and Open Source Software PracticesVolunteers in Large Libre Software Projects %A Gregorio Robles %A González-Barahona, Jesús M. %A Martin Michlmayr %E Sowe, Sulayman K. %E Stamelos, Ioannis G. %E Samoladas, Ioannis %X Most libre (free, open source) software projects rely on the work of volunteers. Therefore, attracting people who contribute their time and technical skills is of paramount importance, both in technical and economic terms. This reliance on volunteers leads to some fundamental management challenges: Volunteer contributions are inherently difficult to predict, plan, and manage, especially in the case of large projects. In this chapter we present an analysis of the evolution over time of the human resources in large libre software projects, using the Debian project, one of the largest and most complex libre software projects based mainly in voluntary work, as a case study. We have performed a quantitative investigation of data corresponding to roughly seven years, studying how volunteer involvement has affected the software released by the project, and the developer community itself. %B Emerging Free and Open Source Software Practices %I IGI Global %P 1 - 24 %U http://www.igi-global.com/chapter/volunteers-large-libre-software-projects/10080 %R 10.4018/978-1-59904-210-7.ch001 %0 Conference Paper %B OSS2005: Open Source Systems %D 2005 %T Transfering Libre Software Development Practices to the Production of Educational Resources: the Edukalibre Project %A González-Barahona, Jesús M. %A Chris Tebb %A Vania Dimitrova %A Chaparro, Diego %A Romera, Teo %K educational resources %K information systems %K open source %K software development practices %X The transfer of methodologies common in libre (free, open source) sofware development to the domain of educational resources can radically change the way educational content is developed and used, enabling both educational practitioners and students to become actively involved in its creation and distribution. New software architectures and tools are needed to effectively support this process. This paper describes a platform aimed to support the creation of free, collaboratively constructed educational content on the web, which has been developed within the Edukalibre project. It provides easy access to core technologies: a control version system combined with conversion tools to produce several convenient formats for each document. Its modular architecture offers many different interfaces to the users. The Edukalibre platform is distributed as libre software. %B OSS2005: Open Source Systems %P 341-348 %U http://pascal.case.unibz.it/handle/2038/1548