%0 Conference Paper %B Proceedings of the 11th Working Conference on Mining Software Repositories %D 2014 %T Analysing the 'Biodiversity' of Open Source Ecosystems: The GitHub Case %A Matragkas, Nicholas %A Williams, James R. %A Kolovos, Dimitris S. %A Paige, Richard F. %K Data and knowledge visualization %K data mining %K mining challenge %K msr challenge %X In nature the diversity of species and genes in ecological communities affects the functioning of these communities. Biologists have found out that more diverse communities appear to be more productive than less diverse communities. Moreover such communities appear to be more stable in the face of perturbations. In this paper, we draw the analogy between ecological communities and Open Source Software (OSS) ecosystems, and we investigate the diversity and structure of OSS communities. To address this question we use the MSR 2014 challenge dataset, which includes data from the top-10 software projects for the top programming languages on GitHub. Our findings show that OSS communities on GitHub consist of 3 types of users (core developers, active users, passive users). Moreover, we show that the percentage of core developers and active users does not change as the project grows and that the majority of members of large projects are passive users. %B Proceedings of the 11th Working Conference on Mining Software Repositories %S MSR 2014 %I ACM %C New York, NY, USA %P 356–359 %@ 978-1-4503-2863-0 %U http://doi.acm.org/10.1145/2597073.2597119 %R 10.1145/2597073.2597119 %0 Conference Paper %B Proceedings of the 11th Working Conference on Mining Software Repositories %D 2014 %T Models of OSS Project Meta-information: A Dataset of Three Forges %A Williams, James R. %A Di Ruscio, Davide %A Matragkas, Nicholas %A Di Rocco, Juri %A Kolovos, Dimitris S. %K data mining %K flossmole cited %X The process of selecting open-source software (OSS) for adoption is not straightforward as it involves exploring various sources of information to determine the quality, maturity, activity, and user support of each project. In the context of the OSSMETER project, we have developed a forge-agnostic metamodel that captures the meta-information common to all OSS projects. We specialise this metamodel for popular OSS forges in order to capture forge-specific meta-information. In this paper we present a dataset conforming to these metamodels for over 500,000 OSS projects hosted on three popular OSS forges: Eclipse, SourceForge, and GitHub. The dataset enables different kinds of automatic analysis and supports objective comparisons of cross-forge OSS alternatives with respect to a user's needs and quality requirements. %B Proceedings of the 11th Working Conference on Mining Software Repositories %S MSR 2014 %I ACM %C New York, NY, USA %P 408–411 %@ 978-1-4503-2863-0 %U http://doi.acm.org/10.1145/2597073.2597132 %R 10.1145/2597073.2597132 %> https://flosshub.org/sites/flosshub.org/files/Models_of_OSS_Project_Meta-Information_A_Dataset_of_Three_Forges_draft.pdf