%0 Conference Paper %B Proceedings of the 8th working conference on Mining software repositories - MSR '11 %D 2011 %T A study of language usage evolution in open source software %A Gall, Harald %A Karus, Siim %Y van Deursen, Arie %Y Xie, Tao %Y Zimmermann, Thomas %X The use of programming languages such as Java and C in Open Source Software (OSS) has been well studied. However, many other popular languages such as XSL or XML have received minor attention. In this paper, we discuss some trends in OSS development that we observed when considering multiple programming language evolution of OSS. Based on the revision data of 22 OSS projects, we tracked the evolution of language usage and other artefacts such as documentation files, binaries and graphics files. In these systems several different languages and artefact types including C/C++, Java, XML, XSL, Makefile, Groovy, HTML, Shell scripts, CSS, Graphics files, JavaScript, JSP, Ruby, Phyton, XQuery, OpenDocument files, PHP, etc. have been used. We found that the amount of code written in different languages differs substantially. Some of our findings can be summarized as follows: (1) JavaScript and CSS files most often co-evolve with XSL; (2) Most Java developers but only every second C/C++ developer work with XML; (3) and more generally, we observed a significant increase of usage of XML and XSL during recent years and found that Java or C are hardly ever the only language used by a developer. In fact, a developer works with more than 5 different artefact types (or 4 different languages) in a project on average. %B Proceedings of the 8th working conference on Mining software repositories - MSR '11 %I ACM Press %C Waikiki, Honolulu, HI, USA %P 13-22 %8 05/2011 %@ 9781450305747 %! MSR '11 %R 10.1145/1985441.1985447 %0 Conference Paper %B Proceedings of the 17th International Symposium on Software Reliability Engineering %D 2009 %T Putting it All Together: Using Socio-Technical Networks to Predict Failures %A Christian Bird %A Nachiappan Nagappan %A Devanbu, Premkumar %A Gall, Harald %A Brendan Murphy %K eclipse %K microsoft %K social network %K vista %K windows %X Studies have shown that social factors in development organizations have a dramatic effect on software quality. Separately, program dependency information has also been used successfully to predict which software components are more fault prone. Interestingly, the influence of these two phenomena have only been studied separately. Intuition and practical experience suggests, however, that task assignment (i.e. who worked on which components and how much) and dependency structure (which components have dependencies on others) together interact to influence the quality of the resulting software. We study the influence of combined socio-technical software networks on the fault-proneness of individual software components within a system. The network properties of a software component in this combined network are able to predict if an entity is failure prone with greater accuracy than prior methods which use dependency or contribution information in isolation. We evaluate our approach in different settings by using it on Windows Vista and across six releases of the Eclipse development environment including using models built from one release to predict failure prone components in the next release. We compare this to previous work. In every case, our method performs as well or better and is able to more accurately identify those software components that have more post-release failures, with precision and recall rates as high as 85%. %B Proceedings of the 17th International Symposium on Software Reliability Engineering %> https://flosshub.org/sites/flosshub.org/files/bird2009pat.pdf %0 Conference Paper %B Proceedings of the 2005 international workshop on Mining software repositories %D 2005 %T Mining evolution data of a product family %A Fischer, Michael %A Oberleitner, Johann %A Ratzinger, Jacek %A Gall, Harald %K bsd %K change analysis %K change history %K cvs %K evolution %K freebsd %K netbsd %K openbsd %K release history %K source code %K text mining %X Diversification of software assets through changing requirements impose a constant challenge on the developers and maintainers of large software systems. Recent research has addressed the mining for data in software repositories of single products ranging from fine- to coarse grained analyses. But so far, little attention has been payed to mining data about the evolution of product families. In this work, we study the evolution and commonalities of three variants of the BSD (Berkeley Software Distribution), a large open source operating system. The research questions we tackle are concerned with how to generate high level views of the system discovering and indicating evolutionary highlights. To process the large amount of data, we extended our previously developed approach for storing release history information to support the analysis of product families. In a case study we apply our approach on data from three different code repositories representing about 8.5GB of data and 10 years of active development. %B Proceedings of the 2005 international workshop on Mining software repositories %S MSR '05 %I ACM %C New York, NY, USA %P 12-16 %@ 1-59593-123-6 %U http://doi.acm.org/10.1145/1082983.1083145 %R http://doi.acm.org/10.1145/1082983.1083145 %> https://flosshub.org/sites/flosshub.org/files/12MiningEvolution.pdf