Mining Eclipse Developer Contributions via Author-Topic Models

TitleMining Eclipse Developer Contributions via Author-Topic Models
Publication TypeConference Paper
Year of Publication2007
AuthorsLinstead, E, Rigor, P, Bajracharya, S, Lopes, C, Baldi, P
Secondary TitleFourth International Workshop on Mining Software Repositories (MSR'07:ICSE Workshops 2007)
Pagination30 - 30
Place PublishedMinneapolis, MN, USA
ISBN Number0-7695-2950-X
Keywordscontributions, developers, eclipse, expertise, mining challenge, msr challenge, source code, topics

We present the results of applying statistical author-topic models to a subset of the Eclipse 3.0 source code consisting of 2,119 source files and 700,000 lines of code from 59 developers. This technique provides an intuitive and automated framework with which to mine developer contributions and competencies from a given code base while simultaneously extracting software function in the form of topics. In addition to serving as a convenient summary for program function and developer activities, our study shows that topic models provide a meaningful, effective, and statistical basis for developer similarity analysis.

Full Text
PDF icon 28300030.pdf101.31 KB