Process Mining Software Repositories

Title: Process Mining Software Repositories
Publication Type: Conference Paper
Year of Publication: 2011
Authors: Poncin, W, Serebrenik, A, van den Brand, M
Secondary Title: 15th European Conference on Software Maintenance and Reengineering (CSMR 2011)
Pagination: 5 - 14
Place Published: Oldenburg, Germany
ISBN Number: 978-1-61284-259-2
Keywords: amsn, email, email archives, gcc, mailing list, Process mining, software repositories

Software developers’ activities are in general recorded in software repositories such as version control systems, bug trackers and mail archives. While abundant information is usually present in such repositories, successful information extraction is often challenged by the necessity to simultaneously analyze different repositories and to combine the information obtained.

We propose to apply process mining techniques, originally developed for business process analysis, to address this challenge. However, in order for process mining to become applicable, different software repositories should be combined, and “related” software development events should be matched: e.g., mails sent about a file, modifications of the file and bug reports that can be traced back to it.
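
The matching step described above can be illustrated with a minimal sketch. It is not FRASR's implementation: the `Event` record, the `build_traces` helper, the sample data, and the substring-based matching rule are all assumptions made for illustration. The idea shown is only that events from several repositories are grouped into one time-ordered trace per file, which is the kind of event log a process mining tool takes as input.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Event:
    source: str          # hypothetical labels: "vcs", "bug", or "mail"
    activity: str        # what happened, e.g. "commit"
    actor: str           # who did it
    timestamp: datetime  # when it happened
    text: str            # path or free text in which a file name may occur


def build_traces(events, file_names):
    """Group events into one time-ordered trace per file (the case)."""
    traces = defaultdict(list)
    for event in events:
        for name in file_names:
            # Naive matching rule (an assumption): the file name occurs in the text.
            if name in event.text:
                traces[name].append(event)
    for trace in traces.values():
        trace.sort(key=lambda e: e.timestamp)
    return dict(traces)


# Invented sample events from three kinds of repositories.
events = [
    Event("bug", "bug reported", "carol", datetime(2010, 3, 5, 9, 0),
          "Crash in parser.c on empty input"),
    Event("mail", "mail sent", "alice", datetime(2010, 3, 1, 10, 0),
          "Question about parser.c"),
    Event("vcs", "commit", "bob", datetime(2010, 3, 6, 14, 30), "src/parser.c"),
    Event("vcs", "commit", "bob", datetime(2010, 3, 2, 8, 0), "src/lexer.c"),
]

traces = build_traces(events, ["parser.c", "lexer.c"])
for name, trace in traces.items():
    print(name, [(e.source, e.activity) for e in trace])
```

Substring matching is the simplest possible rule and would over-match in practice (for example, `parser.c` also occurs in `myparser.c`); a real matcher would need stricter rules for what counts as "related".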

The combination and matching of events has been implemented in FRASR (FRamework for Analyzing Software Repositories), augmenting the process mining framework ProM. FRASR has been successfully applied in a series of case studies addressing such aspects of the development process as roles of different developers and the way bug reports are handled.

Full Text: 2011-03_CSMR.pdf (794.15 KB)