Mining sequences of changed-files from version histories

TitleMining sequences of changed-files from version histories
Publication TypeConference Paper
Year of Publication2006
AuthorsKagdi, H, Yusuf, S, Maletic, JI
Secondary TitleProceedings of the 2006 international workshop on Mining software repositories
Place PublishedNew York, NY, USA
ISBN Number1-59593-397-2
Keywordschange, change history, change management, change sequences, heuristics, kde, mining software repositories, scm, sequences, source code

Modern source-control systems, such as Subversion, preserve change-sets of files as atomic commits. However, the specific ordering information in which files were changed is typically not found in these source-code repositories. In this paper, a set of heuristics for grouping change-sets (i.e., log-entries) found in source-code repositories is presented. Given such groups of change-sets, sequences of files that frequently change together are uncovered. This approach not only gives the (unordered) sets of files but supplements them with (partial temporal) ordering information. The technique is demonstrated on a subset of KDE source-code repository. The results show that the approach is able to find sequences of changed-files.

Full Text
PDF icon 47MiningSequences.pdf310.2 KB