Mining Open Source Software (OSS) Data Using Association Rules Network
|Title||Mining Open Source Software (OSS) Data Using Association Rules Network|
|Publication Type||Conference Paper|
|Year of Publication||2003|
|Authors||Chawla, Sanjay, Bavani Arunasalam, and Joseph G. Davis|
|Secondary Title||Lecture Notes in Computer Science|
|Keywords||arn, association rules, factor analysis, project success, sourceforge, svd|
The Open Source Software (OSS) movement has attracted considerable attention in the last few years. In this paper we report the results of mining data acquired from SourceForge.net, the largest open source software hosting website. In the process we introduce the Association Rules Network (ARN), a (hyper)graphical model for representing a special class of association rules. Using ARNs, we discover important relationships between the attributes of successful OSS projects. We verify and validate these relationships using Factor Analysis, a classical statistical technique related to Singular Value Decomposition (SVD).
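To make the idea of a rule network concrete, the following is a minimal sketch and not the algorithm from the paper. It assumes an ARN can be approximated by chaining single-consequent association rules backward from a goal attribute, treating each rule as a hyperedge from its antecedent set to its consequent and dropping rules that would point back toward the goal. The project attributes, the thresholds, and the function names `mine_rules` and `build_network` are all hypothetical, and rules are mined by brute-force enumeration for brevity.

```python
from collections import deque
from itertools import combinations

# Hypothetical toy data: each set lists attributes observed for one project.
# These are NOT values from the SourceForge.net data set.
PROJECTS = [
    {"many_developers", "active_forum", "frequent_releases", "successful"},
    {"many_developers", "active_forum", "frequent_releases", "successful"},
    {"many_developers", "active_forum", "successful"},
    {"active_forum", "frequent_releases", "successful"},
    {"many_developers"},
    {"frequent_releases"},
]


def count(itemset, transactions):
    """Number of transactions that contain every item in itemset."""
    return sum(itemset <= t for t in transactions)


def mine_rules(transactions, min_support=0.3, min_confidence=0.8, max_antecedent=2):
    """Brute-force search for rules (antecedent set -> single consequent)."""
    items = sorted(set().union(*transactions))
    n = len(transactions)
    rules = []
    for size in range(1, max_antecedent + 1):
        for antecedent in combinations(items, size):
            body = frozenset(antecedent)
            body_count = count(body, transactions)
            if body_count == 0:
                continue
            for head in items:
                if head in body:
                    continue
                rule_count = count(body | {head}, transactions)
                support = rule_count / n
                confidence = rule_count / body_count
                if support >= min_support and confidence >= min_confidence:
                    rules.append((body, head))
    return rules


def build_network(rules, goal):
    """Chain rules backward from goal; each kept rule is one hyperedge.

    Assumption: a rule is kept only if every antecedent item is new or sits
    strictly farther from the goal than the rule's consequent, so every
    hyperedge points toward the goal and the network has no cycles.
    """
    level = {goal: 0}  # distance of each node from the goal
    edges = []
    queue = deque([goal])
    while queue:
        target = queue.popleft()
        for body, head in rules:
            if head != target:
                continue
            if any(item in level and level[item] <= level[target] for item in body):
                continue  # would point back toward the goal
            edges.append((body, head))
            for item in body:
                if item not in level:
                    level[item] = level[target] + 1
                    queue.append(item)
    return edges, level


if __name__ == "__main__":
    network, levels = build_network(mine_rules(PROJECTS), "successful")
    for body, head in network:
        print(sorted(body), "->", head)
```

Restricting rules to a single consequent and fixing a goal attribute is what lets the rule set be read as a directed hypergraph flowing toward that goal; any real use would replace the brute-force miner with a standard frequent-itemset algorithm.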