An extension of fault-prone filtering using precise training and a dynamic threshold

TitleAn extension of fault-prone filtering using precise training and a dynamic threshold
Publication TypeConference Paper
Year of Publication2008
AuthorsHata, H, Mizuno, O, Kikuno, T
Secondary TitleProceedings of the 2008 international working conference on Mining software repositories
Place PublishedNew York, NY, USA
ISBN Number978-1-60558-024-1
Keywordseclipse, fault-prone modules, spam filter, text mining

Fault-prone module detection in source code is important for assurance of software quality. Most previous fault-prone detection approaches have been based on software metrics. Such approaches, however, have difficulties in collecting the metrics and in constructing mathematical models based on the metrics. To mitigate such difficulties, we have proposed a novel approach for detecting fault-prone modules using a spam-filtering technique, named Fault-Prone Filtering. In our approach, fault-prone modules are detected in such a way that the source code modules are considered as text files and are applied to the spam filter directly. In practice, we use the training only errors procedure and apply this procedure to fault-prone. Since no pre-training is required, this procedure can be applied to an actual development field immediately. This paper describes an extension of the training only errors procedures. We introduce a precise unit of training, "modified lines of code," instead of methods. In addition, we introduce the dynamic threshold for classification. The result of the experiment shows that our extension leads to twice the precision with about the same recall, and improves 15% on the best F1 measurement.

Full Text
PDF icon p89-hata.pdf1.35 MB