An extension of fault-prone filtering using precise training and a dynamic threshold

Submitted by msquire on Wed, 2011-04-13 14:15

Title	An extension of fault-prone filtering using precise training and a dynamic threshold
Publication Type	Conference Paper
Year of Publication	2008
Authors	Hata, H, Mizuno, O, Kikuno, T
Secondary Title	Proceedings of the 2008 international working conference on Mining software repositories
Pagination	89–98
Publisher	ACM
Place Published	New York, NY, USA
ISBN Number	978-1-60558-024-1
Keywords	eclipse, fault-prone modules, spam filter, text mining
Abstract	Fault-prone module detection in source code is important for assurance of software quality. Most previous fault-prone detection approaches have been based on software metrics. Such approaches, however, have difficulties in collecting the metrics and in constructing mathematical models based on the metrics. To mitigate such difficulties, we have proposed a novel approach for detecting fault-prone modules using a spam-filtering technique, named Fault-Prone Filtering. In our approach, fault-prone modules are detected in such a way that the source code modules are considered as text files and are applied to the spam filter directly. In practice, we use the training only errors procedure and apply this procedure to fault-prone. Since no pre-training is required, this procedure can be applied to an actual development field immediately. This paper describes an extension of the training only errors procedures. We introduce a precise unit of training, "modified lines of code," instead of methods. In addition, we introduce the dynamic threshold for classification. The result of the experiment shows that our extension leads to twice the precision with about the same recall, and improves 15% on the best F1 measurement.
URL	http://doi.acm.org/10.1145/1370750.1370772
DOI	10.1145/1370750.1370772
Full Text