An Implementation of Just-in-Time Fault-Prone Prediction Technique Using Text Classifier

Since the fault prediction is an important technique to help allocating software maintenance effort, much research on fault prediction has been proposed so far. The goal of these studies is applying their prediction technique to actual software development. In this paper, we implemented a prototype fault-prone module prediction tool using a text-filtering based technique named "Fault-Prone Filtering". Our tool aims to show the result of fault prediction for each change (i.e., Commits) as a probability that a source code file to be faulty. The result is shown on a Web page and easy to track the histories of prediction. A case study performed on three open source projects shows that our tool could detect 90 percent of the actual fault modules (i.e., The recall of 0.9) with the accuracy of 0.67 and the precision of 0.63 on average.

[1]  Osamu Mizuno,et al.  Training on errors experiment to detect fault-prone software modules by spam filter , 2007, ESEC-FSE '07.

[2]  Audris Mockus,et al.  A large-scale empirical study of just-in-time quality assurance , 2013, IEEE Transactions on Software Engineering.

[3]  Tim Menzies,et al.  Data Mining Static Code Attributes to Learn Defect Predictors , 2007, IEEE Transactions on Software Engineering.

[4]  Pierfrancesco Bellini,et al.  Comparing fault-proneness estimation models , 2005, 10th IEEE International Conference on Engineering of Complex Computer Systems (ICECCS'05).

[5]  Taghi M. Khoshgoftaar,et al.  Analyzing software quality with limited fault-proneness defect data , 2005, Ninth IEEE International Symposium on High-Assurance Systems Engineering (HASE'05).

[6]  Andreas Zeller,et al.  When do changes induce fixes? , 2005, ACM SIGSOFT Softw. Eng. Notes.