A Study on the Significance of Software Metrics in Defect Prediction

In the case of metrics-based software defect prediction, an intelligent selection of metrics plays an important role in improving the model performance. In this paper, we use different ways for feature selection and dimensionality reduction to determine the most important software metrics. Three different classifiers are utilized, namely Naïve Bayes, support vector machine and decision tree. On the publicly NASA data, a comparative experiment results show that instead of 22 or more metrics, less than 10 metrics can get better performance.

[1]  Taghi M. Khoshgoftaar,et al.  An empirical investigation of filter attribute selection techniques for software quality classification , 2009, 2009 IEEE International Conference on Information Reuse & Integration.

[2]  Marian Jureczko,et al.  Significance of Different Software Metrics in Defect Prediction , 2011 .

[3]  Taghi M. Khoshgoftaar,et al.  A Comparative Study of Threshold-Based Feature Selection Techniques , 2010, 2010 IEEE International Conference on Granular Computing.

[4]  Sunghun Kim,et al.  Reducing Features to Improve Bug Prediction , 2009, 2009 IEEE/ACM International Conference on Automated Software Engineering.

[5]  S. Nickolas,et al.  Feature Selection Using Decision Tree Induction in Class level Metrics Dataset for Software Defect Predictions , 2010 .