Further thoughts on precision

Background: There has been much discussion amongst automated software defect prediction researchers regarding the use of the precision and false positive rate classifier performance metrics.
Aim: To demonstrate and explain why failing to report precision when using data with highly imbalanced class distributions may provide an overly optimistic view of classifier performance.
Method: Well-documented examples of how the class distribution affects the suitability of these performance measures.
Conclusions: When using data where the minority class represents less than around 5 to 10 percent of data points in total, failing to report precision may be a critical mistake. Furthermore, deriving the precision values omitted from studies can reveal valuable insight into true classifier performance.
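
The derivation alluded to in the conclusions follows directly from the definitions of recall and false positive rate. As a minimal sketch, with hypothetical numbers not taken from the paper, the Python snippet below recovers precision from a reported recall, false positive rate, and class distribution, and shows how it collapses as the defective class becomes rarer.

def precision_from_rates(recall, fpr, positive_fraction):
    """Derive precision from recall (pd), false positive rate (pf),
    and the fraction of data points in the positive (defective) class.

    precision = TP / (TP + FP)
              = recall * p / (recall * p + fpr * (1 - p))
    where p is the proportion of positive instances.
    """
    p = positive_fraction
    tp = recall * p        # true positives per data point
    fp = fpr * (1.0 - p)   # false positives per data point
    return tp / (tp + fp) if (tp + fp) > 0 else float("nan")

# Hypothetical example: recall = 0.7 and false positive rate = 0.2
# look reasonable in isolation, but precision depends on how rare
# the defective class is.
for p in (0.50, 0.10, 0.05):
    print(f"positive fraction {p:.2f}: precision = "
          f"{precision_from_rates(0.7, 0.2, p):.2f}")

With these assumed values, precision falls from roughly 0.78 at a balanced class distribution to about 0.28 at 10 percent defective modules and about 0.16 at 5 percent, which is the optimism the abstract warns against when precision goes unreported.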
