论文信息 - AUC Optimization vs. Error Rate Minimization

AUC Optimization vs. Error Rate Minimization

The area under an ROC curve (AUC) is a criterion used in many applications to measure the quality of a classification algorithm. However, the objective function optimized in most of these algorithms is the error rate and not the AUC value. We give a detailed statistical analysis of the relationship between the AUC and the error rate, including the first exact expression of the expected value and the variance of the AUC for a fixed error rate. Our results show that the average AUC is monotonically increasing as a function of the classification accuracy, but that the standard deviation for uneven distributions and higher error rates is noticeable. Thus, algorithms designed to minimize the error rate may not lead to the best possible AUC values. We show that, under certain conditions, the global function optimized by the RankBoost algorithm is exactly the AUC. We report the results of our experiments with RankBoost in several datasets demonstrating the benefits of an algorithm specifically designed to globally optimize the AUC over other existing algorithms optimizing an approximation of the AUC or only locally optimizing the AUC.

Mehryar Mohri | Corinna Cortes | M. Mohri | Corinna Cortes

[1] D. M. Green,et al. Signal detection theory and psychophysics , 1966 .

[2] James P. Egan,et al. Signal detection theory and ROC analysis , 1975 .

[3] J. Hanley,et al. The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[4] Yoav Freund,et al. A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[5] Tom Fawcett,et al. Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions , 1997, KDD.

[6] Yoav Freund,et al. A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[7] Yoram Singer,et al. An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[8] Gregory Piatetsky-Shapiro,et al. Measuring lift quality in database marketing , 2000, SKDD.

[9] Thomas de Quincey. [C] , 2000, The Works of Thomas De Quincey, Vol. 1: Writings, 1799–1820.

[10] J. Chauchat,et al. Targeting Customer Groups using Gain and Cost Matrix : a Marketing Application , 2001 .

[11] Yizhak Idan,et al. Evaluation of prediction models for marketing campaigns , 2001, KDD '01.

[12] Michael C. Mozer,et al. Prodding the ROC Curve: Constrained Optimization of Classifier Performance , 2001, NIPS.

[13] Peter A. Flach,et al. Learning Decision Trees Using the Area Under the ROC Curve , 2002, ICML.

[14] Michael C. Mozer,et al. Optimizing Classifier Performance Via the Wilcoxon-Mann-Whitney Statistic , 2003, ICML 2003.

[15] Jeffrey S. Simonoff,et al. Tree Induction Vs Logistic Regression: A Learning Curve Analysis , 2001, J. Mach. Learn. Res..

[16] A. Zients. Andy , 2003 .