Comparison of non-parametric methods for assessing classifier performance in terms of ROC parameters

The most common metric for assessing a classifier's performance is the classification error rate, i.e., the probability of misclassification (PMC). Receiver operating characteristic (ROC) analysis is a more general way to measure performance. Common summary metrics for the ROC curve are the two normal-deviate-axes parameters, a and b, and the area under the curve (AUC). The parameters a and b are, respectively, the intercept and slope of the ROC curve when it is plotted on normal-deviate axes. The AUC is the average true-positive fraction (TPF) of the classifier, taken over the false-positive fraction (FPF) as the decision threshold varies. In the present work, we used Monte-Carlo simulations to compare different bootstrap-based estimators of the AUC, namely the leave-one-out, .632, and .632+ bootstraps. The results show that the estimators perform comparably in terms of RMS error, while the .632+ bootstrap is the least biased.
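
For reference, these summary indices take a simple closed form under the conventional binormal model (invoked here only for concreteness; the estimators compared in this work are non-parametric). On normal-deviate axes the ROC curve is the straight line

\Phi^{-1}(\mathrm{TPF}) = a + b\,\Phi^{-1}(\mathrm{FPF}),

and the area under the curve is

\mathrm{AUC} = \int_0^1 \mathrm{TPF}\; d(\mathrm{FPF}) = \Phi\!\left(\frac{a}{\sqrt{1 + b^2}}\right),

where \Phi denotes the standard normal cumulative distribution function and the last equality holds under the binormal assumption. The integral makes precise the statement that the AUC is the TPF averaged over the FPF.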
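
The following is a minimal sketch of how such bootstrap estimators of the AUC can be computed, assuming Python/NumPy, a toy nearest-mean classifier, and synthetic Gaussian data; all names, dimensions, and sample sizes are illustrative and do not reproduce the paper's experimental setup. The AUC itself is estimated non-parametrically via the Mann-Whitney statistic, and the .632+ weight adjustment is only indicated in a comment.

import numpy as np

rng = np.random.default_rng(0)

def auc_mw(s0, s1):
    # Non-parametric AUC: the Mann-Whitney statistic, i.e. the probability
    # that a positive case scores above a negative one (ties count 1/2).
    s0 = np.asarray(s0)[:, None]
    s1 = np.asarray(s1)[None, :]
    return float(np.mean((s1 > s0) + 0.5 * (s1 == s0)))

def scores(X0_tr, X1_tr, X_te):
    # Toy trainable classifier: project test cases onto the difference of
    # the class means estimated from the training data (nearest-mean rule).
    w = X1_tr.mean(axis=0) - X0_tr.mean(axis=0)
    return X_te @ w

def bootstrap_auc(X0, X1, n_boot=200):
    n0, n1 = len(X0), len(X1)
    # Apparent AUC: train and test on the same full data set; optimistic.
    app = auc_mw(scores(X0, X1, X0), scores(X0, X1, X1))
    loob = []
    for _ in range(n_boot):
        i0 = rng.integers(0, n0, n0)          # bootstrap sample, negatives
        i1 = rng.integers(0, n1, n1)          # bootstrap sample, positives
        o0 = np.setdiff1d(np.arange(n0), i0)  # left-out negatives
        o1 = np.setdiff1d(np.arange(n1), i1)  # left-out positives
        if o0.size and o1.size:
            # Leave-one-out bootstrap: train on the bootstrap sample, test
            # only on cases that did not appear in it; pessimistic.
            loob.append(auc_mw(scores(X0[i0], X1[i1], X0[o0]),
                               scores(X0[i0], X1[i1], X1[o1])))
    loob = float(np.mean(loob))
    est632 = 0.368 * app + 0.632 * loob       # Efron's .632 combination
    # The .632+ estimator would replace the fixed 0.632 weight with one
    # driven by the relative overfitting rate (Efron and Tibshirani, 1997);
    # that adjustment is omitted from this sketch.
    return app, loob, est632

# One Monte-Carlo trial: two 4-D Gaussian classes separated by a mean shift.
X0 = rng.normal(0.0, 1.0, size=(20, 4))
X1 = rng.normal(0.5, 1.0, size=(20, 4))
print(bootstrap_auc(X0, X1))

The .632 rule blends the two estimates because each bootstrap training set contains about 63.2% of the distinct cases, so the leave-one-out bootstrap is pessimistically biased while the apparent AUC is optimistically biased; a Monte-Carlo study repeats trials like the one above over many drawn data sets to measure the bias and RMS error of each estimator.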