An ensemble classifier learning approach to ROC optimization

An ensemble learning framework is proposed to optimize the receiver operating characteristic (ROC) curve corresponding to a given classifier. The proposed ensemble maximal figure-of-merit (E-MFoM) learning framework meets four key requirements desirable for ROC optimization, namely: (1) each classifier in the ensemble can be learned with any specified performance metric for any given classifier design; (2) such a classifier is discriminative in nature and attempts to optimize a particular operating point on the ROC curve of the classifier; (3) an ensemble approximation to the overall behavior of the ROC curve can be established by sampling a set of operating points; and (4) ensemble decision rules can be formulated by grouping these sampled classifiers with a uniform scoring function. We evaluate the proposed framework using 3 testing databases, the Reuters and two UCI sets. Our experimental results clearly show that E-MFoM learning outperforms the state-of-the-art algorithms using Wilcoxon-Mann-Whitney rank statistics

[1]  Peter A. Flach,et al.  Learning Decision Trees Using the Area Under the ROC Curve , 2002, ICML.

[2]  Bhavani Raskutti,et al.  Optimising area under the ROC curve using gradient descent , 2004, ICML.

[3]  Mehryar Mohri,et al.  AUC Optimization vs. Error Rate Minimization , 2003, NIPS.

[4]  Peter A. Flach The Geometry of ROC Space: Understanding Machine Learning Metrics through ROC Isometrics , 2003, ICML.

[5]  Chin-Hui Lee,et al.  A MFoM learning approach to robust multiclass multi-label text categorization , 2004, ICML.

[6]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[7]  Eric Horvitz,et al.  On the Path to an Ideal ROC Curve: Considering Cost Asymmetry in Learning Classifiers , 2005, AISTATS.

[8]  Robert C. Holte,et al.  Explicitly representing expected cost: an alternative to ROC representation , 2000, KDD '00.

[9]  Thorsten Joachims,et al.  Learning to classify text using support vector machines - methods, theory and algorithms , 2002, The Kluwer international series in engineering and computer science.

[10]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[11]  Douglas A. Reynolds,et al.  The NIST speaker recognition evaluation - Overview, methodology, systems, results, perspective , 2000, Speech Commun..

[12]  Michael C. Mozer,et al.  Optimizing Classifier Performance via an Approximation to the Wilcoxon-Mann-Whitney Statistic , 2003, ICML.

[13]  Peter A. Flach,et al.  ROCCER: A ROC convex hull rule learning algorithm , 2004 .

[14]  Tat-Seng Chua,et al.  A maximal figure-of-merit learning approach to text categorization , 2003, SIGIR.