Structure-Embedded AUC-SVM

AUC-SVM directly maximizes the area under the ROC curve (AUC) through minimizing its hinge loss relaxation, and the decision function is determined by those support vector sample pairs playing the same roles as the support vector samples in SVM. Such a learning paradigm generally emphasizes more on the local discriminative information just associated with these support vectors whereas hardly takes the overall view of data into account, thereby it may incur loss of the global distribution information in data favorable for classification. Moreover, due to the high computational complexity of AUC-SVM induced by the large number of training sample pairs quadratic in the number of samples, sampling is usually adopted, incurring a further loss of the distribution information in data. In order to compensate the distribution information loss and simultaneously boost the AUC-SVM performance, in this paper, we develop a novel structure-embedded AUC-SVM (SAUC-SVM for short) through embedding the global structure information in the whole data into AUC-SVM. With such an embedding, the proposed SAUC-SVM incorporates the local discriminative information and global structure information in data into a uniform formulation and consequently guarantees better generalization performance. Comparative experiments on both synthetic and real datasets confirm its effectiveness.

[1]  Michael C. Mozer,et al.  Optimizing Classifier Performance Via the Wilcoxon-Mann-Whitney Statistic , 2003, ICML 2003.

[2]  Kai Zhang,et al.  Support vector machine networks for multi-class classification , 2005, Int. J. Pattern Recognit. Artif. Intell..

[3]  Michael C. Mozer,et al.  Optimizing Classifier Performance via an Approximation to the Wilcoxon-Mann-Whitney Statistic , 2003, ICML.

[4]  Qiang Yang,et al.  Structural Support Vector Machine , 2008, ISNN.

[5]  Alain Rakotomamonjy,et al.  Optimizing Area Under Roc Curve with SVMs , 2004, ROCAI.

[6]  Sungzoon Cho,et al.  Neighborhood Property-Based Pattern Selection for Support Vector Machines , 2007, Neural Comput..

[7]  Qiang Yang,et al.  Discriminatively regularized least-squares classification , 2009, Pattern Recognit..

[8]  Bhavani Raskutti,et al.  Optimising area under the ROC curve using gradient descent , 2004, ICML.

[9]  Mehryar Mohri,et al.  AUC Optimization vs. Error Rate Minimization , 2003, NIPS.

[10]  Michael R. Lyu,et al.  Learning large margin classifiers locally and globally , 2004, ICML.

[11]  Sargur N. Srihari,et al.  Comparison of ROC and Likelihood Decision Methods in Automatic Fingerprint Verification , 2008, Int. J. Pattern Recognit. Artif. Intell..

[12]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[13]  William Nick Street,et al.  Learning to Rank by Maximizing AUC with Linear Programming , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[14]  Stephen P. Boyd,et al.  A minimax theorem with applications to machine learning, signal processing, and finance , 2007, 2007 46th IEEE Conference on Decision and Control.

[15]  Cor J. Veenman,et al.  Turning the hyperparameter of an AUC-optimized classifier , 2005, BNAIC.

[16]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[17]  Michael R. Lyu,et al.  Local Learning vs. Global Learning: An Introduction to Maxi-Min Margin Machine , 2005 .

[18]  Ramani Duraiswami,et al.  A Fast Algorithm for Learning a Ranking Function from Large-Scale Data Sets , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[20]  P. Gallinari,et al.  A Data-dependent Generalisation Error Bound for the AUC , 2005 .

[21]  Charles X. Ling,et al.  AUC: A Better Measure than Accuracy in Comparing Learning Algorithms , 2003, Canadian Conference on AI.

[22]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[23]  Lipo Wang Support vector machines : theory and applications , 2005 .

[24]  Michael R. Lyu,et al.  Maxi–Min Margin Machine: Learning Large Margin Classifiers Locally and Globally , 2008, IEEE Transactions on Neural Networks.