In-sample Model Selection for Trimmed Hinge Loss Support Vector Machine

In this letter, we address the problem of model selection for support vector classifiers through in-sample methods, which are particularly appealing in the small-sample regime. In particular, we describe how a trimmed hinge loss function can be applied within the Rademacher complexity- and maximal-discrepancy-based in-sample approaches, and we show that the selected classifiers outperform those obtained with other in-sample model selection techniques, which exploit a soft loss function, in classifying microarray data.
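To make the central ingredient concrete, the following is a minimal sketch of one common formulation of a trimmed (ramp) hinge loss: the standard hinge loss clipped at a fixed level so that gross outliers contribute a bounded penalty. The clipping level `cap` and the function names are illustrative choices, not taken from the letter itself.

```python
import numpy as np

def hinge(margins):
    # Standard hinge loss: max(0, 1 - y*f(x)), given margins y*f(x).
    return np.maximum(0.0, 1.0 - margins)

def trimmed_hinge(margins, cap=2.0):
    # Trimmed (ramp) hinge: the hinge loss, but clipped at `cap`,
    # so badly misclassified points cannot dominate the empirical risk.
    return np.minimum(hinge(margins), cap)

margins = np.array([2.0, 0.5, -5.0])
print(hinge(margins))          # [0.  0.5 6. ]
print(trimmed_hinge(margins))  # [0.  0.5 2. ]
```

Because the trimmed loss is bounded, the complexity terms (Rademacher or maximal discrepancy) computed from it are bounded as well, which is what makes it usable inside in-sample generalization bounds.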
