Tree-Based Integration of One-versus-Some (OVS) Classifiers for Multiclass Classification

Motivated by applications such as gene expression analysis, binary classification has achieved notable success. (e.g., cancer samples versus normal samples) When comes to multiclass classification, the extension is not straightforward. There has been two main directions on such extensions: 1) via a sequence of nested binary classifiers in a classification tree or 2) via classifier ensembles that integrate votes from all oneversus all (OVA) classifiers or all all-pairs (AP) classifiers. In this article, we present a new way to combine both strategies in a multiclass classification.

[1]  Thomas G. Dietterich,et al.  Error-Correcting Output Codes: A General Method for Improving Multiclass Inductive Learning Programs , 1991, AAAI.

[2]  Constantin F. Aliferis,et al.  A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis , 2004, Bioinform..

[3]  Trevor Hastie,et al.  Handwritten Digit Recognition via Deformable Prototypes , 1994 .

[4]  Tao Li,et al.  A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression , 2004, Bioinform..

[5]  Yi Lin Multicategory Support Vector Machines, Theory, and Application to the Classification of . . . , 2003 .

[6]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[7]  Dhammika Amaratunga,et al.  Exploration and Analysis of DNA Microarray and Protein Array Data , 2003, Wiley series in probability and statistics.

[8]  Sayan Mukherjee,et al.  Molecular classification of multiple tumor types , 2001, ISMB.

[9]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[10]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[11]  Jason Weston,et al.  Support vector machines for multi-class pattern recognition , 1999, ESANN.

[12]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[13]  Sharon L R Kardia,et al.  Accurate molecular classification of human cancers based on gene expression using a simple classifier with a pathological tree-based framework. , 2003, The American journal of pathology.

[14]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[15]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[16]  Kristin P. Bennett,et al.  Multicategory Classification by Support Vector Machines , 1999, Comput. Optim. Appl..

[17]  T. Poggio,et al.  Multiclass cancer diagnosis using tumor gene expression signatures , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Yoonkyung Lee,et al.  Classification of Multiple Cancer Types by Multicategory Support Vector Machines Using Gene Expression Data , 2003, Bioinform..