A Novel Support Vector Machine with Class-dependent Features for Biomedical Data

In this paper we propose a novel support vector machine (SVM) with class-dependent features. According to an importance measure, e.g., the RELIEF weight measure or class separability measure, we rank the features importance for each class against the rest of classes. For each class we select an optimal feature subset using a classifier, e.g., the support vector machine (SVM). For the classification on these class-dependent feature subsets, we propose to construct a novel SVM using "one-against-all" in 2 processes: (1) construct one model for each class by training the classifier with the class's optimal feature subset; (2) during testing, each test pattern is tested on all models and the model with the maximum output decides the class of the test pattern. The method's performance is evaluated on two benchmark datasets. Our results indicate that our novel SVM classifier can effectively realize the classification of class-dependent feature subsets found by our wrapper approach which can remove irrelevant features for each class and at the same time maintain or even improve the classification accuracy in comparison with other feature selection methods.

[1]  W. Malina,et al.  Two-parameter Fisher criterion , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[2]  Pat Langley,et al.  Induction of Selective Bayesian Classifiers , 1994, UAI.

[3]  Il-Seok Oh,et al.  Using class separation for feature analysis and combination of class-dependent features , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[4]  James A. Casbon,et al.  Protein Secondary Structure Prediction with Support Vector Machines , 2002 .

[5]  Walker H. Land,et al.  Applying support vector machines to breast cancer diagnosis using screen film mammogram data , 2004, Proceedings. 17th IEEE Symposium on Computer-Based Medical Systems.

[6]  Isabelle Guyon,et al.  Comparison of classifier methods: a case study in handwritten digit recognition , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[7]  S. Griffis EDITOR , 1997, Journal of Navigation.

[8]  Naftali Tishby,et al.  Margin based feature selection - theory and algorithms , 2004, ICML.

[9]  Constantin F. Aliferis,et al.  A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis , 2004, Bioinform..

[10]  Lipo Wang,et al.  Data Mining With Computational Intelligence , 2006, IEEE Transactions on Neural Networks.

[11]  Lipo Wang,et al.  A GA-based RBF classifier with class-dependent features , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[12]  Jesús S. Aguilar-Ruiz,et al.  SOAP: Efficient Feature Selection of Numeric Attributes , 2002, IBERAMIA.

[13]  Thomas Lengauer,et al.  BMC Bioinformatics Methodology article Local protein structure prediction using discriminative models , 2006 .

[14]  Shyang Chang,et al.  An adaptive learning algorithm for principal component analysis , 1995, IEEE Trans. Neural Networks.

[15]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[16]  D. Haussler,et al.  Knowledge-based analysis of microarray gene expression , 2000 .

[17]  S. Hua,et al.  A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach. , 2001, Journal of molecular biology.

[18]  George H. John Enhancements to the data mining process , 1997 .

[19]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[20]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[21]  Ching Y. Suen,et al.  Analysis of Class Separation and Combination of Class-Dependent Features for Handwriting Recognition , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[23]  Lipo Wang,et al.  Data dimensionality reduction with application to simplifying RBF network structure and improving classification performance , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[24]  Ulrich H.-G. Kreßel,et al.  Pairwise classification and support vector machines , 1999 .

[25]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[26]  Abraham Kandel,et al.  Information-theoretic algorithm for feature selection , 2001, Pattern Recognit. Lett..

[27]  M. W. Foster,et al.  Integrating ethics and science in the International HapMap Project , 2004, Nature Reviews Genetics.