Applying 1-norm SVM with squared loss to gene selection for cancer classification

Gene selection methods available have high computational complexity. This paper applies an 1-norm support vector machine with the squared loss (1-norm SVMSL) to implement fast gene selection for cancer classification. The 1-norm SVMSL, a variant of the 1-norm support vector machine (1-norm SVM) has been proposed. Basically, the 1-norm SVMSL can perform gene selection and classification at the same. However, to improve classification performance, we only use the 1-norm SVMSL as a gene selector, and adopt a subsequent classifier to classify the selected genes. We perform extensive experiments on four DNA microarray data sets. Experimental results indicate that the 1-norm SVMSL has a very fast gene selection speed compared with other methods. For example, the 1-norm SVMSL is almost an order of magnitude faster than the 1-norm SVM, and at least four orders of magnitude faster than SVM-RFE (recursive feature elimination), a state-of-the-art method.

[1]  Olvi L. Mangasarian,et al.  Generalized Support Vector Machines , 1998 .

[2]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[3]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[4]  T. Poggio,et al.  Prediction of central nervous system embryonal tumour outcome based on gene expression , 2002, Nature.

[5]  Michael Elad,et al.  Stable recovery of sparse overcomplete representations in the presence of noise , 2006, IEEE Transactions on Information Theory.

[6]  Le Thi Hoai An,et al.  Efficient approaches for ℓ2-ℓ0 regularization and applications to feature selection in SVM , 2016, Applied Intelligence.

[7]  Yungho Leu,et al.  A novel hybrid feature selection method for microarray data analysis , 2011, Appl. Soft Comput..

[8]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[9]  Jin Cao,et al.  A fast gene selection method for multi-cancer classification using multiple support vector data description , 2015, J. Biomed. Informatics.

[10]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[11]  S. Ramaswamy,et al.  Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. , 2002, Cancer research.

[12]  Hau-San Wong,et al.  A neural network-based biomarker association information extraction approach for cancer classification , 2009, J. Biomed. Informatics.

[13]  Li Zhang,et al.  Linear programming support vector machines , 2002, Pattern Recognit..

[14]  Li Zhang,et al.  A fast approximation algorithm for 1-norm SVM with squared loss , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[15]  Olvi L. Mangasarian,et al.  Exact 1-Norm Support Vector Machines Via Unconstrained Convex Differentiable Minimization , 2006, J. Mach. Learn. Res..

[16]  Lei Liu,et al.  Ensemble gene selection by grouping for microarray data classification , 2010, J. Biomed. Informatics.

[17]  Julio López,et al.  Embedded heterogeneous feature selection for conjoint analysis: A SVM approach using L1 penalty , 2017, Applied Intelligence.

[18]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[19]  F. Azuaje,et al.  Multiple SVM-RFE for gene selection in cancer classification with expression data , 2005, IEEE Transactions on NanoBioscience.

[20]  Joel A. Tropp,et al.  Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit , 2007, IEEE Transactions on Information Theory.

[21]  Glenn Fung,et al.  A Feature Selection Newton Method for Support Vector Machine Classification , 2004, Comput. Optim. Appl..

[22]  Li Zhang,et al.  A fast algorithm for kernel 1-norm support vector machines , 2013, Knowl. Based Syst..

[23]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[24]  Qi Shen,et al.  Simultaneous genes and training samples selection by modified particle swarm optimization for gene expression data classification , 2009, Comput. Biol. Medicine.

[25]  Ayhan Demiriz,et al.  Linear Programming Boosting via Column Generation , 2002, Machine Learning.

[26]  Sayan Mukherjee,et al.  Feature Selection for SVMs , 2000, NIPS.

[27]  Li Zhang,et al.  Multiple SVM-RFE for multi-class gene selection on DNA Microarray data , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[28]  Hau-San Wong,et al.  Constructing the gene regulation-level representation of microarray data for cancer classification , 2008, J. Biomed. Informatics.

[29]  Emmanuel J. Candès,et al.  NESTA: A Fast and Accurate First-Order Method for Sparse Recovery , 2009, SIAM J. Imaging Sci..

[30]  Cheng Wang,et al.  Optimal feature selection for sparse linear discriminant analysis and its applications in gene expression data , 2013, Comput. Stat. Data Anal..

[31]  Li Zhang,et al.  Analysis of programming properties and the row-column generation method for 1-norm support vector machines , 2013, Neural Networks.

[32]  Ingo Steinwart,et al.  Sparseness of Support Vector Machines , 2003, J. Mach. Learn. Res..

[33]  Xin Zhou,et al.  MSVM-RFE: extensions of SVM-RFE for multiclass gene selection on DNA microarray data , 2007, Bioinform..

[34]  Yingmin Jia,et al.  Adaptive huberized support vector machine and its application to microarray classification , 2011, Neural Computing and Applications.

[35]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[36]  Thomas A. Darden,et al.  Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method , 2001, Bioinform..

[37]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[38]  Kristin P. Bennett,et al.  Combining support vector and mathematical programming methods for classification , 1999 .

[39]  Zhicheng Wang,et al.  Salient object detection using biogeography-based optimization to combine features , 2015, Applied Intelligence.

[40]  Eiji Oki,et al.  GLPK (GNU Linear Programming Kit) , 2012 .

[41]  Jian Yang,et al.  Sparse maximum margin discriminant analysis for feature extraction and gene selection on gene expression data , 2013, Comput. Biol. Medicine.

[42]  Robert Tibshirani,et al.  1-norm Support Vector Machines , 2003, NIPS.

[43]  Andrew Kusiak,et al.  Cancer gene search with data-mining and genetic algorithms , 2007, Comput. Biol. Medicine.

[44]  Li Zhang,et al.  On the sparseness of 1-norm support vector machines , 2010, Neural Networks.

[45]  S. Mallat,et al.  Adaptive greedy approximations , 1997 .

[46]  Jinbo Bi,et al.  Dimensionality Reduction via Sparse Support Vector Machines , 2003, J. Mach. Learn. Res..

[47]  Federico Girosi,et al.  An Equivalence Between Sparse Approximation and Support Vector Machines , 1998, Neural Computation.