An Improved Elastic Net for Cancer Classification and Gene Selection

Abstract This paper presents an improved elastic net to identify relevant genes for cancer classification. By introducing the data-driven weight coefficients, the improved elastic net can adaptively select genes in groups and reduce the shrinkage bias for the coefficients of significant genes. Moreover, the irrelevant observations on the augmented dataset are removed and the computational complexity is largely reduced. Experiment results on the acute leukaemia data are provided to verify the proposed method.

[1]  Ding-Xuan Zhou,et al.  SVM Soft Margin Classifiers: Linear Programming versus Quadratic Programming , 2005, Neural Computation.

[2]  S. Sathiya Keerthi,et al.  A simple and efficient algorithm for gene selection using sparse logistic regression , 2003, Bioinform..

[3]  Yiming Ying,et al.  Learning Rates of Least-Square Regularized Regression , 2006, Found. Comput. Math..

[4]  Robert Tibshirani,et al.  1-norm Support Vector Machines , 2003, NIPS.

[5]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[6]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[7]  H. Zou,et al.  The doubly regularized support vector machine , 2006 .

[8]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[9]  Kam D. Dahlquist,et al.  Regression Approaches for Microarray Data Analysis , 2002, J. Comput. Biol..

[10]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[11]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[12]  Yiming Ying,et al.  Support Vector Machine Soft Margin Classifiers: Error Analysis , 2004, J. Mach. Learn. Res..

[13]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[14]  T. Hastie,et al.  Classification of gene microarrays by penalized logistic regression. , 2004, Biostatistics.

[15]  Trevor Hastie,et al.  Averaged gene expressions for regression. , 2007, Biostatistics.

[16]  Li Wang,et al.  Hybrid huberized support vector machines for microarray classification and gene selection , 2008, Bioinform..

[17]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[18]  Gavin C. Cawley,et al.  Gene Selection in Cancer Classification using Sparse Logistic Regression with Bayesian Regularisation , 2006 .