Supervised Wavelet Method to Predict Patient Survival from Gene Expression Data

In microarray studies, the number of samples is relatively small compared to the number of genes per sample. An important aspect of microarray studies is the prediction of patient survival based on their gene expression profile. This naturally calls for the use of a dimension reduction procedure together with the survival prediction model. In this study, a new method based on combining wavelet approximation coefficients and Cox regression was presented. The proposed method was compared with supervised principal component and supervised partial least squares methods. The different fitted Cox models based on supervised wavelet approximation coefficients, the top number of supervised principal components, and partial least squares components were applied to the data. The results showed that the prediction performance of the Cox model based on supervised wavelet feature extraction was superior to the supervised principal components and partial least squares components. The results suggested the possibility of developing new tools based on wavelets for the dimensionally reduction of microarray data sets in the context of survival analysis.

[1]  Yihui Liu,et al.  Dimensionality reduction and main component extraction of mass spectrometry cancer data , 2012, Knowl. Based Syst..

[2]  R. Tibshirani,et al.  Prediction by Supervised Principal Components , 2006 .

[3]  Hege M. Bøvelstad,et al.  Survival prediction from clinico-genomic models - a comparative study , 2009, BMC Bioinformatics.

[4]  Loris Nanni,et al.  Wavelet selection for disease classification by DNA microarray data , 2011, Expert Syst. Appl..

[5]  L. Staudt,et al.  The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. , 2002, The New England journal of medicine.

[6]  Uwe Aickelin,et al.  Wavelet Feature Extraction and Genetic Algorithm for Biomarker Detection in Colorectal Cancer Data , 2013, Knowl. Based Syst..

[7]  Kwong-Sak Leung,et al.  Adaptive L1/2 Shooting Regularization Method for Survival Analysis Using Gene Expression Data , 2013, TheScientificWorldJournal.

[8]  M. Gonen,et al.  Concordance probability and discriminatory power in proportional hazards regression , 2005 .

[9]  Ahmad M. Sarhan,et al.  Wavelet-based feature extraction for DNA microarray classification , 2013, Artificial Intelligence Review.

[10]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[11]  Hongzhe Li,et al.  Kernel Cox Regression Models for Linking Gene Expression Profiles to Censored Survival Data , 2002, Pacific Symposium on Biocomputing.

[12]  I. Langner Survival Analysis: Techniques for Censored and Truncated Data , 2006 .

[13]  M. Pencina,et al.  Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation , 2004, Statistics in medicine.

[14]  Lu Tian,et al.  Linking gene expression data with patient survival times using partial least squares , 2002, ISMB.

[15]  E Graf,et al.  Assessment and comparison of prognostic classification schemes for survival data. , 1999, Statistics in medicine.

[16]  R. Tibshirani,et al.  Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data , 2004, PLoS biology.

[17]  Anne-Laure Boulesteix,et al.  Survival prediction using gene expression data: A review and comparison , 2009, Comput. Stat. Data Anal..

[18]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[19]  Yingdong Zhao,et al.  BRB-ArrayTools Data Archive for Human Cancer Gene Expression: A Unique and Efficient Data Sharing Resource , 2008, Cancer informatics.

[20]  Yihui Liu,et al.  Feature extraction and dimensionality reduction for mass spectrometry data , 2009, Comput. Biol. Medicine.

[21]  Danh V. Nguyen,et al.  Partial least squares proportional hazard regression for application to DNA microarray survival data , 2002, Bioinform..

[22]  N. Nagelkerke,et al.  A note on a general definition of the coefficient of determination , 1991 .

[23]  Ajay N. Jain,et al.  Wavelet transforms for the analysis of microarray experiments , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[24]  Arnoldo Frigessi,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm305 Gene expression Predicting survival from microarray data—a comparative study , 2022 .

[25]  David E. Misek,et al.  Gene-expression profiles predict survival of patients with lung adenocarcinoma , 2002, Nature Medicine.