Evaluation of numbers of top ranked genes

One important application of the statistical analysis of microarray gene expression data is to predict the clinical out- comes of diseased patients. The accurate selection of significant genes is crucial to establishing an effective predictive model. Statistical p-value and Cox score are used to rank the lung cancer patients' genes, and principal component analysis, supervised principal components and the partial least squares methods are used to study the effect of the number of top ranked genes. Sim- ulations are performed to evaluate the performance of the proposed methods. A real-life dataset is analyzed using the proposed methods, which are compared to each other. The predictive performance of each method is evaluated using three evaluation criteria. The results show that our predictive methods that involve gene selection have better predictive performance then other currently used methods (5).

[1]  E. Lander,et al.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[3]  Arnoldo Frigessi,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm305 Gene expression Predicting survival from microarray data—a comparative study , 2022 .

[4]  Jiang Gui,et al.  Threshold Gradient Descent Method for Censored Data Regression with Applications in Pharmacogenomics , 2004, Pacific Symposium on Biocomputing.

[5]  Q Tan,et al.  Gene Selection for Predicting Survival Outcomes of Cancer Patients in Microarray Studies , 2007, SCSS.

[6]  R. Tibshirani,et al.  Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data , 2004, PLoS biology.

[7]  T. Yeatman,et al.  A Predictive Risk Probability Approach for Microarray Data with Survival as an Endpoint , 2008, Journal of biopharmaceutical statistics.

[8]  R. Tibshirani,et al.  Prediction by Supervised Principal Components , 2006 .

[9]  Chun-Houh Chen,et al.  A method for analyzing censored survival phenotype with gene expression data , 2008, BMC Bioinformatics.

[10]  Jiang Gui,et al.  Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data , 2005, Bioinform..

[11]  Anne-Laure Boulesteix,et al.  Partial least squares: a versatile tool for the analysis of high-dimensional genomic data , 2006, Briefings Bioinform..

[12]  Marina Vannucci,et al.  Bioinformatics Original Paper Bayesian Variable Selection for the Analysis of Microarray Data with Censored Outcomes , 2022 .