Hybrid Models Identified a 12-Gene Signature for Lung Cancer Prognosis and Chemoresponse Prediction

Background Lung cancer remains the leading cause of cancer-related deaths worldwide. The recurrence rate ranges from 35–50% among early stage non-small cell lung cancer patients. To date, there is no fully-validated and clinically applied prognostic gene signature for personalized treatment. Methodology/Principal Findings From genome-wide mRNA expression profiles generated on 256 lung adenocarcinoma patients, a 12-gene signature was identified using combinatorial gene selection methods, and a risk score algorithm was developed with Naïve Bayes. The 12-gene model generates significant patient stratification in the training cohort HLM & UM (n = 256; log-rank P = 6.96e-7) and two independent validation sets, MSK (n = 104; log-rank P = 9.88e-4) and DFCI (n = 82; log-rank P = 2.57e-4), using Kaplan-Meier analyses. This gene signature also stratifies stage I and IB lung adenocarcinoma patients into two distinct survival groups (log-rank P<0.04). The 12-gene risk score is more significant (hazard ratio = 4.19, 95% CI: [2.08, 8.46]) than other commonly used clinical factors except tumor stage (III vs. I) in multivariate Cox analyses. The 12-gene model is more accurate than previously published lung cancer gene signatures on the same datasets. Furthermore, this signature accurately predicts chemoresistance/chemosensitivity to Cisplatin, Carboplatin, Paclitaxel, Etoposide, Erlotinib, and Gefitinib in NCI-60 cancer cell lines (P<0.017). The identified 12 genes exhibit curated interactions with major lung cancer signaling hallmarks in functional pathway analysis. The expression patterns of the signature genes have been confirmed in RT-PCR analyses of independent tumor samples. Conclusions/Significance The results demonstrate the clinical utility of the identified gene signature in prognostic categorization. With this 12-gene risk score algorithm, early stage patients at high risk for tumor recurrence could be identified for adjuvant chemotherapy; whereas stage I and II patients at low risk could be spared the toxic side effects of chemotherapeutic drugs.

[1]  Charles A Powell,et al.  Lung adenocarcinoma global profiling identifies type II transforming growth factor-beta receptor as a repressor of invasiveness. , 2005, American journal of respiratory and critical care medicine.

[2]  Rafael A. Irizarry,et al.  A Model-Based Background Adjustment for Oligonucleotide Expression Arrays , 2004 .

[3]  M. Pepe,et al.  Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker. , 2004, American journal of epidemiology.

[4]  Chronic Disease Division Cancer facts and figures , 2010 .

[5]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[6]  Jeffrey T. Chang,et al.  Oncogenic pathway signatures in human cancers as a guide to targeted therapies , 2006, Nature.

[7]  Igor Jurisica,et al.  Gene expression–based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study , 2008, Nature Medicine.

[8]  U. G. Dailey Cancer,Facts and Figures about. , 2022, Journal of the National Medical Association.

[9]  S. Friend,et al.  Cancer Biomarkers—An Invitation to the Table , 2006, Science.

[10]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[11]  M. Cronin,et al.  A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. , 2004, The New England journal of medicine.

[12]  S. Mukherjee,et al.  A genomic strategy to refine prognosis in early-stage non-small-cell lung cancer. , 2006, The New England journal of medicine.

[13]  Yong Qian,et al.  Confirmation of Gene Expression–Based Prediction of Survival in Non–Small Cell Lung Cancer , 2008, Clinical Cancer Research.

[14]  Richard Lugg,et al.  Mutation analysis of 24 known cancer genes in the NCI-60 cell line set , 2006, Molecular Cancer Therapeutics.

[15]  Jae K. Lee,et al.  Transcript and protein expression profiles of the NCI-60 cancer cell panel: an integromic microarray study , 2007, Molecular Cancer Therapeutics.

[16]  Sudhir Srivastava,et al.  Markers for early detection of cancer: Statistical guidelines for nested case-control studies , 2002, BMC medical research methodology.

[17]  Ian Witten,et al.  Data Mining , 2000 .

[18]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[19]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Thomas D. Schmittgen,et al.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. , 2001, Methods.

[21]  S. Wieand,et al.  Analysis of repeated markers used to predict progression of cancer. , 1998, Statistics in medicine.

[22]  Yudong D. He,et al.  A Gene-Expression Signature as a Predictor of Survival in Breast Cancer , 2002 .

[23]  Igor Jurisica,et al.  Three-gene prognostic classifier for early-stage non small-cell lung cancer. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[24]  Igor Jurisica,et al.  Prognostic gene signatures for non-small-cell lung cancer , 2009, Proceedings of the National Academy of Sciences.

[25]  H. Hansen,et al.  Lung cancer. , 1990, Cancer chemotherapy and biological response modifiers.

[26]  Vincent Castranova,et al.  An integrative genomic and proteomic approach to chemosensitivity prediction. , 2009, International journal of oncology.

[27]  Richard Simon,et al.  Gene expression-based prognostic signatures in lung cancer: ready for clinical use? , 2010, Journal of the National Cancer Institute.

[28]  David E. Misek,et al.  Gene-expression profiles predict survival of patients with lung adenocarcinoma , 2002, Nature Medicine.

[29]  Zhifu Sun,et al.  A Gene Expression Signature Predicts Survival of Patients with Stage I Non-Small Cell Lung Cancer , 2006, PLoS medicine.

[30]  Zhenyu Ding,et al.  Predicting Cancer Drug Response by Proteomic Profiling , 2006, Clinical Cancer Research.

[31]  Geoff Holmes,et al.  Benchmarking Attribute Selection Techniques for Discrete Class Data Mining , 2003, IEEE Trans. Knowl. Data Eng..

[32]  Xianglin Shi,et al.  Constructing Molecular Classifiers for the Accurate Prognosis of Lung Adenocarcinoma , 2006, Clinical Cancer Research.

[33]  R. Tsuchiya,et al.  Prognosis and survival in resected lung carcinoma based on the new international staging system. , 1988, The Journal of thoracic and cardiovascular surgery.

[34]  Michael E Phelps,et al.  Systems Biology and New Technologies Enable Predictive and Preventative Medicine , 2004, Science.

[35]  Jeremy J. W. Chen,et al.  A five-gene signature and clinical outcome in non-small-cell lung cancer. , 2007, The New England journal of medicine.

[36]  Yi Zhang,et al.  Gene expression signatures for predicting prognosis of squamous cell and adenocarcinomas of the lung. , 2006, Cancer research.