Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer

Analysis of microarray data is associated with the methodological problems of high dimension and small sample size. Various methods have been used for variable selection in high-dimension and small sample size cases with a single survival endpoint. However, little effort has been directed toward addressing competing risks where there is more than one failure risks. This study compared three typical variable selection techniques including Lasso, elastic net, and likelihood-based boosting for high-dimensional time-to-event data with competing risks. The performance of these methods was evaluated via a simulation study by analyzing a real dataset related to bladder cancer patients using time-dependent receiver operator characteristic (ROC) curve and bootstrap .632+ prediction error curves. The elastic net penalization method was shown to outperform Lasso and boosting. Based on the elastic net, 33 genes out of 1381 genes related to bladder cancer were selected. By fitting to the Fine and Gray model, eight genes were highly significant (P < 0.001). Among them, expression of RTN4, SON, IGF1R, SNRPE, PTGR1, PLEK, and ETFDH was associated with a decrease in survival time, whereas SMARCAD1 expression was associated with an increase in survival time. This study indicates that the elastic net has a higher capacity than the Lasso and boosting for the prediction of survival time in bladder cancer patients. Moreover, genes selected by all methods improved the predictive power of the model based on only clinical variables, indicating the value of information contained in the microarray features.

[1]  IGF1(CA)19 and IGFBP-3-202A/C Gene Polymorphism and Cancer Risk: A Meta-analysis , 2014, Cell Biochemistry and Biophysics.

[2]  Anestis Antoniadis,et al.  The Dantzig Selector in Cox's Proportional Hazards Model , 2009 .

[3]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[4]  H. Ozen Bladder cancer. , 1998, Current opinion in oncology.

[5]  Yiling Lu,et al.  Predicting time to ovarian carcinoma recurrence using protein markers. , 2013, The Journal of clinical investigation.

[6]  Mee Young Park,et al.  L1‐regularization path algorithm for generalized linear models , 2007 .

[7]  Christopher J. Hickey,et al.  New Discoveries of Old SON: A Link Between RNA Splicing and Cancer , 2014, Journal of cellular biochemistry.

[8]  L. Alberghina,et al.  The insulin-like growth factor receptor I promotes motility and invasion of bladder cancer cells through Akt- and mitogen-activated protein kinase-dependent activation of paxillin. , 2010, The American journal of pathology.

[9]  K. Majidzadeh-A,et al.  BsmI but not FokI polymorphism of VDR gene is contributed in breast cancer , 2013, Medical Oncology.

[10]  Yiling Lu,et al.  Erratum: Predicting time to ovarian carcinoma recurrence using protein markers (Journal of Clinical Investigation (2013) 123:9 (3740-3750) DOI: 10.1172/JCI68509) , 2013 .

[11]  G. Morgan,et al.  A Gene Expression–Based Predictor for Myeloma Patients at High Risk of Developing Bone Disease on Bisphosphonate Treatment , 2011, Clinical Cancer Research.

[12]  Anne-Mette K. Hein,et al.  Gene Expression Signatures Predict Outcome in Non–Muscle-Invasive Bladder Carcinoma: A Multicenter Validation Study , 2007, Clinical Cancer Research.

[13]  S. Weremowicz,et al.  SMARCAD1, a novel human helicase family-defining member associated with genetic instability: cloning, expression, and mapping to 4q22-q23, a band rich in breakpoints and deletion mutants involved in several human diseases. , 2000, Genomics.

[14]  J. Ogutu,et al.  Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions , 2012, BMC Proceedings.

[15]  H. Caron,et al.  Risk of second malignancies in long-term survivors of childhood cancer. , 2007, European journal of cancer.

[16]  John Quackenbush,et al.  Differentiating progressive from nonprogressive T1 bladder cancer by gene expression profiling: applying RNA-sequencing analysis on archived specimens. , 2014, Urologic oncology.

[17]  Jinchi Lv,et al.  High-Dimensional Sparse Additive Hazards Regression , 2012, 1212.6232.

[18]  R. Iozzo,et al.  Dichotomy of decorin activity on the insulin‐like growth factor‐I system , 2013, The FEBS journal.

[19]  L. Zeng,et al.  Identification of differentially expressed proteins in chemotherapy-sensitive and chemotherapy-resistant diffuse large B cell lymphoma by proteomic methods , 2013, Medical Oncology.

[20]  Kuang-Chi Lai,et al.  Allyl isothiocyanate inhibits cell metastasis through suppression of the MAPK pathways in epidermal growth factor‑stimulated HT29 human colorectal adenocarcinoma cells. , 2014, Oncology reports.

[21]  Jiang Gui,et al.  Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data , 2005, Bioinform..

[22]  P. Chang,et al.  Comparative and targeted proteomic analyses of urinary microparticles from bladder cancer and hernia patients. , 2012, Journal of proteome research.

[23]  Robert Gray,et al.  A Proportional Hazards Model for the Subdistribution of a Competing Risk , 1999 .

[24]  Qiqin Yin-Goen,et al.  Molecular classification of renal tumors by gene expression profiling. , 2005, The Journal of molecular diagnostics : JMD.

[25]  Klaus Jung,et al.  A New Algorithm for Integrated Analysis of miRNA-mRNA Interactions Based on Individual Classification Reveals Insights into Bladder Cancer , 2013, PloS one.

[26]  survivalBIV: Estimation of the Bivariate Distribution Function for Sequentially Ordered Events Under Univariate Censoring , 2012 .

[27]  Jiang Gui,et al.  Threshold Gradient Descent Method for Censored Data Regression with Applications in Pharmacogenomics , 2004, Pacific Symposium on Biocomputing.

[28]  Hongzhe Li,et al.  Kernel Cox Regression Models for Linking Gene Expression Profiles to Censored Survival Data , 2002, Pacific Symposium on Biocomputing.

[29]  Yi Li,et al.  Statistical Applications in Genetics and Molecular Biology Survival Analysis with High-Dimensional Covariates : An Application in Microarray Studies , 2011 .

[30]  Robert J Tibshirani,et al.  Statistical Applications in Genetics and Molecular Biology , 2011 .

[31]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[32]  David R Davies,et al.  The type 1 insulin‐like growth factor receptor is over‐expressed in bladder cancer , 2007, BJU international.

[33]  Harald Binder,et al.  Boosting for high-dimensional time-to-event data with competing risks , 2009, Bioinform..

[34]  J. Goeman L1 Penalized Estimation in the Cox Proportional Hazards Model , 2009, Biometrical journal. Biometrische Zeitschrift.

[35]  Harald Binder,et al.  Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models , 2008, BMC Bioinformatics.

[36]  M. L. Calle,et al.  Genetic Variation in the TP53 Pathway and Bladder Cancer Risk. A Comprehensive Analysis , 2014, PloS one.

[37]  Daniela M Witten,et al.  Extensions of Sparse Canonical Correlation Analysis with Applications to Genomic Data , 2009, Statistical applications in genetics and molecular biology.

[38]  J. Niederhuber,et al.  Up-Regulation of Human Prostaglandin Reductase 1 Improves the Efficacy of Hydroxymethylacylfulvene, an Antitumor Chemotherapeutic Agent , 2012, Journal of Pharmacology and Experimental Therapeutics.

[39]  Yusuke Nakamura,et al.  Molecular features of hormone-refractory prostate cancer cells by genome-wide gene expression profiles. , 2007, Cancer research.

[40]  Franziska Michor,et al.  Combination of a Novel Gene Expression Signature with a Clinical Nomogram Improves the Prediction of Survival in High-Risk Bladder Cancer , 2012, Clinical Cancer Research.

[41]  F. André,et al.  Targeting the deregulated spliceosome core machinery in cancer cells triggers mTOR blockade and autophagy. , 2013, Cancer research.

[42]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .