Pathway‐structured predictive modeling for multi‐level drug response in multiple myeloma

Motivation: Molecular analyses suggest that myeloma is composed of distinct sub‐types that have different molecular pathologies and various response rates to certain treatments. Drug responses in multiple myeloma (MM) are usually recorded as a multi‐level ordinal outcome. One of the goals of drug response studies is to predict which response category any patients belong to with high probability based on their clinical and molecular features. However, as most of genes have small effects, gene‐based models may provide limited predictive accuracy. In that case, methods for predicting multi‐level ordinal drug responses by incorporating biological pathways are desired but have not been developed yet. Results: We propose a pathway‐structured method for predicting multi‐level ordinal responses using a two‐stage approach. We first develop hierarchical ordinal logistic models and an efficient quasi‐Newton algorithm for jointly analyzing numerous correlated variables. Our two‐stage approach first obtains the linear predictor (called the pathway score) for each pathway by fitting all predictors within each pathway using the hierarchical ordinal logistic approach, and then combines the pathway scores as new predictors to build a predictive model. We applied the proposed method to two publicly available datasets for predicting multi‐level ordinal drug responses in MM using large‐scale gene expression data and pathway information. Our results show that our approach not only significantly improved the predictive performance compared with the corresponding gene‐based model but also allowed us to identify biologically relevant pathways. Availability and implementation: The proposed approach has been implemented in our R package BhGLM, which is freely available from the public GitHub repository https://github.com/abbyyan3/BhGLM.

[1]  M van Duin,et al.  Pharmacogenomics and chemical library screens reveal a novel SCFSKP2 inhibitor that overcomes Bortezomib resistance in multiple myeloma , 2016, Leukemia.

[2]  C. O’Brien Statistical Learning with Sparsity: The Lasso and Generalizations , 2016 .

[3]  Justin Zobel,et al.  Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context , 2010, BMC Bioinformatics.

[4]  R. Tibshirani,et al.  Statistical Applications in Genetics and Molecular Biology Pre-validation and inference in microarrays , 2011 .

[5]  Carlos Caldas,et al.  A comprehensive analysis of prognostic signatures reveals the high predictive capacity of the Proliferation, Immune response and RNA splicing modules in breast cancer , 2008, Breast Cancer Research.

[6]  Gastone Castellani,et al.  The genetic and genomic background of multiple myeloma patients achieving complete response after induction therapy with bortezomib, thalidomide and dexamethasone (VTD) , 2015, Oncotarget.

[7]  Hong Chang,et al.  Multiple myeloma acquires resistance to EGFR inhibitor via induction of pentose phosphate pathway , 2015, Scientific Reports.

[8]  Anthony Boral,et al.  Gene expression profiling and correlation with outcome in clinical trials of the proteasome inhibitor bortezomib. , 2006, Blood.

[9]  Lana X. Garmire,et al.  A Novel Model to Combine Clinical and Pathway-Based Transcriptomic Information for the Prognosis Prediction of Breast Cancer , 2014, PLoS Comput. Biol..

[10]  N. Cox,et al.  Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines , 2014, Genome Biology.

[11]  S. Rajkumar,et al.  Treatment of multiple myeloma , 2011, Nature Reviews Clinical Oncology.

[12]  S. Jagannath,et al.  CRITERIA FOR EVALUATING DISEASE RESPONSE AND PROGRESSION IN PATIENTS WITH MULTIPLE MYELOMA TREATED BY HIGH‐DOSE THERAPY AND HAEMOPOIETIC STEM CELL TRANSPLANTATION , 1998, British journal of haematology.

[13]  S Vincent Rajkumar,et al.  Multiple myeloma. , 2009, Current problems in cancer.

[14]  N. Normanno,et al.  Epidermal growth factor receptor (EGFR) signaling in cancer. , 2006, Gene.

[15]  Xinyan Zhang,et al.  Pathway-Structured Predictive Model for Cancer Survival Prediction: A Two-Stage Approach , 2016 .

[16]  M van Duin,et al.  RPL5 on 1p22.1 is recurrently deleted in multiple myeloma and its expression is linked to bortezomib response , 2016, Leukemia.

[17]  Yuqiu Jiang,et al.  Personalized medicine in oncology: tailoring the right drug to the right patient. , 2010, Biomarkers in medicine.

[18]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[19]  G. Roodman,et al.  Chemokines in multiple myeloma. , 2006, Experimental hematology.

[20]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[21]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[22]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[23]  D. Dingli,et al.  Improved survival in multiple myeloma and the impact of novel therapies. , 2008, Blood.

[24]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[25]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[26]  Christina Kendziorski,et al.  Pathway index models for construction of patient‐specific risk profiles , 2013, Statistics in medicine.

[27]  A. Wells,et al.  DU145 human prostate carcinoma invasiveness is modulated by urokinase receptor (uPAR) downstream of epidermal growth factor receptor (EGFR) signaling. , 2004, Experimental cell research.

[28]  Andrei Zinovyev,et al.  Computational Systems Biology of Cancer , 2020 .

[29]  Alex Arenas,et al.  Improved prognostic classification of breast cancer defined by antagonistic activation patterns of immune response pathway modules , 2010, BMC Cancer.

[30]  Y. Kang,et al.  Epidermal growth factor signalling and bone metastasis , 2009, British Journal of Cancer.

[31]  P. Warner Ordinal logistic regression , 2008, Journal of Family Planning and Reproductive Health Care.

[32]  Trevor Hastie,et al.  Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent. , 2011, Journal of statistical software.

[33]  K. Podar,et al.  Inhibition of VEGF Signaling Pathways in Multiple Myeloma and Other Malignancies , 2007, Cell cycle.

[34]  Doheon Lee,et al.  Inferring Pathway Activity toward Precise Disease Classification , 2008, PLoS Comput. Biol..

[35]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[36]  R. Fonseca,et al.  Trends in overall survival and costs of multiple myeloma, 2000–2014 , 2016, Leukemia.

[37]  M. G. Pittau,et al.  A weakly informative default prior distribution for logistic and other regression models , 2008, 0901.4011.