Structured penalized regression for drug sensitivity prediction

Large‐scale in vitro drug sensitivity screens are an important tool in personalized oncology to predict the effectiveness of potential cancer drugs. The prediction of the sensitivity of cancer cell lines to a panel of drugs is a multivariate regression problem with high dimensional heterogeneous multiomics data as input data and with potentially strong correlations between the outcome variables which represent the sensitivity to the different drugs. We propose a joint penalized regression approach with structured penalty terms which enable us to utilize the correlation structure between drugs with group‐lasso‐type penalties and at the same time address the heterogeneity between ‘omics’ data sources by introducing data‐source‐specific penalty factors to penalize different data sources differently. By combining integrative penalty factors (IPFs) with the tree‐guided group lasso, we create a method called ‘IPF‐tree‐lasso’. We present a unified framework to transform more general IPF‐type methods to the original penalized method. Because the structured penalty terms have multiple parameters, we demonstrate how the interval search ‘Efficient parameter selection via global optimization’ algorithm can be used to optimize multiple penalty parameters efficiently. Simulation studies show that IPF‐tree‐lasso can improve the prediction performance compared with other lasso‐type methods, in particular for heterogeneous sources of data. Finally, we employ the new methods to analyse data from the ‘Genomics of drug sensitivity in cancer’ project.

[1]  Donald R. Jones,et al.  Efficient Global Optimization of Expensive Black-Box Functions , 1998, J. Glob. Optim..

[2]  Zijing Wang,et al.  Efficient parameter selection for support vector machines , 2019, Enterp. Inf. Syst..

[3]  Yu Jiang,et al.  A Selective Review of Multi-Level Omics Data Integration Using Variable Selection , 2019, High-throughput.

[4]  Roman Hornung,et al.  Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data , 2018, BMC Bioinformatics.

[5]  Frank Dondelinger,et al.  The joint lasso: high-dimensional regression for group structured data , 2018, Biostatistics.

[6]  Q. Ding,et al.  Nutlin‐3a as a novel anticancer agent for adrenocortical carcinoma with CTNNB1 mutation , 2018, Cancer medicine.

[7]  Krister Wennerberg,et al.  Global proteomics profiling improves drug sensitivity prediction: results from a multi-omics, pan-cancer modeling approach , 2017, Bioinform..

[8]  Daniel C. Liebler,et al.  Colorectal Cancer Cell Line Proteomes Are Representative of Primary Tumors and Predict Drug Sensitivity. , 2017, Gastroenterology.

[9]  Reza Safdari,et al.  Computational prediction of drug-drug interactions based on drugs functional similarities , 2017, J. Biomed. Informatics.

[10]  Daniel V. Samarov,et al.  A Coordinate-Descent-Based Approach to Solving the Sparse Group Elastic Net , 2017, Technometrics.

[11]  A. Lusis,et al.  Multi-omics approaches to disease , 2017, Genome Biology.

[12]  Xiaoyu Jiang,et al.  IPF-LASSO: Integrative L 1-Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data , 2017, Comput. Math. Methods Medicine.

[13]  A. Pandey,et al.  Small molecule inhibitor screening identifified HSP90 inhibitor 17-AAG as potential therapeutic agent for gallbladder cancer , 2017, Oncotarget.

[14]  C. I. Smith,et al.  From identification of the BTK kinase to effective management of leukemia , 2016, Oncogene.

[15]  D. Chan,et al.  Precision medicine: from pharmacogenomics to pharmacoproteomics , 2016, Clinical Proteomics.

[16]  R. Reis,et al.  Vemurafenib resistance increases melanoma invasiveness and modulates the tumor microenvironment by MMP-2 upregulation. , 2016, Pharmacological research.

[17]  Sungkyoung Choi,et al.  Pathway-based approach using hierarchical components of collapsed rare variants , 2016, Bioinform..

[18]  Scott E. Martin,et al.  Reproducible pharmacogenomic profiling of cancer cell line panels , 2016, Nature.

[19]  I. Laurenzana,et al.  Targeting the p53-MDM2 interaction by the small-molecule MDM2 antagonist Nutlin-3a: a new challenged target therapy in adult Philadelphia positive acute lymphoblastic leukemia patients , 2016, Oncotarget.

[20]  Aida Moreno-Moral,et al.  MT-HESS: an efficient Bayesian approach for simultaneous association detection in OMICS datasets, with application to eQTL mapping in multiple tissues , 2015, Bioinform..

[21]  R. Plummer,et al.  Randomized phase II study evaluating veliparib (ABT-888) with temozolomide in patients with metastatic melanoma. , 2015, Annals of oncology : official journal of the European Society for Medical Oncology.

[22]  J. Jin,et al.  NSC-87877 inhibits DUSP26 function in neuroblastoma resulting in p53-mediated apoptosis , 2015, Cell Death and Disease.

[23]  Bin Nan,et al.  Multivariate sparse group lasso for the multivariate multiple linear regression with an arbitrary group structure , 2015, Biometrics.

[24]  H. Chen,et al.  β2-AR signaling controls trastuzumab resistance-dependent pathway , 2015, Oncogene.

[25]  Martin Sill,et al.  c060: Extended Inference with Lasso and Elastic-Net Regularized Cox and Generalized Linear Models , 2014 .

[26]  Wessel N van Wieringen,et al.  Better prediction by use of co‐data: adaptive group‐regularized ridge regression , 2014, Statistics in medicine.

[27]  Robert Clarke,et al.  Enhancing reproducibility in cancer drug screening: how do we move forward? , 2014, Cancer research.

[28]  Trevor Hastie,et al.  A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression , 2013, 1311.6529.

[29]  Laura M. Heiser,et al.  Modeling precision treatment of breast cancer , 2013, Genome Biology.

[30]  Corbin E. Meacham,et al.  Tumour heterogeneity and cancer cell plasticity , 2013, Nature.

[31]  O. Thas,et al.  EMLasso: logistic lasso with missing data , 2013, Statistics in medicine.

[32]  G. Silvestri,et al.  Activation of p53 with Nutlin-3a radiosensitizes lung cancer cells via enhancing radiation-induced premature senescence. , 2013, Lung cancer.

[33]  Noah Simon,et al.  A Sparse-Group Lasso , 2013 .

[34]  Sridhar Ramaswamy,et al.  Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells , 2012, Nucleic Acids Res..

[35]  Hsiu-Fang Lee,et al.  HSP90 inhibitor 17-AAG selectively eradicates lymphoma stem cells. , 2012, Cancer research.

[36]  S. Ramaswamy,et al.  Systematic identification of genomic markers of drug sensitivity in cancer cells , 2012, Nature.

[37]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[38]  F. Hodi,et al.  Vemurafenib and BRAF Inhibition: A New Class of Treatment for Metastatic Melanoma , 2011, Clinical Cancer Research.

[39]  I. Glad,et al.  Weighted Lasso with Data Integration , 2011, Statistical applications in genetics and molecular biology.

[40]  J. Blay,et al.  Nilotinib: a novel, selective tyrosine kinase inhibitor. , 2011, Seminars in oncology.

[41]  G. Rassidakis,et al.  Activation of the p53 pathway by the MDM2 inhibitor nutlin-3a overcomes BCL2 overexpression in a preclinical model of diffuse large B-cell lymphoma associated with t(14;18)(q32;q21) , 2011, Leukemia.

[42]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[43]  Xi Chen,et al.  Smoothing proximal gradient method for general structured sparse regression , 2010, The Annals of Applied Statistics.

[44]  Joel Greshock,et al.  Molecular target class is predictive of in vitro response profile. , 2010, Cancer research.

[45]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[46]  E. Xing,et al.  Tree-guided group lasso for multi-response regression with structured sparsity, with an application to eQTL mapping , 2009, 0909.1373.

[47]  R. Hamatake,et al.  RDEA119/BAY 869766: a potent, selective, allosteric inhibitor of MEK1/2 for the treatment of cancer. , 2009, Cancer research.

[48]  Jean-Philippe Vert,et al.  Group lasso with overlap and graph lasso , 2009, ICML '09.

[49]  J. M. Najeb,et al.  Efficient Parameter Selection of Support Vector Machines , 2008 .

[50]  Hongzhe Li,et al.  Network-constrained regularization and variable selection for analysis of genomic data , 2008 .

[51]  A. Zell,et al.  Efficient parameter selection for support vector machines in classification and regression via model-based global optimization , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[52]  Stephen J. Wright,et al.  Simultaneous Variable Selection , 2005, Technometrics.

[53]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[54]  N. Ueno,et al.  Tumor-targeted gene delivery via anti-HER2 antibody (trastuzumab, Herceptin) conjugated polyethylenimine. , 2004, Journal of controlled release : official journal of the Controlled Release Society.

[55]  E. Jacoby,et al.  Chemogenomics: an emerging strategy for rapid target and drug discovery , 2004, Nature Reviews Genetics.

[56]  R. Gilbertson,et al.  Medulloblastoma Sensitivity to 17-Allylamino-17-demethoxygeldanamycin Requires MEK/ERK* , 2003, Journal of Biological Chemistry.

[57]  C. Harris,et al.  The IARC TP53 database: New online mutation analysis and recommendations to users , 2002, Human mutation.

[58]  J. Sebolt-Leopold Development of anticancer drugs targeting the MAP kinase pathway , 2000, Oncogene.

[59]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[60]  D. Goodsell,et al.  The Molecular Perspective: Methotrexate , 1999, Stem cells.

[61]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .