Prediction of cancer drug sensitivity using high‐dimensional omic features

Summary A large number of cancer drugs have been developed to target particular genes/pathways that are crucial for cancer growth. Drugs that share a molecular target may also have some common predictive omic features, e.g., somatic mutations or gene expression. Therefore, it is desirable to analyze these drugs as a group to identify the associated omic features, which may provide biological insights into the underlying drug response. Furthermore, these omic features may be robust predictors for any drug sharing the same target. The high dimensionality and the strong correlations among the omic features are the main challenges of this task. Motivated by this problem, we develop a new method for high‐dimensional bilevel feature selection using a group of response variables that may share a common set of predictors in addition to their individual predictors. Simulation results show that our method has a substantially higher sensitivity and specificity than existing methods. We apply our method to two large‐scale drug sensitivity studies in cancer cell lines. Both within‐study and between‐study validation demonstrate the good efficacy of our method.

[1]  Hao Wu,et al.  R/qtl: QTL Mapping in Experimental Crosses , 2003, Bioinform..

[2]  Cun-Hui Zhang,et al.  A group bridge approach for variable selection , 2009, Biometrika.

[3]  Y. Pak,et al.  Regulation of cancer cell proliferation by caveolin-2 down-regulation and re-expression. , 2011, International journal of oncology.

[4]  Noah Simon,et al.  A Sparse-Group Lasso , 2013 .

[5]  William R. Sellers,et al.  Advances in the preclinical testing of cancer therapeutic hypotheses , 2011, Nature Reviews Drug Discovery.

[6]  Michele De Palma,et al.  The biology of personalized cancer medicine: Facing individual complexities underlying hallmark capabilities , 2012, Molecular oncology.

[7]  T. Hastie,et al.  SparseNet: Coordinate Descent With Nonconvex Penalties , 2011, Journal of the American Statistical Association.

[8]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[9]  Jiahua Chen,et al.  Extended Bayesian information criteria for model selection with large model spaces , 2008 .

[10]  Pang Du,et al.  Variable selection in linear models , 2014 .

[11]  X. Chen,et al.  Yes-Associated Protein 1 Promotes Adenocarcinoma Growth and Metastasis through Activation of the Receptor Tyrosine Kinase Axl , 2012, International journal of immunopathology and pharmacology.

[12]  Patrick Breheny,et al.  The group exponential lasso for bi‐level variable selection , 2015, Biometrics.

[13]  Peng Zhao,et al.  On Model Selection Consistency of Lasso , 2006, J. Mach. Learn. Res..

[14]  Jianqing Fan,et al.  A Selective Overview of Variable Selection in High Dimensional Feature Space. , 2009, Statistica Sinica.

[15]  Jian Huang,et al.  Penalized methods for bi-level variable selection. , 2009, Statistics and its interface.

[16]  J. Friedman Fast sparse regression and classification , 2012 .

[17]  Y. Ward,et al.  TDAG51 is an ERK signaling target that opposes ERK-mediated HME16C mammary epithelial cell transformation , 2008, BMC Cancer.

[18]  Lincoln Stein,et al.  Reactome: a database of reactions, pathways and biological processes , 2010, Nucleic Acids Res..

[19]  Jian Huang,et al.  A Selective Review of Group Selection in High-Dimensional Models. , 2012, Statistical science : a review journal of the Institute of Mathematical Statistics.

[20]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[21]  J. Ibrahim,et al.  Genomewide Multiple-Loci Mapping in Experimental Crosses by Iterative Adaptive Penalized Regression , 2010, Genetics.

[22]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[23]  Wei Sun,et al.  Designing penalty functions in high dimensional problems: The role of tuning parameters. , 2016, Electronic journal of statistics.

[24]  Adam A. Margolin,et al.  Addendum: The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity , 2012, Nature.

[25]  S. Ramaswamy,et al.  Systematic identification of genomic markers of drug sensitivity in cancer cells , 2012, Nature.

[26]  Hansheng Wang,et al.  Computational Statistics and Data Analysis a Note on Adaptive Group Lasso , 2022 .