Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction

Anti-cancer drug sensitivity prediction using deep learning models for individual cell line is a significant challenge in personalized medicine. REFINED (REpresentation of Features as Images with NEighborhood Dependencies) CNN (Convolutional Neural Network) based models have shown promising results in drug sensitivity prediction. The primary idea behind REFINED CNN is representing high dimensional vectors as compact images with spatial correlations that can benefit from convolutional neural network architectures. However, the mapping from a vector to a compact 2D image is not unique due to variations in considered distance measures and neighborhoods. In this article, we consider predictions based on ensembles built from such mappings that can improve upon the best single REFINED CNN model prediction. Results illustrated using NCI60 and NCIALMANAC databases shows that the ensemble approaches can provide significant performance improvement as compared to individual models. We further illustrate that a single mapping created from the amalgamation of the different mappings can provide performance similar to stacking ensemble but with significantly lower computational complexity.

[1]  Peter I. Frazier,et al.  A Tutorial on Bayesian Optimization , 2018, ArXiv.

[2]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[3]  P. Sorger,et al.  Growth rate inhibition metrics correct for confounders in measuring sensitivity to cancer drugs , 2016, Nature Methods.

[4]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[5]  Yan Guo,et al.  Architectures and accuracy of artificial neural network for disease classification from omics data , 2019, BMC Genomics.

[6]  Robert Tibshirani,et al.  Estimating the number of clusters in a data set via the gap statistic , 2000 .

[7]  CHUN WEI YAP,et al.  PaDEL‐descriptor: An open source software to calculate molecular descriptors and fingerprints , 2011, J. Comput. Chem..

[8]  Ranadip Pal,et al.  IntegratedMRF: random forest‐based framework for integrating prediction from different data types , 2017, Bioinform..

[9]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[10]  Ranadip Pal,et al.  Investigation of model stacking for drug sensitivity prediction , 2017, BMC Bioinformatics.

[11]  Shibu Yooseph,et al.  Learning a mixture of microbial networks using minorization–maximization , 2019, Bioinform..

[12]  Yufei Huang,et al.  Convolutional neural network models for cancer type prediction based on gene expression , 2019, BMC Medical Genomics.

[13]  Andreas Bender,et al.  DeepSynergy: predicting anti-cancer drug synergy with Deep Learning , 2017, Bioinform..

[14]  Yu Li,et al.  Deep learning in bioinformatics: introduction, application, and perspective in big data era , 2019, bioRxiv.

[15]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[16]  Yu Li,et al.  Deep learning in bioinformatics: Introduction, application, and perspective in the big data era. , 2019, Methods.

[17]  Larry Rubinstein,et al.  The National Cancer Institute ALMANAC: A Comprehensive Screening Resource for the Detection of Anticancer Drug Pairs with Enhanced Therapeutic Activity. , 2017, Cancer research.

[18]  D. Thompson,et al.  A History of Greek Mathematics , 1922, Nature.

[19]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[20]  Ranadip Pal,et al.  Representation of features as images with neighborhood dependencies for compatibility with convolutional neural networks , 2020, Nature communications.

[21]  Laura M. Heiser,et al.  A community effort to assess and improve drug sensitivity prediction algorithms , 2014, Nature Biotechnology.

[22]  J. Mockus Bayesian Approach to Global Optimization: Theory and Applications , 1989 .

[23]  David D. Cox,et al.  Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures , 2013, ICML.

[24]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[25]  Fangfang Xia,et al.  Predicting tumor cell line response to drug pairs with deep learning , 2018, BMC Bioinformatics.

[26]  R. Pal,et al.  An Ensemble Based Top Performing Approach for NCI-DREAM Drug Sensitivity Prediction Challenge , 2014, PloS one.

[27]  J. Sáez-Rodríguez,et al.  Stratification and prediction of drug synergy based on target functional similarity , 2019, npj Systems Biology and Applications.

[28]  Tae Soon Kim,et al.  Cancer Drug Response Profile scan (CDRscan): A Deep Learning Model That Predicts Drug Effectiveness from Cancer Genomic Signature , 2018, Scientific Reports.

[29]  S. Gerson,et al.  Chapter 57 – Pharmacology and Molecular Mechanisms of Antineoplastic Agents for Hematologic Malignancies , 2017 .

[30]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[31]  Igor F Tsigelny,et al.  Artificial Intelligence in Drug Treatment. , 2020, Annual review of pharmacology and toxicology.

[32]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[33]  Byunghan Lee,et al.  Deep learning in bioinformatics , 2016, Briefings Bioinform..

[34]  Matthew A. Brown,et al.  When Ensembling Smaller Models is More Efficient than Single Large Models , 2020, ArXiv.

[35]  H. Zou,et al.  Addendum: Regularization and variable selection via the elastic net , 2005 .

[36]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[37]  Yufei Huang,et al.  Deep learning of pharmacogenomics resources: moving towards precision oncology , 2019, Briefings Bioinform..

[38]  C. I. Bliss THE TOXICITY OF POISONS APPLIED JOINTLY1 , 1939 .

[39]  Debopam Chakrabarti,et al.  DeepMalaria: Artificial Intelligence Driven Discovery of Potent Antiplasmodials , 2020, Frontiers in Pharmacology.

[40]  S. Ramaswamy,et al.  Systematic identification of genomic markers of drug sensitivity in cancer cells , 2012, Nature.

[41]  Kwong-Sak Leung,et al.  Improving prediction of phenotypic drug response on cancer cell lines using deep convolutional network , 2018, BMC Bioinformatics.

[42]  R. Shoemaker The NCI60 human tumour cell line anticancer drug screen , 2006, Nature Reviews Cancer.