Predicting Drug Sensitivity of Cancer Cell Lines via Collaborative Filtering with Contextual Attention

Accurate anti-cancer drug recommendations and the identification of essential biomarkers for this task are crucial to precision oncology. Large-scale drug response assays on cancer cell lines provide a potential way to understand the interplay of drugs and cancer cells. In this work, we present CADRE (Contextual Attention-based Drug REsponse), a model that accurately infers the response of cancer cell lines to a panel of candidate compounds based on the omics profiles, such as gene expressions, of cancer cells. CADRE builds on the framework of collaborative filtering, which provides robustness to the noise of biological data by leveraging similarities within drugs and cell lines. It utilizes the contextual attention mechanism to identify informative biomarkers of these cell lines, which boosts prediction accuracy and affords interpretability of results. In addition, CADRE incorporates external knowledge of drug target pathways and co-expression patterns of genes to further improve feature representations and model performance. Comprehensive evaluations of CADRE and competing models on two large-scale pharmacogenomic datasets show its superiority in both prediction performance and interpretability. CADRE identifies as vital biomarkers genes related to intracellular vesicles and signaling receptor binding, shedding light on its translational potential in the clinical practice of cancer treatment. 1. Code is available at https://github.com/yifengtao/CADRE. c © 2020 Y. Tao1,2,†, S. Ren3,4,†, M.Q. Ding3, R. Schwartz1,5,∗ & X. Lu3,4,6,∗. Drug Sensitivity Prediction via Attention-based Collaborative Filtering

[1]  R. Shoemaker The NCI60 human tumour cell line anticancer drug screen , 2006, Nature Reviews Cancer.

[2]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[3]  Howard A. Fine,et al.  Predicting in vitro drug sensitivity using Random Forests , 2011, Bioinform..

[4]  S. Ramaswamy,et al.  Systematic identification of genomic markers of drug sensitivity in cancer cells , 2012, Nature.

[5]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[6]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[7]  N. Cox,et al.  Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines , 2014, Genome Biology.

[8]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[9]  Mehrbakhsh Nilashi,et al.  Collaborative filtering recommender systems , 2013 .

[10]  Anushya Muruganujan,et al.  Large-scale gene function analysis with the PANTHER classification system , 2013, Nature Protocols.

[11]  Sridhar Ramaswamy,et al.  Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells , 2012, Nucleic Acids Res..

[12]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[13]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[14]  Laura M. Heiser,et al.  A community effort to assess and improve drug sensitivity prediction algorithms , 2014, Nature Biotechnology.

[15]  Mehmet Gönen,et al.  Drug susceptibility prediction against a panel of drugs using kernelized Bayesian multitask learning , 2014, Bioinform..

[16]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[17]  Jun Wang,et al.  Predicting Anticancer Drug Responses Using a Dual-Layer Integrated Cell Line-Drug Network Model , 2015, PLoS Comput. Biol..

[18]  Chun Xing Li,et al.  Anticancer drug sensitivity prediction in cell lines from baseline gene expression through recursive feature selection , 2015, BMC Cancer.

[19]  M. Snyder,et al.  High-throughput sequencing technologies. , 2015, Molecular cell.

[20]  Vinay Prasad,et al.  Perspective: The precision-oncology illusion , 2016, Nature.

[21]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[22]  Hristo S. Paskov,et al.  Multitask learning improves prediction of cancer drug sensitivity , 2016, Scientific Reports.

[23]  Shaker A. Mousa,et al.  Microvesicle removal of anticancer drugs contributes to drug resistance in human pancreatic cancer cells , 2016, Oncotarget.

[24]  Lorenzo Beretta,et al.  Nearest neighbor imputation algorithms: a critical evaluation , 2016, BMC Medical Informatics and Decision Making.

[25]  Louxin Zhang,et al.  Improved anticancer drug response prediction in cell lines using matrix factorization with similarity regularization , 2017, BMC Cancer.

[26]  Adrian V. Lee,et al.  Intrinsic Subtype Switching and Acquired ERBB2/HER2 Amplifications and Mutations in Breast Cancer Brain Metastases , 2017, JAMA oncology.

[27]  A. Schäffer,et al.  The evolution of tumour phylogenetics: principles and practice , 2017, Nature Reviews Genetics.

[28]  Michael Q. Ding,et al.  Precision Oncology beyond Targeted Therapy: Combining Omics Data with Machine Learning Matches the Majority of Cancer Cells to Effective Therapeutics , 2017, Molecular Cancer Research.

[29]  Yufei Huang,et al.  Predicting drug response of tumors from integrated genomic profiles by deep neural networks , 2018, BMC Medical Genomics.

[30]  Karsten M. Borgwardt,et al.  Kernelized rank learning for personalized drug recommendation , 2017, Bioinform..

[31]  Tae Soon Kim,et al.  Cancer Drug Response Profile scan (CDRscan): A Deep Learning Model That Predicts Drug Effectiveness from Cancer Genomic Signature , 2018, Scientific Reports.

[32]  Jingcheng Du,et al.  Gene2vec: distributed representation of genes based on co-expression , 2018, BMC Genomics.

[33]  Xing Chen,et al.  Anti-cancer Drug Response Prediction Using Neighbor-Based Collaborative Filtering with Global Effect Removal , 2018, Molecular therapy. Nucleic acids.

[34]  Leslie N. Smith,et al.  A disciplined approach to neural network hyper-parameters: Part 1 - learning rate, batch size, momentum, and weight decay , 2018, ArXiv.

[35]  Vigneshwari Subramanian,et al.  PaccMann: Prediction of anticancer compound sensitivity with multi-modal attention-based neural networks , 2018, ArXiv.

[36]  Dong Wei,et al.  Comprehensive anticancer drug response prediction based on a simple cell line-drug complex network model , 2019, BMC Bioinformatics.

[37]  Veerabhadran Baladandayuthapani,et al.  Network-Based Matching of Patients and Targeted Therapies for Precision Oncology , 2019, bioRxiv.

[38]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[39]  C. Collins,et al.  MOLI: multi-omics late integration with deep neural networks for drug response prediction , 2019, bioRxiv.

[40]  Maxim N. Artyomov,et al.  Complete deconvolution of cellular mixtures based on linearity of transcriptional signatures , 2019, Nature Communications.

[41]  Christopher Potts,et al.  Effective Feature Representation for Clinical Text Concept Extraction , 2018, Proceedings of the 2nd Clinical Natural Language Processing Workshop.

[42]  Russell Schwartz,et al.  Phylogenies Derived from Matched Transcriptome Reveal the Evolution of Cell Populations and Temporal Order of Perturbed Pathways in Breast Cancer Brain Metastases , 2019, ISMCO.

[43]  Eric P. Xing,et al.  Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning , 2018, bioRxiv.

[44]  William W. Cohen,et al.  From genome to phenome: Predicting multiple cancer phenotypes based on somatic genomic alterations via the genomic impact transformer , 2019, PSB.

[45]  Russell Schwartz,et al.  Tumor heterogeneity assessed by sequencing and fluorescence in situ hybridization (FISH) data , 2020, bioRxiv.

[46]  Jian Ma,et al.  Robust and accurate deconvolution of tumor populations uncovers evolutionary mechanisms of breast cancer metastasis , 2020, Bioinform..

[47]  Russell Schwartz,et al.  Tumor heterogeneity assessed by sequencing and fluorescence in situ hybridization (FISH) data , 2021, Bioinform..