Predicting clinical drug response from model systems by non-linear subspace-based transfer learning

Pre-clinical models have been the workhorse of cancer research for decades. While powerful, these models do not fully recapitulate the complexity of human tumors. Consequently, translating biomarkers of drug response from pre-clinical models to human tumors has been particularly challenging. To explicitly take these differences into account and enable an efficient exploitation of the vast pre-clinical drug response resources, we developed TRANSACT, a novel computational framework for clinical drug response prediction. First, TRANSACT employs non-linear manifold learning to capture biological processes active in pre-clinical models and human tumors. Then, TRANSACT builds predictors on cell line response only and transfers these to Patient-Derived Xenografts (PDXs) and human tumors. TRANSACT outperforms four competing approaches, including Deep Learning approaches, for a set of 15 drugs on PDXs, TCGA cohorts and 226 metastatic tumors from the Hartwig Medical Foundation data. For only four drugs Deep Learning outperforms TRANSACT. We further derived an algorithmic approach to interpret TRANSACT and used it to validate the approach by identifying known biomarkers to targeted therapies and we propose novel putative biomarkers of resistance to Paclitaxel and Gemcitabine.

[1]  Wouter M. Kouw,et al.  A Review of Domain Adaptation without Target Labels , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  James M. McFarland,et al.  Global computational alignment of tumor and cell line transcriptional profiles , 2020, Nature Communications.

[3]  Aryo Pradipta Gema,et al.  pyComBat, a Python tool for batch effects correction in high-throughput molecular data using empirical Bayes methods , 2020, bioRxiv.

[4]  Benjamin Haibe-Kains,et al.  Assessment of modelling strategies for drug response prediction in cell lines and xenografts , 2020, Scientific Reports.

[5]  D. Lauffenburger,et al.  Translating preclinical models to humans , 2020, Science.

[6]  Martin Ester,et al.  AITL: Adversarial Inductive Transfer Learning with input and output space adaptation for pharmacogenomics , 2020, bioRxiv.

[7]  Zhe-Sheng Chen,et al.  The PI3K subunits, P110α and P110β are potential targets for overcoming P-gp and BCRP-mediated MDR in cancer , 2020, Molecular Cancer.

[8]  Joel Nothman,et al.  SciPy 1.0-Fundamental Algorithms for Scientific Computing in Python , 2019, ArXiv.

[9]  Aristotelis Tsirigos,et al.  A Deep Learning Framework for Predicting Response to Therapy in Cancer. , 2019, Cell reports.

[10]  Martin Ester,et al.  Adversarial Inductive Transfer Learning with input and output space adaptation , 2019 .

[11]  Michael Gruenstaeudl,et al.  PACVr: plastome assembly coverage visualization in R , 2019, BMC Bioinformatics.

[12]  Paul J. Hoffman,et al.  Comprehensive Integration of Single-Cell Data , 2018, Cell.

[13]  Marcel J. T. Reinders,et al.  PRECISE: a domain adaptation approach to transfer predictors of drug response from pre-clinical models to tumors , 2019, bioRxiv.

[14]  C. Collins,et al.  MOLI: multi-omics late integration with deep neural networks for drug response prediction , 2019, bioRxiv.

[15]  Wilma A. Bainbridge,et al.  Drawings of real-world scenes during free recall reveal detailed object and spatial information in memory , 2019, Nature Communications.

[16]  Matteo Manica,et al.  PIMKL: Pathway-Induced Multiple Kernel Learning , 2018, npj Systems Biology and Applications.

[17]  James T. Webber,et al.  Integration of Tumor Genomic Data with Cell Lines Using Multi-dimensional Network Modules Improves Cancer Pharmacogenomics. , 2018, Cell systems.

[18]  Ji‐Won Kim,et al.  BGJ398, A Pan-FGFR Inhibitor, Overcomes Paclitaxel Resistance in Urothelial Carcinoma with FGFR1 Overexpression , 2018, International journal of molecular sciences.

[19]  Xudong Lin,et al.  Deep Variational Metric Learning , 2018, ECCV.

[20]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection , 2018, J. Open Source Softw..

[21]  Tero Aittokallio,et al.  Machine learning and feature selection for drug response prediction in precision oncology applications , 2018, Biophysical Reviews.

[22]  Luis Tobalina,et al.  How to find the right drug for each patient? Advances and challenges in pharmacogenomics , 2018, Current opinion in systems biology.

[23]  Joshua M. Dempster,et al.  Genetic and transcriptional evolution alters cancer cell line drug response , 2018, Nature.

[24]  Age K. Smilde,et al.  iTOP: inferring the topology of omics data , 2018, bioRxiv.

[25]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[26]  Karsten M. Borgwardt,et al.  Kernelized rank learning for personalized drug recommendation , 2017, Bioinform..

[27]  Alioune Ngom,et al.  A review on machine learning principles for multi-view biological data integration , 2016, Briefings Bioinform..

[28]  P. Pourquier,et al.  [Genetic and transcriptional evolution alters cancer cell line drug response]. , 2018, Bulletin du cancer.

[29]  J. Ji,et al.  Dual PI3K/mTOR inhibitor BEZ235 as a promising therapeutic strategy against paclitaxel-resistant gastric cancer via targeting PI3K/Akt/mTOR pathway , 2018, Cell Death & Disease.

[30]  R. Grossman,et al.  Discovering novel pharmacogenomic biomarkers by imputing drug response in cancer patients from large genomics studies , 2017, Genome research.

[31]  Rameen Beroukhim,et al.  Patient-derived xenografts undergo murine-specific tumor evolution , 2017, Nature Genetics.

[32]  Geet Duggal,et al.  Salmon: fast and bias-aware quantification of transcript expression using dual-phase inference , 2017, Nature Methods.

[33]  Rob Patro,et al.  Salmon provides fast and bias-aware quantification of transcript expression , 2017, Nature Methods.

[34]  Jin Gu,et al.  Evaluating the molecule-based prediction of clinical drug responses in cancer , 2016, Bioinform..

[35]  Lodewyk F. A. Wessels,et al.  TANDEM: a two-stage approach to maximize interpretability of drug response models based on multiple molecular data types , 2016, Bioinform..

[36]  Emanuel J. V. Gonçalves,et al.  A Landscape of Pharmacogenomic Interactions in Cancer , 2016, Cell.

[37]  P. Chakravarty,et al.  ABCB1 (MDR1) induction defines a common resistance mechanism in paclitaxel- and olaparib-resistant ovarian cancer cells , 2016, British Journal of Cancer.

[38]  Måns Magnusson,et al.  MultiQC: summarize analysis results for multiple tools and samples in a single report , 2016, Bioinform..

[39]  Tero Aittokallio,et al.  Drug response prediction by inferring pathway-response associations with kernelized Bayesian matrix factorization , 2016, Bioinform..

[40]  Joshua M. Korn,et al.  High-throughput screening using patient-derived tumor xenografts to predict clinical trial drug response , 2015, Nature Medicine.

[41]  L. Hudson,et al.  R-Ketorolac Targets Cdc42 and Rac1 and Alters Ovarian Cancer Cell Behaviors Critical for Invasion and Metastasis , 2015, Molecular Cancer Therapeutics.

[42]  Joshy George,et al.  Whole–genome characterization of chemoresistant ovarian cancer , 2015, Nature.

[43]  Rama Chellappa,et al.  Unsupervised Adaptation Across Domain Shifts by Generating Intermediate Data Representations , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Sarah A Heerboth,et al.  Drug Resistance in Cancer: An Overview , 2014, Cancers.

[45]  Laura M. Heiser,et al.  A community effort to assess and improve drug sensitivity prediction algorithms , 2014, Nature Biotechnology.

[46]  F. De Filippis,et al.  A Selected Core Microbiome Drives the Early Stages of Three Popular Italian Cheese Manufactures , 2014, PloS one.

[47]  Clarence C Lee,et al.  Genomic and transcriptomic plasticity in treatment-naïve ovarian cancer , 2014, Genome research.

[48]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[49]  M. Ghert,et al.  Lost in translation: animal models and clinical trials in cancer treatment. , 2014, American journal of translational research.

[50]  Harald Binder,et al.  Transforming RNA-Seq Data to Improve the Performance of Prognostic Gene Signatures , 2014, PloS one.

[51]  Justin Guinney,et al.  Systematic Assessment of Analytical Methods for Drug Sensitivity Prediction from Cancer Cell Line Data , 2013, Pacific Symposium on Biocomputing.

[52]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[53]  N. Cox,et al.  Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines , 2014, Genome Biology.

[54]  P. Cui,et al.  Dynamic regulation of genome-wide pre-mRNA splicing and stress tolerance by the Sm-like protein LSm5 in Arabidopsis , 2014, Genome Biology.

[55]  Tinne Tuytelaars,et al.  Unsupervised Visual Domain Adaptation Using Subspace Alignment , 2013, 2013 IEEE International Conference on Computer Vision.

[56]  Nicolas Servant,et al.  A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis , 2013, Briefings Bioinform..

[57]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[58]  Jean-Pierre Gillet,et al.  The clinical relevance of cancer cell lines. , 2013, Journal of the National Cancer Institute.

[59]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[60]  Yuan Shi,et al.  Geodesic flow kernel for unsupervised domain adaptation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[61]  R. Bernards,et al.  Unresponsiveness of colon cancer to BRAF(V600E) inhibition through feedback activation of EGFR , 2012, Nature.

[62]  Jean-Pierre Gillet,et al.  Redefining the relevance of established cancer cell lines to the study of mechanisms of clinical anti-cancer drug resistance , 2011, Proceedings of the National Academy of Sciences.

[63]  Gaël Varoquaux,et al.  The NumPy Array: A Structure for Efficient Numerical Computation , 2011, Computing in Science & Engineering.

[64]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[65]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[66]  Koby Crammer,et al.  A theory of learning from different domains , 2010, Machine Learning.

[67]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[68]  S. Dooley,et al.  Y-box protein-1 is actively secreted through a non-classical pathway and acts as an extracellular mitogen , 2009, EMBO reports.

[69]  E. O’Reilly,et al.  Combination of human tumor necrosis factor-alpha (hTNF-α) gene delivery with gemcitabine is effective in models of pancreatic cancer , 2009, Cancer Gene Therapy.

[70]  Qiang Yang,et al.  Transfer Learning via Dimensionality Reduction , 2008, AAAI.

[71]  B. Schölkopf,et al.  Kernel methods in machine learning , 2007, math/0701907.

[72]  Y. Oda,et al.  Akt-dependent nuclear localization of Y-box-binding protein 1 in acquisition of malignant characteristics by human ovarian cancer cells , 2007, Oncogene.

[73]  Don R. Hush,et al.  An Explicit Description of the Reproducing Kernel Hilbert Spaces of Gaussian RBF Kernels , 2006, IEEE Transactions on Information Theory.

[74]  J. Au,et al.  Expression of Basic Fibroblast Growth Factor Correlates with Resistance to Paclitaxel in Human Patient Tumors , 2006, Pharmaceutical Research.

[75]  F. Bertucci,et al.  Gene expression profiling of breast cell lines identifies potential new basal markers , 2006, Oncogene.

[76]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[77]  A. Hall,et al.  Cdc42 controls the polarity of the actin and microtubule cytoskeletons through two distinct signal transduction pathways , 2005, Journal of Cell Science.

[78]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[79]  Susan Branford,et al.  Frequency of major molecular responses to imatinib or interferon alfa plus cytarabine in newly diagnosed chronic myeloid leukemia. , 2003, The New England journal of medicine.

[80]  A. Nicholson,et al.  Mutations of the BRAF gene in human cancer , 2002, Nature.

[81]  G. Mills,et al.  Inhibition of phosphatidylinositol 3'-kinase increases efficacy of paclitaxel in in vitro and in vivo ovarian cancer models. , 2002, Cancer research.

[82]  M. van Glabbeke,et al.  New guidelines to evaluate the response to treatment in solid tumors , 2000, Journal of the National Cancer Institute.

[83]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[84]  L. Denis European Organization for Research and Treatment of Cancer (EORTC) prostate cancer trials, 1976-1996. , 1998, Urology.

[85]  L. Goldstein MDR1 gene expression in solid tumours. , 1996, European journal of cancer.

[86]  N. Aronszajn Theory of Reproducing Kernels. , 1950 .

[87]  Paul Valéry,et al.  Mauvaises pensées et autres , 1948 .