Kinome-wide activity classification of small molecules by deep learning

Deep learning is a machine learning technique that attempts to model high-level abstractions in data by utilizing a graph composed of multiple processing layers that experience various linear and non-linear transformations. This technique has been shown to perform well for applications in drug discovery, utilizing structural features of small molecules to predict activity. However, the application of deep learning to discriminating features of kinase inhibitors has not been well explored. Small molecule kinase inhibitors are an important class of anti-cancer agents and have demonstrated impressive clinical efficacy in several different diseases. However, resistance is often observed mediated by adaptive Kinome reprogramming or subpopulation diversity. Therefore, polypharmacology and combination therapies offer potential therapeutic strategies for patients with resistant disease. Their development would benefit from more comprehensive and dense knowledge of small-molecule inhibition across the human Kinome. Because such data is not publicly available, we evaluated multiple machine learning methods to predict small molecule inhibition of 342 kinases using over 650K aggregated bioactivity annotations for over 300K small molecules curated from ChEMBL and the Kinase Knowledge Base (KKB). Our results demonstrated that multi-task deep neural networks outperform classical single-task methods, offering potential towards predicting activity profiles and filling gaps in the available data. TOC Graphic

[1]  Martin Romacker,et al.  Evolving BioAssay Ontology (BAO): modularization, integration and applications , 2014, Journal of Biomedical Semantics.

[2]  L. Urban,et al.  Major partial response to crizotinib, a dual MET/ALK inhibitor, in a squamous cell lung (SCC) carcinoma patient with de novo c-MET amplification in the absence of ALK rearrangement. , 2014, Lung cancer.

[3]  A. Chella,et al.  Response to erlotinib in a patient with lung adenocarcinoma harbouring the EML4-ALK translocation: A case report , 2015, Oncology letters.

[4]  George Papadatos,et al.  Evaluation of machine-learning methods for ligand-based virtual screening , 2007, J. Comput. Aided Mol. Des..

[5]  Nagi Ayad,et al.  Large-Scale Computational Screening Identifies First in Class Multitarget Inhibitor of EGFR Kinase and BRD4 , 2015, Scientific Reports.

[6]  John P. Overington,et al.  The ChEMBL database: a taster for medicinal chemists. , 2014, Future medicinal chemistry.

[7]  Andreas Mayr,et al.  Deep Learning as an Opportunity in Virtual Screening , 2015 .

[8]  Ravi Iyengar,et al.  The Library of Integrated Network-Based Cellular Signatures NIH Program: System-Level Cataloging of Human Cells Response to Perturbations. , 2017, Cell systems.

[9]  Vasileios Stathias,et al.  Sustainable data and metadata management at the BD2K-LINCS Data Coordination and Integration Center , 2018, Scientific Data.

[10]  P. Cohen Protein kinases — the major drug targets of the twenty-first century? , 2002, Nature reviews. Drug discovery.

[11]  C. Ruan,et al.  Deep molecular response by IFN-α and dasatinib combination in a patient with T315I-mutated chronic myeloid leukemia. , 2016, Pharmacogenomics.

[12]  K. Sauer,et al.  Mechanisms of drug resistance in kinases , 2011, Expert opinion on investigational drugs.

[13]  Jia Jia,et al.  Comparative analysis of machine learning methods in ligand-based virtual screening of large compound libraries. , 2009, Combinatorial chemistry & high throughput screening.

[14]  V. Canzonieri,et al.  Tumor response is predicted by patient genetic profile in rectal cancer patients treated with neo-adjuvant chemo-radiotherapy , 2011, The Pharmacogenomics Journal.

[15]  Stephan C. Schürer,et al.  Kinome-wide Activity Modeling from Diverse Public High-Quality Data Sets , 2013, J. Chem. Inf. Model..

[16]  George Papadatos,et al.  The ChEMBL bioactivity database: an update , 2013, Nucleic Acids Res..

[17]  David Wu,et al.  Durable Response to Tyrosine Kinase Inhibitor Therapy in a Lung Cancer Patient Harboring Epidermal Growth Factor Receptor Tandem Kinase Domain Duplication. , 2015, Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer.

[18]  Vasileios Stathias,et al.  Data Portal for the Library of Integrated Network-based Cellular Signatures (LINCS) program: integrated access to diverse large-scale cellular perturbation response data , 2017, Nucleic Acids Res..

[19]  Anthony E Klon Bayesian modeling in virtual high throughput screening. , 2009, Combinatorial chemistry & high throughput screening.

[20]  Vijay S. Pande,et al.  Massively Multitask Networks for Drug Discovery , 2015, ArXiv.

[21]  S. Muskal,et al.  High quality, small molecule-activity datasets for kinase research , 2016, F1000Research.

[22]  James S. Duncan,et al.  Inhibition of Lapatinib-Induced Kinome Reprogramming in ERBB2-Positive Breast Cancer by Targeting BET Family Bromodomains. , 2015, Cell reports.

[23]  N. James,et al.  Sorafenib Induces Therapeutic Response in a Patient with Metastatic Collecting Duct Carcinoma of Kidney , 2009, Oncology Research and Treatment.

[24]  Rajarshi Guha,et al.  Drug target ontology to classify and integrate drug discovery data , 2017, J. Biomed. Semant..

[25]  M. Clausen,et al.  Small-molecule kinase inhibitors: an analysis of FDA-approved drugs. , 2016, Drug discovery today.

[26]  Mindy I. Davis,et al.  Comprehensive analysis of kinase inhibitor selectivity , 2011, Nature Biotechnology.