Drug Discovery Maps, a Machine Learning Model That Visualizes and Predicts Kinome–Inhibitor Interaction Landscapes

Interpreting high-dimensional structure–activity data sets in drug discovery to predict ligand–protein interaction landscapes is a challenging task. Here we present Drug Discovery Maps (DDM), a machine learning model that maps the activity profile of compounds across an entire protein family, illustrated here for the kinase family. DDM uses the t-distributed stochastic neighbor embedding (t-SNE) algorithm to visualize molecular and biological similarity. It maps chemical and target space and predicts the activities of novel kinase inhibitors across the kinome. The model was validated on independent data sets and in a prospective experimental setting, where DDM predicted new inhibitors for FMS-like tyrosine kinase 3 (FLT3), a therapeutic target for the treatment of acute myeloid leukemia. The compounds were resynthesized, yielding highly potent, cellularly active FLT3 inhibitors, and biochemical assays confirmed most of the predicted off-targets. DDM is also completely open-source and available as a ready-to-use executable to facilitate broad and easy adoption.
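The core idea of placing compounds on a two-dimensional map by molecular similarity can be illustrated with a minimal sketch. This is not the DDM implementation. It assumes binary fingerprints (randomly generated here as stand-ins for real molecular fingerprints), Tanimoto distance as the dissimilarity measure, and scikit-learn's `TSNE`. The function names `tanimoto_distance_matrix` and `embed_compounds` and all parameter values are hypothetical.

```python
import numpy as np
from sklearn.manifold import TSNE


def tanimoto_distance_matrix(fps: np.ndarray) -> np.ndarray:
    """Pairwise Tanimoto (Jaccard) distances between binary fingerprint rows."""
    fps = fps.astype(bool)
    # Count shared "on" bits and total "on" bits for every pair of rows.
    intersection = (fps[:, None, :] & fps[None, :, :]).sum(axis=2)
    union = (fps[:, None, :] | fps[None, :, :]).sum(axis=2)
    # Two all-zero fingerprints are treated as identical (similarity 1).
    similarity = np.where(union > 0, intersection / np.maximum(union, 1), 1.0)
    return 1.0 - similarity


def embed_compounds(fps: np.ndarray, perplexity: float = 10.0, seed: int = 0) -> np.ndarray:
    """Project compounds to 2D with t-SNE on a precomputed distance matrix."""
    distances = tanimoto_distance_matrix(fps)
    tsne = TSNE(
        n_components=2,
        metric="precomputed",  # we pass distances, not raw features
        init="random",         # required when metric="precomputed"
        perplexity=perplexity,
        random_state=seed,
    )
    return tsne.fit_transform(distances)


# Assumption: 60 random 128-bit fingerprints stand in for real compounds.
rng = np.random.default_rng(0)
toy_fps = rng.integers(0, 2, size=(60, 128))
coords = embed_compounds(toy_fps)
print(coords.shape)  # (60, 2): one x/y position per compound
```

With real fingerprints, compounds that are close on the resulting map tend to be structurally similar, so activity labels for known inhibitors could be overlaid on the coordinates to look for neighborhoods enriched in actives against a given kinase.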
