Deep Docking: A Deep Learning Platform for Augmentation of Structure Based Drug Discovery

Drug discovery is a rigorous process that requires billion dollars of investments and decades of research to bring a molecule “from bench to a bedside”. While virtual docking can significantly accelerate the process of drug discovery, it ultimately lags the current rate of expansion of chemical databases that already exceed billions of molecular records. This recent surge of small molecules availability presents great drug discovery opportunities, but also demands much faster screening protocols. In order to address this challenge, we herein introduce Deep Docking (DD), a novel deep learning platform that is suitable for docking billions of molecular structures in a rapid, yet accurate fashion. The DD approach utilizes quantitative structure–activity relationship (QSAR) deep models trained on docking scores of subsets of a chemical library to approximate the docking outcome for yet unprocessed entries and, therefore, to remove unfavorable molecules in an iterative manner. The use of DD methodology in conjunction with the FRED docking program allowed rapid and accurate calculation of docking scores for 1.36 billion molecules from the ZINC15 library against 12 prominent target proteins and demonstrated up to 100-fold data reduction and 6000-fold enrichment of high scoring molecules (without notable loss of favorably docked entities). The DD protocol can readily be used in conjunction with any docking program and was made publicly available.

[1]  A. Cohen,et al.  Structure of the pentameric ligand-gated ion channel GLIC bound with anesthetic ketamine. , 2012, Structure.

[2]  E. Segala,et al.  Structures of Human A1 and A2A Adenosine Receptors with Xanthines Reveal Determinants of Selectivity. , 2017, Structure.

[3]  Artem Cherkasov,et al.  Best Practices of Computer-Aided Drug Discovery: Lessons Learned from the Development of a Preclinical Candidate for Prostate Cancer with a New Mechanism of Action , 2017, J. Chem. Inf. Model..

[4]  Brion W. Murray,et al.  Molecular conformations, interactions, and properties associated with drug efficiency and clinical performance among VEGFR TK inhibitors , 2012, Proceedings of the National Academy of Sciences.

[5]  Benjamin A. Ellingson,et al.  Conformer Generation with OMEGA: Algorithm and Validation Using High Quality Structures from the Protein Databank and Cambridge Structural Database , 2010, J. Chem. Inf. Model..

[6]  S. Rees,et al.  Principles of early drug discovery , 2011, British journal of pharmacology.

[7]  Jun Li,et al.  Structural basis of Nav1.7 inhibition by an isoform-selective small-molecule antagonist , 2015, Science.

[8]  J L Sussman,et al.  Protein Data Bank (PDB): database of three-dimensional structural information of biological macromolecules. , 1998, Acta crystallographica. Section D, Biological crystallography.

[9]  John J. Irwin,et al.  ZINC 15 – Ligand Discovery for Everyone , 2015, J. Chem. Inf. Model..

[10]  Michael M. Mysinger,et al.  Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking , 2012, Journal of medicinal chemistry.

[11]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[12]  Jürgen Bajorath,et al.  Current Trends, Overlooked Issues, and Unmet Challenges in Virtual Screening , 2020, J. Chem. Inf. Model..

[13]  Andreas Bender,et al.  A Discussion of Measures of Enrichment in Virtual Screening: Comparing the Information Content of Descriptors with Increasing Levels of Sophistication , 2005, J. Chem. Inf. Model..

[14]  R. W. Hansen,et al.  Journal of Health Economics , 2016 .

[15]  Tudor I. Oprea,et al.  A comprehensive map of molecular drug targets , 2016, Nature Reviews Drug Discovery.

[16]  Benoit Playe,et al.  Evaluation of deep and shallow learning methods in chemogenomics for the prediction of drugs specificity , 2020, Journal of Cheminformatics.

[17]  Mark McGann,et al.  FRED and HYBRID docking performance on standardized datasets , 2012, Journal of Computer-Aided Molecular Design.

[18]  Thomas Blaschke,et al.  The rise of deep learning in drug discovery. , 2018, Drug discovery today.

[19]  David A. Scott,et al.  An open-source drug discovery platform enables ultra-large virtual screens , 2020, Nature.

[20]  R. M. Walsh,et al.  Structure of a human synaptic GABA-A receptor , 2018, Nature.

[21]  Artem Cherkasov,et al.  Toxic Colors: The Use of Deep Learning for Predicting Toxicity of Compounds Merely from Their Graphic Images , 2018, J. Chem. Inf. Model..

[22]  Shigeyuki Yokoyama,et al.  Crystal Structure of the Ca2+/Calmodulin-dependent Protein Kinase Kinase in Complex with the Inhibitor STO-609* , 2011, The Journal of Biological Chemistry.

[23]  Hanne Grøn,et al.  Recognition and Accommodation at the Androgen Receptor Coactivator Binding Interface , 2004, PLoS biology.

[24]  John D. McCorvy,et al.  Virtual discovery of melatonin receptor ligands to modulate circadian rhythms , 2020, Nature.

[25]  C. Lipinski Lead- and drug-like compounds: the rule-of-five revolution. , 2004, Drug discovery today. Technologies.

[26]  Artem Cherkasov,et al.  A NEW APPROACH TO THE THEORETICAL ESTIMATION OF INDUCTIVE CONSTANTS , 1998 .

[27]  Vladimir Vovk,et al.  A tutorial on conformal prediction , 2007, J. Mach. Learn. Res..

[28]  Sébastien Boutet,et al.  Structure of the Angiotensin Receptor Revealed by Serial Femtosecond Crystallography , 2015, Cell.

[29]  Hui Zhang,et al.  Structural basis for ligand recognition of the human thromboxane A2 receptor , 2018, Nature Chemical Biology.

[30]  Patrick R Griffin,et al.  SR2067 Reveals a Unique Kinetic and Structural Signature for PPARγ Partial Agonism. , 2016, ACS chemical biology.

[31]  Ola Spjuth,et al.  Efficient iterative virtual screening with Apache Spark and conformal prediction , 2018, Journal of Cheminformatics.

[32]  Ping Chen,et al.  Spectrum and Degree of CDK Drug Interactions Predicts Clinical Performance , 2016, Molecular Cancer Therapeutics.

[33]  Andreas Bender,et al.  Improving Screening Efficiency through Iterative Screening Using Docking and Conformal Prediction , 2017, J. Chem. Inf. Model..

[34]  Zbigniew Dauter,et al.  Molecular basis of agonism and antagonism in the oestrogen receptor , 1997, Nature.

[35]  Yurii S. Moroz,et al.  Ultra-large library docking for discovering new chemotypes , 2019, Nature.