Combining generative artificial intelligence and on-chip synthesis for de novo drug design

Generative deep learning and miniaturized bench-top synthesis were combined to automate the design of novel LXR agonists. Automating the molecular design-make-test-analyze cycle accelerates hit and lead finding for drug discovery. Using deep learning for molecular design and a microfluidics platform for on-chip chemical synthesis, liver X receptor (LXR) agonists were generated from scratch. The computational pipeline was tuned to explore the chemical space of known LXRα agonists and generate novel molecular candidates. To ensure compatibility with automated on-chip synthesis, the chemical space was confined to the virtual products obtainable from 17 one-step reactions. Twenty-five de novo designs were successfully synthesized in flow. In vitro screening of the crude reaction products revealed 17 (68%) hits, with up to 60-fold LXR activation. The batch resynthesis, purification, and retesting of 14 of these compounds confirmed that 12 of them were potent LXR agonists. These results support the suitability of the proposed design-make-test-analyze framework as a blueprint for automated drug design with artificial intelligence and miniaturized bench-top synthesis.

[1]  Thorsten Meinl,et al.  KNIME - the Konstanz information miner: version 2.0 and beyond , 2009, SKDD.

[2]  Petra Schneider,et al.  Multi-objective molecular de novo design by adaptive fragment prioritization. , 2014, Angewandte Chemie.

[3]  Daniel Moser,et al.  DrugBank screening revealed alitretinoin and bexarotene as liver X receptor modulators. , 2017, Bioorganic & medicinal chemistry letters.

[4]  Riccardo Petraglia,et al.  Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy† , 2020, Chemical science.

[5]  F. von Delft,et al.  Rapid optimisation of fragments and hits to lead compounds from screening of crude reaction mixtures , 2020, Communications Chemistry.

[6]  Tim Chapman Lab automation and robotics: Automation on the move , 2003, Nature.

[7]  Leroy Cronin,et al.  Organic synthesis in a modular robotic system driven by a chemical programming language , 2019, Science.

[8]  Mike Preuss,et al.  Planning chemical syntheses with deep neural networks and symbolic AI , 2017, Nature.

[9]  Ken E. Whelan,et al.  The Automation of Science , 2009, Science.

[10]  Alexander L. Button,et al.  Combining generative artificial intelligence and on-chip synthesis for de novo drug design , 2020, Science Advances.

[11]  Alexander J. Lawson,et al.  The Making of Reaxys—Towards Unobstructed Access to Relevant Chemistry Information , 2014 .

[12]  Francesca Grisoni,et al.  Bidirectional Molecule Generation with Recurrent Neural Networks , 2020, J. Chem. Inf. Model..

[13]  Gisbert Schneider,et al.  Synthetic Activators of Cell Migration Designed by Constructive Machine Learning , 2019, ChemistryOpen.

[14]  Elizabeth Farrant,et al.  Rapid discovery of a novel series of Abl kinase inhibitors by application of an integrated microfluidic synthesis and screening platform. , 2013, Journal of medicinal chemistry.

[15]  S. Takizawa,et al.  Exploration of flow reaction conditions using machine-learning for enantioselective organocatalyzed Rauhut-Currier and [3+2] annulation sequence. , 2020, Chemical communications.

[16]  Alán Aspuru-Guzik,et al.  Deep learning enables rapid identification of potent DDR1 kinase inhibitors , 2019, Nature Biotechnology.

[17]  J. Medina,et al.  Discovery and optimization of a novel series of liver X receptor-α agonists , 2006 .

[18]  Leroy Cronin,et al.  Controlling an organic synthesis robot with machine learning to search for new reactivity , 2018, Nature.

[19]  Noel M. O'Boyle Towards a Universal SMILES representation - A standard method to generate canonical SMILES based on the InChI , 2012, Journal of Cheminformatics.

[20]  Davide Ballabio,et al.  NURA: A curated dataset of nuclear receptor modulators. , 2020, Toxicology and applied pharmacology.

[21]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[22]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[23]  Adam Nelson,et al.  Streamlining bioactive molecular discovery through integration and automation , 2018, Nature Reviews Chemistry.

[24]  M. Makishima,et al.  Design, synthesis, and biological evaluation of novel transrepression-selective liver X receptor (LXR) ligands with 5,11-dihydro-5-methyl-11-methylene-6H-dibenz[b,e]azepin-6-one skeleton. , 2012, Journal of medicinal chemistry.

[25]  Thomas Blaschke,et al.  Molecular de-novo design through deep reinforcement learning , 2017, Journal of Cheminformatics.

[26]  Richmond Sarpong,et al.  Automation and computer-assisted planning for chemical synthesis , 2021, Nature Reviews Methods Primers.

[27]  Ross McGuire,et al.  X-ray structures of the LXRalpha LBD in its homodimeric form and implications for heterodimer signaling. , 2010, Journal of molecular biology.

[28]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[29]  Evan Bolton,et al.  PubChem 2019 update: improved access to chemical data , 2018, Nucleic Acids Res..

[30]  R. A. Thompson,et al.  Hypothesis driven drug design: improving quality and effectiveness of the design-make-test-analyse cycle. , 2012, Drug discovery today.

[31]  Gisbert Schneider,et al.  Automated de novo molecular design by hybrid machine intelligence and rule-driven chemical synthesis , 2019, Nature Machine Intelligence.

[32]  Kenta Hongo,et al.  Bayesian molecular design with a chemical language model , 2017, Journal of Computer-Aided Molecular Design.

[33]  Petra Schneider,et al.  Identifying the macromolecular targets of de novo-designed chemical entities through self-organizing map consensus , 2014, Proceedings of the National Academy of Sciences.

[34]  Gisbert Schneider,et al.  Automating drug discovery , 2017, Nature Reviews Drug Discovery.

[35]  J. Collins,et al.  Therapeutic opportunities for liver X receptor modulators. , 2004, Current opinion in drug discovery & development.

[36]  N. Chandra,et al.  Computational antimicrobial peptide design and evaluation against multidrug-resistant clinical isolates of bacteria , 2017, The Journal of Biological Chemistry.

[37]  Andrew R. Leach,et al.  ChEMBL: towards direct deposition of bioassay data , 2018, Nucleic Acids Res..

[38]  Petra Schneider,et al.  Accessing new chemical entities through microfluidic systems. , 2014, Angewandte Chemie.

[39]  Pieter P. Plehiers,et al.  A robotic platform for flow synthesis of organic compounds informed by AI planning , 2019, Science.

[40]  Sherry Sun,et al.  The Three-dimensional Structure of the Liver X Receptor β Reveals a Flexible Ligand-binding Pocket That Can Accommodate Fundamentally Different Ligands* , 2003, Journal of Biological Chemistry.

[41]  Ola Engkvist,et al.  AI-assisted synthesis prediction. , 2019, Drug discovery today. Technologies.

[42]  Richard D. Taylor,et al.  Improved protein–ligand docking using GOLD , 2003, Proteins.

[43]  Peter Tontonoz,et al.  Liver X receptors in lipid metabolism: opportunities for drug discovery , 2014, Nature Reviews Drug Discovery.

[44]  Gisbert Schneider,et al.  De Novo Design of Bioactive Small Molecules by Artificial Intelligence , 2018, Molecular informatics.

[45]  George Papadatos,et al.  SureChEMBL: a large-scale, chemically annotated patent document database , 2015, Nucleic Acids Res..

[46]  Gisbert Schneider,et al.  Tuning artificial intelligence on the de novo design of natural-product-inspired retinoid X receptor modulators , 2018, Communications Chemistry.

[47]  Gisbert Schneider,et al.  Active-learning strategies in computer-assisted drug discovery. , 2015, Drug discovery today.

[48]  Petra Schneider,et al.  Chemically Advanced Template Search (CATS) for Scaffold-Hopping and Prospective Target Prediction for ‘Orphan’ Molecules , 2013, Molecular informatics.

[49]  G. Bemis,et al.  The properties of known drugs. 1. Molecular frameworks. , 1996, Journal of medicinal chemistry.

[50]  Adam Nelson,et al.  Efficient discovery of bioactive scaffolds by activity-directed synthesis , 2014, Nature Chemistry.

[51]  T. Burris,et al.  Recent Advances in the Medicinal Chemistry of Liver X Receptors. , 2018, Journal of medicinal chemistry.

[52]  Xingyu Jiang,et al.  Why microfluidics? Merits and trends in chemical synthesis. , 2017, Lab on a chip.

[53]  Kristian Birchall,et al.  Design, Synthesis, and Testing of Potent, Selective Hepsin Inhibitors via Application of an Automated Closed-Loop Optimization Platform. , 2018, Journal of medicinal chemistry.

[54]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[55]  Thierry Kogej,et al.  Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks , 2017, ACS central science.

[56]  P. Liu,et al.  E17110 promotes reverse cholesterol transport with liver X receptor β agonist activity in vitro , 2016, Acta pharmaceutica Sinica. B.

[57]  Robert Tibshirani,et al.  Chemical Space Mimicry for Drug Discovery , 2017, J. Chem. Inf. Model..

[58]  Hui Li,et al.  Liver X Receptor Agonist Therapy Prevents Diffuse Alveolar Hemorrhage in Murine Lupus by Repolarizing Macrophages , 2018, Front. Immunol..

[59]  Petra Schneider,et al.  Generative Recurrent Networks for De Novo Drug Design , 2017, Molecular informatics.

[60]  Gunnar Rätsch,et al.  Active Learning with Support Vector Machines in the Drug Discovery Process , 2003, J. Chem. Inf. Comput. Sci..

[61]  I. Davies The digitization of organic synthesis , 2019, Nature.

[62]  Francesca Grisoni,et al.  Generative molecular design in low data regimes , 2020, Nature Machine Intelligence.

[63]  Alán Aspuru-Guzik,et al.  Inverse molecular design using machine learning: Generative models for matter engineering , 2018, Science.

[64]  Jean-Louis Reymond,et al.  Drug Analogs from Fragment-Based Long Short-Term Memory Generative Neural Networks , 2018, J. Chem. Inf. Model..