Evening the Score: Targeting SARS-CoV-2 Protease Inhibition in Graph Generative Models for Therapeutic Candidates

We examine a pair of graph generative models for the therapeutic design of novel drug candidates targeting SARS-CoV-2 viral proteins. Due to a sense of urgency, we chose well-validated models with unique strengths: an autoencoder that generates molecules with similar structures to a dataset of drugs with anti-SARS activity and a reinforcement learning algorithm that generates highly novel molecules. During generation, we explore optimization toward several design targets to balance druglikeness, synthetic accessability, and anti-SARS activity based on IC50. This generative framework1 will accelerate drug discovery in future pandemics through the high-throughput generation of targeted therapeutic candidates.

[1]  J. Reymond The chemical space project. , 2015, Accounts of chemical research.

[2]  M. Waring Lipophilicity in drug discovery , 2010, Expert Opinion on Drug Discovery.

[3]  C. Lipinski Lead- and drug-like compounds: the rule-of-five revolution. , 2004, Drug discovery today. Technologies.

[4]  Barrie Wilkinson,et al.  Drug discovery beyond the 'rule-of-five'. , 2007, Current opinion in biotechnology.

[5]  Peter C. Jurs,et al.  Prediction of IC50 Values for ACAT Inhibitors from Molecular Structure , 2000, J. Chem. Inf. Comput. Sci..

[6]  Kar Wai Lim,et al.  Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models , 2020, NeurIPS 2020.

[7]  J L Sebaugh,et al.  Guidelines for accurate EC50/IC50 estimation , 2011, Pharmaceutical statistics.

[8]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[9]  Peter Ertl,et al.  Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions , 2009, J. Cheminformatics.

[10]  Benjamin J. Polacco,et al.  A SARS-CoV-2 Protein Interaction Map Reveals Targets for Drug-Repurposing , 2020, Nature.

[11]  Yibo Li,et al.  Multi-objective de novo drug design with conditional graph generative model , 2018, Journal of Cheminformatics.

[12]  Artem Cherkasov,et al.  SimBoost: a read-across approach for predicting drug–target binding affinities using gradient boosting machines , 2017, Journal of Cheminformatics.

[13]  A. Kumari,et al.  Identification of potential molecules against COVID-19 main protease through structure-guided virtual screening approach , 2020, Journal of biomolecular structure & dynamics.

[14]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[15]  Alán Aspuru-Guzik,et al.  A machine learning workflow for molecular analysis: application to melting points , 2020, Mach. Learn. Sci. Technol..

[16]  Jimeng Sun,et al.  DeepPurpose: a Deep Learning Based Drug Repurposing Toolkit , 2020, ArXiv.

[17]  Jure Leskovec,et al.  Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation , 2018, NeurIPS.

[18]  G. V. Paolini,et al.  Quantifying the chemical beauty of drugs. , 2012, Nature chemistry.

[19]  Identification of key interactions between SARS-CoV-2 main protease and inhibitor drug candidates , 2020, Scientific Reports.

[20]  Shayakhmetov Rim,et al.  Potential COVID-2019 3C-like Protease Inhibitors Designed Using Generative Deep Learning Approaches , 2020 .

[21]  Lixia Chen,et al.  Analysis of therapeutic targets for SARS-CoV-2 and discovery of potential drugs by computational methods , 2020, Acta Pharmaceutica Sinica B.

[22]  Jonas Boström,et al.  Deep Reinforcement Learning for Multiparameter Optimization in de novo Drug Design , 2019, J. Chem. Inf. Model..

[23]  Charlotte Harrison,et al.  Coronavirus puts drug repurposing on the fast track , 2020, Nature Biotechnology.

[24]  Jeremy C. Smith,et al.  Repurposing Therapeutics for COVID-19: Supercomputer-Based Docking to the SARS-CoV-2 Viral Spike Protein and Viral Spike Protein-Human ACE2 Interface , 2020 .

[25]  Dong Xu,et al.  AI-aided design of novel targeted covalent inhibitors against SARS-CoV-2 , 2020, bioRxiv.

[26]  Li Li,et al.  Optimization of Molecules via Deep Reinforcement Learning , 2018, Scientific Reports.

[27]  Fan Hu,et al.  Generating Novel Compounds Targeting SARS-CoV-2 Main Protease Based on Imbalanced Dataset , 2020, 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[28]  Arijit Bag,et al.  Development of Quantum Chemical Method to Calculate Half Maximal Inhibitory Concentration (IC50) , 2016, Molecular informatics.

[29]  Ashutosh Kumar,et al.  Advances in the Development of Shape Similarity Methods and Their Application in Drug Discovery , 2018, Front. Chem..

[30]  Neeraj Kumar,et al.  Artificial Intelligence based Autonomous Molecular Design for Medical Therapeutic: A Perspective , 2021, ArXiv.

[31]  Regina Barzilay,et al.  Junction Tree Variational Autoencoder for Molecular Graph Generation , 2018, ICML.

[32]  Samuel S. Schoenholz,et al.  Neural Message Passing for Quantum Chemistry , 2017, ICML.

[33]  George Papadatos,et al.  The ChEMBL bioactivity database: an update , 2013, Nucleic Acids Res..

[34]  Károly Héberger,et al.  Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? , 2015, Journal of Cheminformatics.

[35]  Rampi Ramprasad,et al.  Screening of Therapeutic Agents for COVID-19 Using Machine Learning and Ensemble Docking Studies , 2020, The journal of physical chemistry letters.

[36]  I. Halil Kavakli,et al.  Classification of drug molecules considering their IC50 values using mixed-integer linear programming based hyper-boxes method , 2008, BMC Bioinformatics.

[37]  Niloy Ganguly,et al.  NeVAE: A Deep Generative Model for Molecular Graphs , 2018, AAAI.

[38]  Regina Barzilay,et al.  Multi-Objective Molecule Generation using Interpretable Substructures , 2020, ICML.

[39]  G. Gyebi,et al.  Potential inhibitors of coronavirus 3-chymotrypsin-like protease (3CLpro): an in silico screening of alkaloids and terpenoids from African medicinal plants , 2020, Journal of biomolecular structure & dynamics.

[40]  Ah Chung Tsoi,et al.  The Graph Neural Network Model , 2009, IEEE Transactions on Neural Networks.

[41]  David Weininger,et al.  SMILES, 3. DEPICT. Graphical depiction of chemical structures , 1990, J. Chem. Inf. Comput. Sci..

[42]  Matteo Manica,et al.  PaccMannRL on SARS-CoV-2: Designing antiviral candidates with conditional generative models , 2020, ArXiv.

[43]  Danushka Bollegala,et al.  DeepGraphMol, a multi-objective, computational strategy for generating molecules with desirable properties: a graph convolution and reinforcement learning approach , 2020, bioRxiv.

[44]  Esben Jannik Bjerrum,et al.  SMILES Enumeration as Data Augmentation for Neural Network Modeling of Molecules , 2017, ArXiv.

[45]  Michael F. Crowley,et al.  Message-passing neural networks for high-throughput polymer screening , 2018, The Journal of chemical physics.

[46]  Kwok-Yin Wong,et al.  Prediction of the SARS-CoV-2 (2019-nCoV) 3C-like protease (3CL pro) structure: virtual screening reveals velpatasvir, ledipasvir, and other drug repurposing candidates. , 2020, F1000Research.

[47]  Marcin J. Skwark,et al.  Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning , 2020, ArXiv.

[48]  Potential Therapeutic Agents for COVID-19 Based on the Analysis of Protease and RNA Polymerase Docking , 2020 .