A Reinforcement Learning-Based Reward Mechanism for Molecule Generation that Introduces Activity Information

In this paper, we propose an activity prediction method for molecule generation based on the framework of reinforcement learning. The method is used as a scoring module for the molecule generation process. By introducing information about known active molecules for specific set of target conformations, it overcomes the traditional molecular optimization strategy where the method only uses computable properties. Eventually, our prediction method improves the quality of the generated molecules. The prediction method utilized fusion features that consist of traditional countable properties of molecules such as atomic number and the binding property of the molecule to the target. Furthermore, this paper designs a ultra large-scale molecular docking parallel computing method, which greatly improves the performance of the molecular docking [1] scoring process. The computing method makes the high-quality docking computing to predict molecular activity possible. The final experimental result shows that the molecule generation model using the prediction method can produce nearly twenty percent active molecules, which shows that the method proposed in this paper can effectively improve the performance of molecule generation.

[1]  David S. Goodsell,et al.  RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy , 2018, Nucleic Acids Res..

[2]  D. Diz,et al.  COVID-19, ACE2, and the cardiovascular consequences , 2020, American journal of physiology. Heart and circulatory physiology.

[3]  Richard J. Hall,et al.  Protein-Ligand Docking against Non-Native Protein Conformers , 2008, J. Chem. Inf. Model..

[4]  Thomas Blaschke,et al.  Molecular de-novo design through deep reinforcement learning , 2017, Journal of Cheminformatics.

[5]  Shunmugiah Karutha Pandian,et al.  Quinolines-Based SARS-CoV-2 3CLpro and RdRp Inhibitors and Spike-RBD-ACE2 Inhibitor for Drug-Repurposing Against COVID-19: An in silico Analysis , 2020, Frontiers in Microbiology.

[6]  Hualiang Jiang,et al.  Structure of Mpro from SARS-CoV-2 and discovery of its inhibitors , 2020, Nature.

[7]  Wenjie Zhu,et al.  The complex structure of GRL0617 and SARS-CoV-2 PLpro reveals a hot spot for antiviral drug discovery , 2021, Nature communications.

[8]  Jonas Boström,et al.  Deep Reinforcement Learning for Multiparameter Optimization in de novo Drug Design , 2019, J. Chem. Inf. Model..

[9]  Le Zhang,et al.  Progress in molecular docking , 2019, Quantitative Biology.

[10]  B. Munos Lessons from 60 years of pharmaceutical innovation , 2009, Nature Reviews Drug Discovery.

[11]  George Papadatos,et al.  The ChEMBL database in 2017 , 2016, Nucleic Acids Res..

[12]  Danzhi Huang,et al.  Hydrogen Bonding Penalty upon Ligand Binding , 2011, PloS one.

[13]  Kwok-Hung Chan,et al.  Improved Molecular Diagnosis of COVID-19 by the Novel, Highly Sensitive and Specific COVID-19-RdRp/Hel Real-Time Reverse Transcription-PCR Assay Validated In Vitro and with Clinical Specimens , 2020, Journal of Clinical Microbiology.

[14]  M. Cryle,et al.  X-domain of peptide synthetases recruits oxygenases crucial for glycopeptide biosynthesis , 2015, Nature.

[15]  Zhènglì Shí,et al.  The crystal structure of COVID-19 main protease in complex with an inhibitor N3 , 2020 .

[16]  Charles C. Persinger,et al.  How to improve R&D productivity: the pharmaceutical industry's grand challenge , 2010, Nature Reviews Drug Discovery.

[17]  Sarunas Raudys,et al.  Evolution and generalization of a single neurone: I. Single-layer perceptron as seven statistical classifiers , 1998, Neural Networks.

[18]  Bernardete Ribeiro,et al.  Diversity oriented Deep Reinforcement Learning for targeted molecule generation , 2021, Journal of Cheminformatics.

[19]  Olexandr Isayev,et al.  Deep reinforcement learning for de novo drug design , 2017, Science Advances.

[20]  Jacob D. Durrant,et al.  Molecular dynamics simulations and drug discovery , 2011, BMC Biology.

[21]  J. Takagi,et al.  Engineered ACE2 receptor therapy overcomes mutational escape of SARS-CoV-2 , 2021, Nature Communications.

[22]  Danushka Bollegala,et al.  DeepGraphMolGen, a multi-objective, computational strategy for generating molecules with desirable properties: a graph convolution and reinforcement learning approach , 2020, Journal of Cheminformatics.

[23]  A High-Throughput RNA Displacement Assay for Screening SARS-CoV-2 nsp10-nsp16 Complex toward Developing Therapeutics for COVID-19 , 2021, SLAS Discovery.

[24]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[25]  Dong-Sheng Cao,et al.  Artificial intelligence facilitates drug design in the big data era , 2019, Chemometrics and Intelligent Laboratory Systems.