Evolutionary Retrosynthetic Route Planning [Research Frontier]

Molecular retrosynthesis is a significant and complex problem in the field of chemistry, however, traditional manual synthesis methods not only need well-trained experts but also are time-consuming. With the development of Big Data and machine learning, artificial intelligence (AI) based retrosynthesis is attracting more attention and has become a valuable tool for molecular retrosynthesis. At present, Monte Carlo tree search is a mainstream search framework employed to address this problem. Nevertheless, its search efficiency is compromised by its large search space. Therefore, this paper proposes a novel approach for retrosynthetic route planning based on evolutionary optimization, marking the first use of Evolutionary Algorithm (EA) in the field of multi-step retrosynthesis. The proposed method involves modeling the retrosynthetic problem into an optimization problem, defining the search space and operators. Additionally, to improve the search efficiency, a parallel strategy is implemented. The new approach is applied to four case products and compared with Monte Carlo tree search. The experimental results show that, in comparison to the Monte Carlo tree search algorithm, EA significantly reduces the number of calling single-step model by an average of 53.9%. The time required to search three solutions decreases by an average of 83.9%, and the number of feasible search routes increases by 1.38 times. The source code is available at https://github.com/ilog-ecnu/EvoRRP.

[1]  Haipeng Yang,et al.  A Macro-Micro Population-Based Co-Evolutionary Multi-Objective Algorithm for Community Detection in Complex Networks [Research Frontier] , 2023, IEEE Computational Intelligence Magazine.

[2]  Yongna Yao Bayesian network model structure based on binary evolutionary algorithm , 2023, PeerJ Comput. Sci..

[3]  Aimin Zhou,et al.  Expensive Multiobjective Optimization by Relation Learning and Prediction , 2022, IEEE Transactions on Evolutionary Computation.

[4]  J. Goodman,et al.  A review of molecular representation in the age of machine learning , 2022, WIREs Computational Molecular Science.

[5]  Junzhou Huang,et al.  RetroComposer: Composing Templates for Template-Based Retrosynthesis Prediction , 2021, Biomolecules.

[6]  Connor W. Coley,et al.  Permutation invariant graph-to-sequence model for template-free retrosynthesis and reaction prediction , 2021, J. Chem. Inf. Model..

[7]  Shuan Chen,et al.  Deep Retrosynthetic Reaction Prediction using Local Reactivity and Global Attention , 2021, JACS Au.

[8]  Sandi Baressi Segota,et al.  Utilization of multilayer perceptron for determining the inverse kinematics of an industrial robotic manipulator , 2021, International Journal of Advanced Robotic Systems.

[9]  Jinwoo Shin,et al.  GTA: Graph Truncated Attention for Retrosynthesis , 2021, AAAI.

[10]  Huanxiang Liu,et al.  RetroPrime: A Diverse, Plausible and Transformer-based Method for Single-Step Retrosynthesis Predictions , 2021 .

[11]  Regina Barzilay,et al.  Towards efficient discovery of green synthetic pathways with Monte Carlo tree search and reinforcement learning , 2020, Chemical science.

[12]  Bo Dai,et al.  Towards understanding retrosynthesis by energy-based models , 2020, NeurIPS.

[13]  Le Song,et al.  Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search , 2020, ICML.

[14]  Stanislaw Jastrzebski,et al.  Molecule Edit Graph Attention Network: Modeling Chemical Reactions as Sequences of Graph Edits , 2020, J. Chem. Inf. Model..

[15]  Connor W. Coley,et al.  Learning Graph Models for Retrosynthesis Prediction , 2020, NeurIPS.

[16]  Yang Yu,et al.  RetroXpert: Decompose Retrosynthesis Prediction like a Chemist , 2020, NeurIPS.

[17]  Jian Tang,et al.  A Graph to Graphs Framework for Retrosynthesis Prediction , 2020, ICML.

[18]  I. Tetko,et al.  State-of-the-art augmented NLP transformer models for direct and single-step retrosynthesis , 2020, Nature Communications.

[19]  Jianfeng Pei,et al.  Automatic retrosynthetic route planning using template-free models , 2020, Chemical science.

[20]  Brian C. Barnes,et al.  Data Augmentation and Pretraining for Template-Based Retrosynthetic Prediction in Computer-Aided Synthesis Planning , 2020, J. Chem. Inf. Model..

[21]  Le Song,et al.  Retrosynthesis Prediction with Conditional Graph Logic Network , 2020, NeurIPS.

[22]  Jun Xu,et al.  Predicting Retrosynthetic Reactions Using Self-Corrected Transformer Neural Networks , 2019, J. Chem. Inf. Model..

[23]  Ulises Orozco-Rosas,et al.  Hybrid Path Planning Algorithm Based on Membrane Pseudo-Bacterial Potential Field for Autonomous Mobile Robots , 2019, IEEE Access.

[24]  Junzhou Huang,et al.  Molecular Graph Enhanced Transformer for Retrosynthesis Prediction , 2019, bioRxiv.

[25]  Regina Barzilay,et al.  Learning to Make Generalizable and Diverse Predictions for Retrosynthesis , 2019, ArXiv.

[26]  Pieter P. Plehiers,et al.  A robotic platform for flow synthesis of organic compounds informed by AI planning , 2019, Science.

[27]  Luhua Lai,et al.  Novel Allosteric Activators for Ferroptosis Regulator Glutathione Peroxidase 4. , 2019, Journal of medicinal chemistry.

[28]  Piotr Dittwald,et al.  Efficient Syntheses of Diverse, Medicinally Relevant Targets Planned by Computer and Executed in the Laboratory , 2018 .

[29]  Connor W. Coley,et al.  SCScore: Synthetic Complexity Learned from a Reaction Corpus , 2018, J. Chem. Inf. Model..

[30]  William H. Green,et al.  Computer-Assisted Retrosynthesis Based on Molecular Similarity , 2017, ACS central science.

[31]  Asher Mullard,et al.  The drug-maker's guide to the galaxy , 2017, Nature.

[32]  Mike Preuss,et al.  Planning chemical syntheses with deep neural networks and symbolic AI , 2017, Nature.

[33]  Bowen Liu,et al.  Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models , 2017, ACS central science.

[34]  Marwin H. S. Segler,et al.  Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction. , 2017, Chemistry.

[35]  Nikolaus Hansen,et al.  The CMA Evolution Strategy: A Tutorial , 2016, ArXiv.

[36]  R. Sabourin,et al.  Bayesian network as an adaptive parameter setting approach for genetic algorithms , 2016, Complex & Intelligent Systems.

[37]  Qingfu Zhang,et al.  An Estimation of Distribution Algorithm With Cheap and Expensive Local Search Methods , 2015, IEEE Transactions on Evolutionary Computation.

[38]  Igor Jurisica,et al.  Construction of New Medicines via Game Proof Search , 2012, AAAI.

[39]  Adam Lipowski,et al.  Roulette-wheel selection via stochastic acceptance , 2011, ArXiv.

[40]  Pedro Larrañaga,et al.  Learning Factorizations in Estimation of Distribution Algorithms Using Affinity Propagation , 2010, Evolutionary Computation.

[41]  H. Jaap van den Herik,et al.  Parallel Monte-Carlo Tree Search , 2008, Computers and Games.

[42]  T. Back,et al.  Evolutionary algorithms for real world applications [Application Notes] , 2008, IEEE Computational Intelligence Magazine.

[43]  David E. Goldberg,et al.  Multiobjective hBOA, clustering, and scalability , 2005, GECCO '05.

[44]  Petros Koumoutsakos,et al.  A Mixed Bayesian Optimization Algorithm with Variance Adaptation , 2004, PPSN.

[45]  Marco Laumanns,et al.  Bayesian Optimization Algorithms for Multi-objective Optimization , 2002, PPSN.

[46]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[47]  L. Weber Multi-component reactions and evolutionary chemistry. , 2002, Drug discovery today.

[48]  Pedro Larraanaga,et al.  Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation , 2001 .

[49]  David E. Goldberg,et al.  Linkage Problem, Distribution Estimation, and Bayesian Networks , 2000, Evolutionary Computation.

[50]  Xin Yao,et al.  Evolutionary programming made faster , 1999, IEEE Trans. Evol. Comput..

[51]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[52]  Thomas Bck,et al.  Evolutionary computation: Toward a new philosophy of machine intelligence , 1997, Complex..

[53]  Paul A. Viola,et al.  MIMIC: Finding Optima by Estimating Probability Densities , 1996, NIPS.

[54]  H. Mühlenbein,et al.  From Recombination of Genes to the Estimation of Distributions I. Binary Parameters , 1996, PPSN.

[55]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[56]  D. Geiger,et al.  Learning Bayesian networks: The combination of knowledge and statistical data , 1994, Machine Learning.

[57]  John R. Koza,et al.  Genetic programming as a means for programming computers by natural selection , 1994 .

[58]  S. Baluja Population-Based Incremental Learning: A Method for Integrating Genetic Search Based Function Optimization and Competitive Learning , 1994 .

[59]  Hans-Paul Schwefel,et al.  Evolution and Optimum Seeking: The Sixth Generation , 1993 .

[60]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[61]  Aimin Zhou,et al.  Regularity Evolution for Multiobjective Optimization , 2023, IEEE Transactions on Evolutionary Computation.

[62]  Adrià Cereto-Massagué,et al.  Molecular fingerprint similarity search in virtual screening. , 2015, Methods.

[63]  Thomas Bäck,et al.  Evolutionary Algorithms for Real World Applications , 2008 .

[64]  Euntai Kim,et al.  Structure Learning of Bayesian Networks Using Dual Genetic Algorithm , 2008, IEICE Trans. Inf. Syst..

[65]  David E. Goldberg,et al.  Hierarchical Bayesian Optimization Algorithm , 2006, Scalable Optimization via Probabilistic Modeling.

[66]  Dirk Thierens,et al.  Multi-objective Optimization with the Naive MIDEA , 2006 .

[67]  Yaochu Jin,et al.  Connectedness, regularity and the success of local search in evolutionary multi-objective optimization , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[68]  M. Pelikán,et al.  The Bivariate Marginal Distribution Algorithm , 1999 .

[69]  Qingfu Zhang,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION 1 RM-MEDA: A Regularity Model-Based Multiobjective Estimation of , 2022 .