Simultaneously improving reaction coverage and computational cost in automated reaction prediction tasks

Automated reaction prediction has the potential to elucidate complex reaction networks for applications ranging from combustion to materials degradation, but computational cost and inconsistent reaction coverage remain obstacles to exploring deep reaction networks. Here we show that cost can be reduced and reaction coverage increased simultaneously through relatively straightforward modifications of the reaction enumeration, geometry initialization, and transition state convergence algorithms that are common to many prediction methodologies. These components are implemented in Yet Another Reaction Program (YARP), our reaction prediction package, with which we report reaction discovery benchmarks for organic single-step reactions, thermal degradation of a γ-ketohydroperoxide, and competing ring-closures in a large organic molecule. Compared with recent benchmarks, YARP (re)discovers both established and unreported reaction pathways and products while reducing the cost of reaction characterization by nearly 100-fold and increasing transition state convergence. This combination of ultra-low cost and high reaction coverage creates opportunities to explore the reactivity of larger systems and more complex reaction networks for applications such as chemical degradation, where computational cost is a bottleneck. This work demonstrates that large gains are still available in accelerating and improving the coverage of reaction prediction algorithms.
