Predictive Models for Kinetic Parameters of Cycloaddition Reactions

This paper reports SVR (Support Vector Regression) and GTM (Generative Topographic Mapping) modeling of three kinetic properties of cycloaddition reactions: rate constant (logk), activation energy (Ea) and pre‐exponential factor (logA). A data set of 1849 reactions, comprising (4+2), (3+2) and (2+2) cycloadditions (CA) were studied in different solvents and at different temperatures. The reactions were encoded by the ISIDA fragment descriptors generated for Condensed Graph of Reaction (CGR). For a given reaction, a CGR condenses structures of all the reactants and products into one single molecular graph, described both by conventional chemical bonds and “dynamical” bonds characterizing chemical transformations. Different scenarios of logk assessment were exploited: direct modeling, application of the Arrhenius equation and temperature‐scaled GTM landscapes. The logk models with optimal cross‐validated statistics (Q2=0.78–0.94 RMSE=0.45–0.86) have been challenged to predict rates for the external test set of 200 reactions, comprising both reactions that were not present in the training set, and training set transformations performed under different reaction conditions. The models are freely available on our web‐server: http://cimm.kpfu.ru/models.

[1]  Gebräuchliche Fertigarzneimittel,et al.  V , 1893, Therapielexikon Neurologie.

[2]  K. Houk Frontier molecular orbital theory of cycloaddition reactions , 1975 .

[3]  R. Taft,et al.  The solvatochromic comparison method. I. The .beta.-scale of solvent hydrogen-bond acceptor (HBA) basicities , 1976 .

[4]  R. Taft,et al.  The solvatochromic comparison method. 2. The .alpha.-scale of solvent hydrogen-bond donor (HBD) acidities , 1976 .

[5]  R. Taft,et al.  The solvatochromic comparison method. 6. The .pi.* scale of solvent polarities , 1977 .

[6]  L. Overman,et al.  Modeling chemical reactivity. 1. Regioselectivity of Diels-Alder cycloadditions of electron-rich dienes with electron-deficient dienophiles , 1986 .

[7]  Johann Gasteiger,et al.  Computer-assisted prediction of the degradation of chemicals: hydrolysis of amides and benzoylphenylureas , 1995 .

[8]  Javier Catalán,et al.  Progress towards a generalized solvent polarity scale: The solvatochromism of 2‐(dimethylamino)‐7‐nitrofluorene and its homomorph 2‐fluoro‐7‐nitrofluorene , 1995 .

[9]  Javier Catalán,et al.  A Generalized Solvent Basicity Scale: The Solvatochromism of 5‐Nitroindoline and Its Homomorph 1‐Methyl‐5‐nitroindoline , 1996 .

[10]  Javier Catalán,et al.  A Generalized Solvent Acidity Scale: The Solvatochromism of o-tert-Butylstilbazolium Betaine Dye and Its Homomorph o,o′-Di-tert-butylstilbazolium Betaine Dye† , 1997 .

[11]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[12]  William L. Jorgensen,et al.  QM/MM Simulations for Diels−Alder Reactions in Water: Contribution of Enhanced Hydrogen Bonding at the Transition State to the Solvent Effect† , 2002 .

[13]  V. O. Kudyshkin,et al.  QSPR Modeling of the Reactivity Parameters of Monomers in Radical Copolymerizations , 2004 .

[14]  Donald G Truhlar,et al.  Modeling the kinetics of bimolecular reactions. , 2006, Chemical reviews.

[15]  Some modern methods for estimation of reactivity of organic compounds , 2007 .

[16]  W. L. Jorgensen,et al.  Understanding Rate Accelerations for Diels-Alder Reactions in Solution Using Enhanced QM/MM Methodology. , 2007, Journal of chemical theory and computation.

[17]  J. Niu,et al.  Estimation of gas-phase reaction rate constants of alkylnaphthalenes with chlorine, hydroxyl and nitrate radicals. , 2007, Chemosphere.

[18]  William L Jorgensen,et al.  Elucidation of Rate Variations for a Diels-Alder Reaction in Ionic Liquids from QM/MM Simulations. , 2007, Journal of chemical theory and computation.

[19]  I. Tetko,et al.  ISIDA - Platform for Virtual Screening Based on Fragment and Pharmacophoric Descriptors , 2008 .

[20]  Bing Yi,et al.  Quantitative structure–property relationships for the reactivity parameters of acrylate monomers , 2008 .

[21]  T. Ho Kinetic Modeling of Large‐Scale Reaction Systems , 2008 .

[22]  Diogo A. R. S. Latino,et al.  Assignment of EC Numbers to Enzymatic Reactions with MOLMAP Reaction Descriptors and Random Forests , 2009, J. Chem. Inf. Model..

[23]  V. Kiselev,et al.  Internal and external factors influencing the Diels–Alder reaction , 2009 .

[24]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[25]  Jason A. Morrill,et al.  Development of quantitative structure-activity relationships for explanatory modeling of fast reacting (meth)acrylate monomers bearing novel functionality. , 2011, Journal of molecular graphics & modelling.

[26]  Nicolas Lachiche,et al.  A Representation to Apply Usual Data Mining Techniques to Chemical reactions - Illustration on the Rate Constant of SN2 reactions in water , 2011, Int. J. Artif. Intell. Tools.

[27]  Natalia Kireeva,et al.  Toward Navigating Chemical Space of Ionic Liquids: Prediction of Melting Points Using Generative Topographic Maps , 2012 .

[28]  Alban Arrault,et al.  Generative Topographic Mapping-Based Classification Models and Their Applicability Domain: Application to the Biopharmaceutics Drug Disposition Classification System (BDDCS) , 2013, J. Chem. Inf. Model..

[29]  Wendy A Warr,et al.  A Short Review of Chemical Reaction Database Systems, Computer‐Aided Synthesis Design, Reaction Prediction and Synthetic Feasibility , 2014, Molecular informatics.

[30]  Gilles Marcou,et al.  An Evolutionary Optimizer of libsvm Models , 2014 .

[31]  Igor I. Baskin,et al.  Structure-reactivity relationships in terms of the condensed graphs of reactions , 2014, Russian Journal of Organic Chemistry.

[32]  Timur I. Madzhidov,et al.  Structure–reactivity relationship in bimolecular elimination reactions based on the condensed graph of a reaction , 2015, Journal of Structural Chemistry.

[33]  Dragos Horvath,et al.  Mappability of drug-like space: towards a polypharmacologically competent map of drug-relevant compounds , 2015, Journal of Computer-Aided Molecular Design.

[34]  P. Huck,et al.  Predicting the reaction rate constants of micropollutants with hydroxyl radicals in water using QSPR modeling. , 2015, Chemosphere.

[35]  Héléna A. Gaspar,et al.  GTM‐Based QSAR Models and Their Applicability Domains , 2015, Molecular informatics.

[36]  Dragos Horvath,et al.  Expert System for Predicting Reaction Conditions: The Michael Reaction Case , 2015, J. Chem. Inf. Model..

[37]  K. Baumann,et al.  Chemoinformatic Classification Methods and their Applicability Domain , 2016, Molecular informatics.

[38]  Gilles Marcou,et al.  Generative Topographic Mapping Approach to Modeling and Chemical Space Visualization of Human Intestinal Transporters , 2016 .

[39]  G. Szabó,et al.  Substituent effect on the hydrolysis of chlorosilanes: quantum chemical and QSPR study , 2017, Structural Chemistry.

[40]  Igor I. Baskin,et al.  Artificial intelligence in synthetic chemistry: achievements and prospects , 2017 .

[41]  Alexandre Varnek,et al.  Structure–reactivity modeling using mixture-based representation of chemical reactions , 2017, Journal of Computer-Aided Molecular Design.