Predicting the Stereoselectivity of Chemical Transformations by Machine Learning

Stereoselective reactions (both chemical and enzymatic reactions) have been essential for origin of life, evolution, human biology and medicine. Since late 1960s, there have been numerous successes in the exciting new frontier of asymmetric catalysis. However, most industrial and academic asymmetric catalysis nowadays do follow the trial-and-error model, since the energetic difference for success or failure in asymmetric catalysis is incredibly small. Our current understanding about stereoselective reactions is mostly qualitative that stereoselectivity arises from differences in steric effects and electronic effects in multiple competing mechanistic pathways. Quantitatively understanding and modulating the stereoselectivity of for a given chemical reaction still remains extremely difficult. As a proof of principle, we herein present a novel machine learning technique, which combines a LASSO model and two Random Forest model via two Gaussian Mixture models, for quantitatively predicting stereoselectivity of chemical reactions. Compared to the recent ground-breaking approach [1], our approach is able to capture interactions between features and exploit complex data distributions, which are important for predicting stereoselectivity. Experimental results on a recently published dataset demonstrate that our approach significantly outperform [1]. The insight obtained from our results provide a solid foundation for further exploration of other synthetically valuable yet mechanistically intriguing stereoselective reactions.

[1]  Harris Drucker,et al.  Improving Regressors using Boosting Techniques , 1997, ICML.

[2]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[3]  M. Rueping,et al.  Metal‐Free, Enantioselective Strecker Reactions Catalyzed by Chiral BINOL and TADDOL Catalysts , 2007 .

[4]  Qin Yang,et al.  Organocatalytic Asymmetric Reduction of Fluorinated Alkynyl Ketimines. , 2018, The Journal of organic chemistry.

[5]  J. Antilla,et al.  Catalytic asymmetric addition of alcohols to imines: enantioselective preparation of chiral N,O-aminals. , 2008, Journal of the American Chemical Society.

[6]  Y. Ni,et al.  Enantioselective organocatalytic reductive amination. , 2006, Journal of the American Chemical Society.

[7]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[8]  Jolene P Reid,et al.  Holistic Prediction of Enantioselectivity in Asymmetric Catalysis , 2019, Nature.

[9]  T. Akiyama,et al.  Chiral Brønsted Acid Catalyzed Enantioselective Hydrophosphonylation of Imines: Asymmetric Synthesis of α-Amino Phosphonates , 2005 .

[10]  Yang Wang,et al.  Prediction of higher-selectivity catalysts by computer-driven workflow and machine learning , 2019, Science.

[11]  T. Akiyama,et al.  Chiral phosphoric acid catalyzed transfer hydrogenation: facile synthetic access to highly optically active trifluoromethylated amines. , 2011, Angewandte Chemie.

[12]  T. Akiyama,et al.  Chiral phosphoric-acid-catalyzed transfer hydrogenation of ethyl ketimine derivatives by using benzothiazoline. , 2014, Chemistry.

[13]  J. Antilla,et al.  Highly enantioselective hydrogenation of enamides catalyzed by chiral phosphoric acids. , 2009, Organic letters.

[14]  L. Wojtas,et al.  Chiral phosphoric acid-catalyzed addition of thiols to N-acyl imines: access to chiral N,S-acetals. , 2011, Organic letters.

[15]  T. Nugent Chiral amine synthesis : methods, developments and applications , 2010 .

[16]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[17]  M. Snapper,et al.  Simple Organic Molecules as Catalysts for Enantioselective Synthesis of Amines and Alcohols , 2012, Nature.

[18]  Geoffrey J. McLachlan,et al.  Mixture models : inference and applications to clustering , 1989 .