Predictive modeling in homogeneous catalysis: a tutorial.

Predictive modeling has become a practical research tool in homogeneous catalysis. It can help to pinpoint 'good regions' in the catalyst space, narrowing the search for the optimal catalyst for a given reaction. Like any other new idea, in silico catalyst optimization is accepted by some researchers and met with skepticism by others. The basic requirements for good predictive models are a reliable set of initial experimental data, a method for generating and testing virtual catalyst libraries, and robust validation protocols. Once you have these, the key task is translating the catalysis problem into something that a computer can understand. In this tutorial review we explain in simple terms what predictive modeling actually is, why and when one should use it, and how it can be implemented.
