Integrating high-throughput characterization into combinatorial heterogeneous catalysis: unsupervised construction of quantitative structure/property relationship models

This paper presents a novel approach in the framework of heterogeneous combinatorial catalysis, which integrates into the global discovery strategy the use of inexpensive high-throughput characterization of libraries of catalysts, as multivariate spectral descriptors for catalytic quantitative structure/property relationship (QSPR) modeling. Moreover, QSPR models can be used to assist the design of new libraries and for extraction of rules and relationships, yielding knowledge about catalysis. This approach can be of special interest when experimental evaluation of catalytic behavior is very expensive or time-consuming, as, for instance, for catalyst deactivation studies, for testing under very severe conditions, or when high amounts of catalyst are demanded. This methodology has been applied to modeling of the behavior of epoxidation catalysts, with the composition vector of the starting synthesis gel and XRD spectra as descriptors. Dimensional reduction was conducted by principal components analysis, clustering, and Kohonen networks, and predictive models were obtained with the use of logistic equations, artificial neural networks, and decision tree techniques. The use of spectral descriptors made it possible to markedly improve the prediction performance obtained with synthesis descriptors alone.

[1]  Claude Mirodatos,et al.  How to Design Diverse Libraries of Solid Catalysts , 2003 .

[2]  Anders Berglund,et al.  PCA and PLS with very large data sets , 2005, Comput. Stat. Data Anal..

[3]  F. Schüth,et al.  Spatially resolving infrared spectroscopy for parallelized characterization of acid sites of catalysts via pyridine sorption: Possibilities and limitations , 2004 .

[4]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[5]  J. Gasteiger,et al.  Knowledge Discovery in Reaction Databases: Landscaping Organic Reactions by a Self-Organizing Neural Network , 1997 .

[6]  Jürgen Bajorath,et al.  Chemoinformatics : concepts, methods, and tools for drug discovery , 2004 .

[7]  Gisbert Schneider,et al.  Status of HTS Data Mining Approaches , 2004 .

[8]  Claude Mirodatos,et al.  The development of descriptors for solids: teaching "catalytic intuition" to a computer. , 2004, Angewandte Chemie.

[9]  Alessio Micheli,et al.  Recursive self-organizing network models , 2004, Neural Networks.

[10]  Ulrich Simon,et al.  A Flexible Database for Combinatorial and High‐Throughput Materials Science , 2005 .

[11]  Estefania Argente,et al.  Can artificial neural networks help the experimentation in catalysis , 2003 .

[12]  Claude Mirodatos,et al.  Design of Discovery Libraries for Solids Based on QSAR Models , 2005 .

[13]  Cristina Vázquez,et al.  Total reflection X-ray fluorescence polymer spectra: classification by taxonomy statistic tools. , 2002, Talanta.

[14]  Claude Mirodatos,et al.  Using Artificial Neural Networks to Boost High‐throughput Discovery in Heterogeneous Catalysis , 2004 .

[15]  Hans-Werner Schmidt,et al.  Combinatorial Material Libraries on the Microgram Scale with an Example of Hydrothermal Synthesis. , 1998, Angewandte Chemie.

[16]  David J. Livingstone,et al.  Neural Networks in 3D QSAR , 2003 .

[17]  Dan Nettleton,et al.  Principal Components Regression With Data Chosen Components and Related Methods , 2003, Technometrics.

[18]  Mar Michael Meier,et al.  Combinatorial Methods, Automated Synthesis and High-Throughput Screening in Polymer Research: Past and Present , 2003 .

[19]  Alberto Bugarín,et al.  Current Topics in Artificial Intelligence, 11th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2005, Santiago de Compostela, Spain, November 16-18, 2005, Revised Selected Papers , 2006, CAEPIA.

[20]  James Norman Cawse,et al.  Experimental Design for Combinatorial and High Throughput Materials Development , 2002 .

[21]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[22]  Yves Schuurman,et al.  Acceleration in catalyst development by fast transient kinetic investigation , 2003 .

[23]  W. H. Weinberg,et al.  High throughput experimental and theoretical predictive screening of materials : A comparative study of search strategies for new fuel cell anode catalysts , 2003 .

[24]  Suljo Linic,et al.  Selectivity driven design of bimetallic ethylene epoxidation catalysts from first principles , 2004 .

[25]  Josef Honerkamp,et al.  High-Output Polymer Screening: Exploiting Combinatorial Chemistry and Data Mining Tools in Catalyst and Polymer Development† , 2003 .

[26]  Zhide Hu,et al.  Prediction of electrophoretic mobility of substituted aromatic acids in different aqueous–alcoholic solvents by capillary zone electrophoresis based on support vector machine , 2004 .

[27]  M. Karelson Molecular descriptors in QSAR/QSPR , 2000 .

[28]  Estefania Argente,et al.  Optimisation of olefin epoxidation catalysts with the application of high-throughput and genetic algorithms assisted by artificial neural networks (softcomputing techniques) , 2005 .

[29]  J. M. Serra,et al.  High-throughput characterisation of materials by photoluminescence spectroscopy. , 2004, Chemistry.