Data flow modeling, data mining and QSAR in high-throughput discovery of functional nanomaterials

Abstract Metal oxide nanoparticles are promising materials in applications for fuel cells, gas sensors and fine chemical catalysis. Their functionality depends excessively on composition, structure as well as synthesis and processing conditions. Continuous hydrothermal flow synthesis (CHFS) reactors are an effective technology to make nanoceramics. In order to increase sample throughput of CHFS, a manual high-throughput continuous hydrothermal (HiTCH) flow synthesis process capable of formulating scores of samples per day was developed. More recently, a fully automated nanoceramics synthesis platform called RAMSI (rapid automated synthesis instrument) based on the HiTCH synthesis technology was developed. When large numbers of nanoceramics are made and formulated into appropriate libraries, automated analytical instruments can be used to allow collection of a large amount of useful data. This paper describes the information flow management system of RAMSI (as well as CHFS) and the data mining system for supporting discovery, QSAR (quantitative structure–activity relationship) modeling and DoE (design of experiments). Case studies demonstrating the use of the high-throughput data mining system are presented. These include clustering of Raman spectra, interpretation of X-ray diffraction (XRD) measurements, and QSAR model building linking XRD data and photocatalytic properties. A genetic algorithm method for DoE is also presented that can guide the experiments to search optimal XRD patterns.

[1]  Jr. Allen B. Tucker,et al.  The Computer Science and Engineering Handbook , 1997 .

[2]  Xue Z. Wang,et al.  Knowledge discovery from process operational data using PCA and fuzzy clustering , 2001 .

[3]  C. McGreavy,et al.  Automatic Classification for Mining Process Operational Data , 1998 .

[4]  Martin Vickers,et al.  High-throughput continuous hydrothermal synthesis of an entire nanoceramic phase diagram. , 2009, Journal of combinatorial chemistry.

[5]  Ataullah Khan,et al.  Surface stabilized nanosized Ce(x)Zr(1-x)O(2) solid solutions over SiO(2): characterization by XRD, Raman, and HREM techniques. , 2005, The journal of physical chemistry. B.

[6]  Benjaram M. Reddy,et al.  Nanosized CeO2–SiO2, CeO2–TiO2, and CeO2–ZrO2 Mixed Oxides: Influence of Supporting Oxide on Thermal Stability and Oxygen Storage Properties of Ceria , 2005 .

[7]  Jennifer S. Holmgren,et al.  Strategies and applications of combinatorial methods and high throughput screening to the discovery of non-noble metal catalyst , 2004 .

[8]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[9]  Isaac Abrahams,et al.  Direct syntheses of Lan+1NinO3n+1 phases (n=1, 2, 3 and ∞) from nanosized co-crystallites , 2008 .

[10]  Kathryn Thompson,et al.  Rapid automated materials synthesis instrument: exploring the composition and heat-treatment of nanoprecursors toward low temperature red phosphors. , 2010, Journal of combinatorial chemistry.

[11]  C. Proust,et al.  Vibrational Spectroscopy of the C-Form of Yttrium Sesquioxide , 1995 .

[12]  C. McGreavy,et al.  Data Mining and Knowledge Discovery for Process Monitoring and Control , 1999 .

[13]  Taeghwan Hyeon,et al.  Chemical synthesis of magnetic nanoparticles. , 2003, Chemical communications.

[14]  Jonathan C. Knowles,et al.  Photocatalytic activities of N-doped nano-titanias and titanium nitride , 2009 .

[15]  Godwin A Ayoko,et al.  Raman spectroscopy of uranyl rare earth carbonate kamotoite-(Y). , 2006, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[16]  Anita Young,et al.  Prediction of Noninteractive Mixture Toxicity of Organic Compounds Based on a Fuzzy Set Method , 2004, J. Chem. Inf. Model..

[17]  Xue Z. Wang,et al.  An integrated data mining system and its application to process operational data analysis , 1999 .

[18]  Hilario Vidal,et al.  Redox behavior of CeO2–ZrO2 mixed oxides: I. Influence of redox treatments on high surface area catalysts , 2000 .

[19]  Robert H. Byrne,et al.  Carbonate complexation of yttrium and the rare earth elements in natural waters , 2004 .

[20]  W. Maier,et al.  Combinatorial and high-throughput materials science. , 2007, Angewandte Chemie.

[21]  Changsheng Xie,et al.  Preparation and gas-sensing properties of Ce-doped ZnO thin-film sensors by dip-coating , 2007 .

[22]  M. Grätzel,et al.  A low-cost, high-efficiency solar cell based on dye-sensitized colloidal TiO2 films , 1991, Nature.

[23]  J. Darr,et al.  Continuous hydrothermal synthesis of extensive 2D sodium titanate (Na2Ti3O7) nano-sheets. , 2010, Dalton transactions.

[24]  J. Kašpar,et al.  Nanostructured materials for advanced automotive de-pollution catalysts , 2003 .

[25]  S. Wongkasemjit,et al.  Synthesis of ceria-zirconia mixed oxide from cerium and zirconium glycolates via sol-gel process and its reduction property , 2006 .

[26]  Anita Young,et al.  Genetic Programming for the Induction of Decision Trees to Model Ecotoxicity Data , 2005, J. Chem. Inf. Model..

[27]  Jonathan C. Knowles,et al.  Synthesis and characterisation of magnesium substituted calcium phosphate bioceramic nanoparticles made via continuous hydrothermal flow synthesis , 2008 .

[28]  Paul Boldrin,et al.  Instant nano-hydroxyapatite: a continuous and rapid hydrothermal synthesis. , 2006, Chemical communications.

[29]  Xue Z. Wang,et al.  Robust QSAR model development in high-throughput catalyst discovery based on genetic parameter optimisation , 2009 .

[30]  X. Z. Wang KNOWLEDGE DISCOVERY THROUGH MINING PROCESS OPERATIONAL DATA , 2001 .

[31]  Isaac Abrahams,et al.  Direct syntheses of mixed ion and electronic conductors La4Ni3O10 and La3Ni2O7 from nanosized coprecipitates , 2007 .

[32]  Jawwad A. Darr,et al.  Direct Synthesis of Nanosized NiCo2O4 Spinel and Related Compounds via Continuous Hydrothermal Synthesis Methods , 2007 .

[33]  Ihtesham ur Rehman,et al.  Screening tests for the evaluation of nanoparticle titania photocatalysts , 2009 .

[34]  Márcio Nele,et al.  Composition effects on the activity of Cu–ZnO–Al2O3 based catalysts for the water gas shift reaction: A statistical approach , 1998 .

[35]  Anders Hagfeldt,et al.  Light-Induced Redox Reactions in Nanocrystalline Systems , 1995 .

[36]  J. Ross Quinlan,et al.  Decision Trees and Instance-Based Classifiers , 1997, The Computer Science and Engineering Handbook.

[37]  Xue Z. Wang,et al.  MULTILEVEL PCA AND INDUCTIVE LEARNING FOR KNOWLEDGE EXTRACTION FROM OPERATIONAL DATA OF BATCH PROCESSES , 2001 .

[38]  Shoufeng Yang,et al.  Synthesis and characterization of doped nano-sized ceria–zirconia solid solutions , 2009 .

[39]  G Bognolo,et al.  The use of surface-active agents in the preparation and assembly of quantum-sized nanoparticles. , 2003, Advances in colloid and interface science.

[40]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[41]  X Z Wang,et al.  Induction of decision trees using genetic programming for modelling ecotoxicity data: adaptive discretization of real-valued endpoints , 2006, SAR and QSAR in environmental research.

[42]  Xue Zhong Wang,et al.  Inductive data mining based on genetic programming: Automatic generation of decision trees from data for process historical data analysis , 2009, Comput. Chem. Eng..

[43]  Kathryn Thompson,et al.  Direct continuous hydrothermal synthesis of high surface area nanosized titania , 2009 .

[44]  Nigel P. Brandon,et al.  Recent Advances in Materials for Fuel Cells , 2003 .

[45]  Edward Lester,et al.  Continuous hydrothermal synthesis of inorganicmaterials in a near-critical water flow reactor; the one-step synthesisof nano-particulate Ce1 − xZrxO2(x = 0–1)solid solutions , 2001 .

[46]  Vittorio Longo,et al.  Phase equilibrium diagram of the system ceria-yttria for temperatures between 900 and 1700° C , 1981 .