Bayesian Optimization for Materials Design with Mixed Quantitative and Qualitative Variables

Although Bayesian Optimization (BO) has been employed for accelerating materials design in computational materials engineering, existing works are restricted to problems with quantitative variables. However, real designs of materials systems involve both qualitative and quantitative design variables representing material compositions, microstructure morphology, and processing conditions. For mixed-variable problems, existing Bayesian Optimization (BO) approaches represent qualitative factors by dummy variables first and then fit a standard Gaussian process (GP) model with numerical variables as the surrogate model. This approach is restrictive theoretically and fails to capture complex correlations between qualitative levels. We present in this paper the integration of a novel latent-variable (LV) approach for mixed-variable GP modeling with the BO framework for materials design. LVGP is a fundamentally different approach that maps qualitative design variables to underlying numerical LV in GP, which has strong physical justification. It provides flexible parameterization and representation of qualitative factors and shows superior modeling accuracy compared to the existing methods. We demonstrate our approach through testing with numerical examples and materials design examples. The chosen materials design examples represent two different scenarios, one on concurrent materials selection and microstructure optimization for optimizing the light absorption of a quasi-random solar cell, and another on combinatorial search of material constitutes for optimal Hybrid Organic-Inorganic Perovskite (HOIP) design. It is found that in all test examples the mapped LVs provide intuitive visualization and substantial insight into the nature and effects of the qualitative factors. Though materials designs are used as examples, the method presented is generic and can be utilized for other mixed variable design optimization problems that involve expensive physics-based simulations.

[1]  Matthias Poloczek,et al.  Efficient search of compositional space for hybrid organic–inorganic perovskites via Bayesian optimization , 2018, npj Computational Materials.

[2]  Deborah F. Swayne,et al.  Data Visualization With Multidimensional Scaling , 2008 .

[3]  Ker-Chau Li,et al.  Sliced Inverse Regression for Dimension Reduction , 1991 .

[4]  Eric Walter,et al.  Global optimization based on noisy evaluations: An empirical study of two statistical approaches , 2008 .

[5]  Sonja Kuhnt,et al.  Design and analysis of computer experiments , 2010 .

[6]  Salvatore Torquato,et al.  STATISTICAL DESCRIPTION OF MICROSTRUCTURES , 2002 .

[7]  D. Griffin,et al.  Finite-Element Analysis , 1975 .

[8]  Hongyi Xu A Machine Learning-Based Design Representation Method for Designing Heterogeneous Microstructures , 2014, DAC 2014.

[9]  Thomas Hochkirchen Design and Modeling for Computer Experiments by K.-T. Fang, R. Li and A. Sudjianto , 2006 .

[10]  Donald R. Jones,et al.  Efficient Global Optimization of Expensive Black-Box Functions , 1998, J. Glob. Optim..

[11]  Weihong Zhang,et al.  Shape, sizing optimization and material selection based on mixed variables and genetic algorithm , 2011 .

[12]  R. Cook,et al.  Sufficient Dimension Reduction via Inverse Regression , 2005 .

[13]  Peter Z. G. Qian,et al.  Gaussian Process Models for Computer Experiments With Qualitative and Quantitative Factors , 2008, Technometrics.

[14]  Wei Chen,et al.  Perspective: NanoMine: A material genome approach for polymer nanocomposites analysis and design , 2016 .

[15]  R. K. Rowe,et al.  Additive Gaussian Process for Computer Models With Qualitative and Quantitative Factors , 2017, Technometrics.

[16]  Wei Chen,et al.  Descriptor-based methodology for statistical characterization and 3D reconstruction of microstructural materials , 2014 .

[17]  Wei Chen,et al.  Microstructure reconstruction and structural equation modeling for computational design of nanodielectrics , 2015, Integrating Materials and Manufacturing Innovation.

[18]  James Theiler,et al.  Adaptive Strategies for Materials Design using Uncertainties , 2016, Scientific Reports.

[19]  Muratahan Aykol,et al.  Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD) , 2013 .

[20]  Runze Li,et al.  Design and Modeling for Computer Experiments , 2005 .

[21]  Nando de Freitas,et al.  Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[22]  Cheng Li,et al.  Rapid Bayesian optimisation for synthesis of short polymer fiber materials , 2017, Scientific Reports.

[23]  D. Ginsbourger,et al.  A benchmark of kriging-based infill criteria for noisy optimization , 2013, Structural and Multidisciplinary Optimization.

[24]  J. Dennis,et al.  Mixed Variable Optimization of the Number and Composition of Heat Intercepts in a Thermal Insulation System , 2001 .

[25]  Wei Chen,et al.  Toward the development of a quantitative tool for predicting dispersion of nanocomposites under non-equilibrium processing conditions , 2016, Journal of Materials Science.

[26]  Thomas K. Gaylord,et al.  Stable implementation of the rigorous coupled-wave analysis for surface-relief gratings: enhanced transmittance matrix approach , 1995 .

[27]  Yulei Zhang,et al.  Computer Experiments with Qualitative and Quantitative Variables: A Review and Reexamination , 2015 .

[28]  J. Sacks,et al.  Analysis of protein activity data by Gaussian stochastic process models. , 1999, Journal of biopharmaceutical statistics.

[29]  M. Abramson Mixed Variable Optimization of a Load-Bearing Thermal Insulation System Using a Filter Pattern Search Algorithm , 2004 .

[30]  Daniel W. Apley,et al.  Lifted Brownian Kriging Models , 2017, Technometrics.

[31]  Chen Wang,et al.  Design of Non-Deterministic Quasi-random Nanophotonic Structures Using Fourier Space Representations , 2017, Scientific Reports.

[32]  Lifeng Li,et al.  New formulation of the Fourier modal method for crossed surface-relief gratings , 1997 .

[33]  Wei Chen,et al.  A Latent Variable Approach to Gaussian Process Modeling with Qualitative and Quantitative Factors , 2018, Technometrics.

[34]  G. Pazour,et al.  Ror2 signaling regulates Golgi structure and transport through IFT20 for tumor invasiveness , 2017, Scientific Reports.

[35]  Takashi Miyake,et al.  Crystal structure prediction accelerated by Bayesian optimization , 2018 .

[36]  Teri W. Odom,et al.  Characterization and Design of Functional Quasi-Random Nanostructured Materials Using Spectral Density Function , 2016, DAC 2016.

[37]  M. Orio,et al.  Density functional theory , 2009, Photosynthesis Research.

[38]  Yang Zhang,et al.  The classical correlation limits the ability of the measurement-induced average coherence , 2017, Scientific Reports.

[39]  Jeong‐Soo Park Optimal Latin-hypercube designs for computer experiments , 1994 .

[40]  Deborah L. McGuinness,et al.  NanoMine schema: An extensible data representation for polymer nanocomposites , 2018, APL Materials.