Autonomous efficient experiment design for materials discovery with Bayesian model averaging

The accelerated exploration of the materials space in order to identify configurations with optimal properties is an ongoing challenge. Current paradigms are typically centered around the idea of performing this exploration through high-throughput experimentation/computation. Such approaches, however, do not account fo the always present constraints in resources available. Recently, this problem has been addressed by framing materials discovery as an optimal experiment design. This work augments earlier efforts by putting forward a framework that efficiently explores the materials design space not only accounting for resource constraints but also incorporating the notion of model uncertainty. The resulting approach combines Bayesian Model Averaging within Bayesian Optimization in order to realize a system capable of autonomously and adaptively learning not only the most promising regions in the materials space but also the models that most efficiently guide such exploration. The framework is demonstrated by efficiently exploring the MAX ternary carbide/nitride space through Density Functional Theory (DFT) calculations.

[1]  R. Arróyave,et al.  Does aluminum play well with others? Intrinsic Al-A alloying behavior in 211/312 MAX phases , 2017 .

[2]  Y. Makino,et al.  Estimation of bulk moduli of compounds by empirical relations between bulk modulus and interatomic distance , 2000 .

[3]  R. Arróyave,et al.  Ab-initio aprroach to the electronic, structural, elastic, and finite-temperature thermodynamic properties of Ti2AX (A = Al or Ga and X = C or N) , 2011 .

[4]  M. Barsoum,et al.  Mechanical Properties of the MAX Phases , 2004 .

[5]  Charles H. Ward Materials Genome Initiative for Global Competitiveness , 2012 .

[6]  Ichiro Takeuchi,et al.  Fulfilling the promise of the materials genome initiative with high-throughput experimental methodologies , 2017 .

[7]  Nando de Freitas,et al.  Bayesian Optimization in a Billion Dimensions via Random Embeddings , 2013, J. Artif. Intell. Res..

[8]  Warren B. Powell,et al.  A Knowledge-Gradient Policy for Sequential Information Collection , 2008, SIAM J. Control. Optim..

[9]  Burke,et al.  Generalized Gradient Approximation Made Simple. , 1996, Physical review letters.

[10]  Edward R. Dougherty,et al.  Constructing Pathway-Based Priors within a Gaussian Mixture Model for Bayesian Regression and Classification , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[11]  Edward R. Dougherty,et al.  Optimal Bayesian Transfer Learning , 2018, IEEE Transactions on Signal Processing.

[12]  Krishna Rajan,et al.  Combinatorial and high-throughput screening of materials libraries: review of state of the art. , 2011, ACS combinatorial science.

[13]  Warren B. Powell,et al.  The Knowledge-Gradient Policy for Correlated Normal Beliefs , 2009, INFORMS J. Comput..

[14]  Edward R. Dougherty,et al.  What should be expected from feature selection in small-sample settings , 2006, Bioinform..

[15]  W. Kohn,et al.  Self-Consistent Equations Including Exchange and Correlation Effects , 1965 .

[16]  Wang,et al.  Accurate and simple analytic representation of the electron-gas correlation energy. , 1992, Physical review. B, Condensed matter.

[17]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[18]  Geoffrey T. Parks,et al.  Multi-criteria material selection in engineering design , 2004 .

[19]  Krishna Rajan,et al.  Informatics derived materials databases for multifunctional properties , 2015, Science and technology of advanced materials.

[20]  Christopher K. I. Williams,et al.  Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) , 2005 .

[21]  Wasserman,et al.  Bayesian Model Selection and Model Averaging. , 2000, Journal of mathematical psychology.

[22]  Adrian E. Raftery,et al.  Bayesian Model Averaging: A Tutorial , 2016 .

[23]  J. Rumble CRC Handbook of Chemistry and Physics , 2019 .

[24]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[25]  Turab Lookman,et al.  Multi-objective Optimization for Materials Discovery via Adaptive Design , 2018, Scientific Reports.

[26]  Ichiro Takeuchi,et al.  Bayesian-Driven First-Principles Calculations for Accelerating Exploration of Fast Ion Conductors for Rechargeable Battery Application , 2018, Scientific Reports.

[27]  Edward R. Dougherty,et al.  Optimal Experimental Design for Gene Regulatory Networks in the Presence of Uncertainty , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[28]  Xiaoning Qian,et al.  Bayesian module identification from multiple noisy networks , 2016, EURASIP J. Bioinform. Syst. Biol..

[29]  Surya R. Kalidindi,et al.  Materials Data Science: Current Status and Future Outlook , 2015 .

[30]  Michael T. M. Emmerich,et al.  Hypervolume-based expected improvement: Monotonicity properties and exact computation , 2011, 2011 IEEE Congress of Evolutionary Computation (CEC).

[31]  Edward R. Dougherty,et al.  Incorporating biological prior knowledge for Bayesian learning via maximal knowledge-driven information priors , 2017, BMC Bioinformatics.

[32]  Large difference in the elastic properties of fcc and hcp hard-sphere crystals. , 2003, Physical review letters.

[33]  Atsuto Seko,et al.  Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids , 2013, 1310.1546.

[34]  James Theiler,et al.  Adaptive Strategies for Materials Design using Uncertainties , 2016, Scientific Reports.

[35]  James Theiler,et al.  Accelerated search for materials with targeted properties by adaptive design , 2016, Nature Communications.

[36]  Edward R. Dougherty,et al.  Optimal experimental design for materials discovery , 2017 .

[37]  Edward R. Dougherty,et al.  Intrinsically Bayesian robust classifier for single-cell gene expression trajectories in gene regulatory networks , 2018, BMC Systems Biology.

[38]  Michel W. Barsoum,et al.  The MN+1AXN phases: A new class of solids , 2000 .

[39]  Zhengming Sun,et al.  Progress in research and development on MAX phases: a family of layered ternary compounds , 2011 .

[40]  Adam D. Bull,et al.  Convergence Rates of Efficient Global Optimization Algorithms , 2011, J. Mach. Learn. Res..

[41]  Kresse,et al.  Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. , 1996, Physical review. B, Condensed matter.

[42]  Paxton,et al.  High-precision sampling for Brillouin-zone integration in metals. , 1989, Physical review. B, Condensed matter.

[43]  D. Basak,et al.  Support Vector Regression , 2008 .

[44]  Edward R. Dougherty,et al.  Is cross-validation valid for small-sample microarray classification? , 2004, Bioinform..

[45]  Ridwan Sakidja,et al.  A genomic approach to the stability, elastic, and electronic properties of the MAX phases , 2014 .

[46]  G. Kresse,et al.  Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set , 1996 .

[47]  Junichiro Shiomi,et al.  Designing Nanostructures for Phonon Transport via Bayesian Optimization , 2016, 1609.04972.

[48]  Vijay Kumar,et al.  The Role of Valence Electron Concentration in Tuning the Structure, Stability, and Electronic Properties of Mo6S9–xIx Nanowires , 2015 .

[49]  Edward R. Dougherty,et al.  Quantifying the Objective Cost of Uncertainty in Complex Dynamical Systems , 2013, IEEE Transactions on Signal Processing.

[50]  L. Györfi,et al.  A Distribution-Free Theory of Nonparametric Regression (Springer Series in Statistics) , 2002 .

[51]  C. Liu,et al.  Effect of valence electron concentration on stability of fcc or bcc phase in high entropy alloys , 2011 .

[52]  D. Goldberg,et al.  BOA: the Bayesian optimization algorithm , 1999 .

[53]  Edward R. Dougherty,et al.  Performance of feature-selection methods in the classification of high-dimension data , 2009, Pattern Recognit..

[54]  Blöchl,et al.  Improved tetrahedron method for Brillouin-zone integrations. , 1994, Physical review. B, Condensed matter.

[55]  R. Hill The Elastic Behaviour of a Crystalline Aggregate , 1952 .

[56]  Marco Buongiorno Nardelli,et al.  The high-throughput highway to computational materials design. , 2013, Nature materials.

[57]  Hirokazu Sato,et al.  Determination of electrons per atom ratio for transition metal compounds studied by FLAPW-Fourier calculations , 2016 .

[58]  A. Choudhary,et al.  Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science , 2016 .

[59]  Rahul Rao,et al.  Autonomy in materials research: a case study in carbon nanotube growth , 2016 .

[60]  S. Suram,et al.  Generating information-rich high-throughput experimental materials genomes using functional clustering via multitree genetic programming and information theory. , 2015, ACS combinatorial science.

[61]  D. Chadi,et al.  Special points for Brillouin-zone integrations , 1977 .

[62]  M. Barsoum,et al.  MAX phases : Bridging the gap between metals and ceramics MAX phases : Bridging the gap between metals and ceramics , 2013 .

[63]  Paul Saxe,et al.  Symmetry-general least-squares extraction of elastic data for strained materials from ab initio calculations of stress , 2002 .

[64]  Tom Schaul,et al.  Building Machines that Learn and Think for Themselves: Commentary on Lake et al., Behavioral and Brain Sciences, 2017 , 2017, 1711.08378.

[65]  J. Hogden,et al.  Statistical inference and adaptive design for materials discovery , 2017 .

[66]  David Fletcher,et al.  Bayesian Model Averaging , 2018 .

[67]  Edward R. Dougherty,et al.  Experimental Design via Generalized Mean Objective Cost of Uncertainty , 2018, IEEE Access.

[68]  Donald R. Jones,et al.  Efficient Global Optimization of Expensive Black-Box Functions , 1998, J. Glob. Optim..

[69]  R. Arróyave,et al.  Effect of ternary additions to structural properties of NiTi alloys , 2016 .

[70]  Siamak Zamani Dadaneh,et al.  BNP-Seq: Bayesian Nonparametric Differential Expression Analysis of Sequencing Count Data , 2016, 1608.03991.

[71]  Edward R. Dougherty,et al.  Intrinsically Bayesian robust Karhunen-Loève compression , 2018, Signal Process..

[72]  D. Tromans ELASTIC ANISOTROPY OF HCP METAL CRYSTALS AND POLYCRYSTALS , 2011 .

[73]  Nando de Freitas,et al.  Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[74]  Kirthevasan Kandasamy,et al.  High Dimensional Bayesian Optimisation and Bandits via Additive Models , 2015, ICML.

[75]  K. Fujimura,et al.  Accelerated Materials Design of Lithium Superionic Conductors Based on First‐Principles Calculations and Machine Learning Algorithms , 2013 .

[76]  Michel W. Barsoum,et al.  Elastic and Mechanical Properties of the MAX Phases , 2011 .

[77]  R. Arróyave,et al.  High-throughput combinatorial study of the effect of M site alloying on the solid solution behavior of M 2 AlC MAX phases , 2016 .