Towards an Autonomous Efficient Materials Discovery Framework: An Example of Optimal Experiment Design Under Model Uncertainty

The accelerated exploration of the materials space in order to identify configurations with optimal properties is an ongoing challenge. Current paradigms are typically centered around the idea of performing this exploration through high-throughput experimentation/computation. Such approaches, however, do not account fo the always present constraints in resources available. Recently, this problem has been addressed by framing materials discovery as an optimal experiment design. This work augments earlier efforts by putting forward a framework that efficiently explores the materials design space not only accounting for resource constraints but also incorporating the notion of model uncertainty. The resulting approach combines Bayesian Model Averaging within Bayesian Optimization in order to realize a system capable of autonomously and adaptively learning not only the most promising regions in the materials space but also the models that most efficiently guide such exploration. The framework is demonstrated by efficiently exploring the MAX ternary carbide/nitride space through Density Functional Theory (DFT) calculations.

[1]  R. Arróyave,et al.  Does aluminum play well with others? Intrinsic Al-A alloying behavior in 211/312 MAX phases , 2017 .

[2]  Edward R. Dougherty,et al.  Optimal Bayesian Transfer Learning , 2018, IEEE Transactions on Signal Processing.

[3]  C. Liu,et al.  Effect of valence electron concentration on stability of fcc or bcc phase in high entropy alloys , 2011 .

[4]  Edward R. Dougherty,et al.  Constructing Pathway-Based Priors within a Gaussian Mixture Model for Bayesian Regression and Classification , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[5]  Geoffrey T. Parks,et al.  Multi-criteria material selection in engineering design , 2004 .

[6]  Edward R. Dougherty,et al.  Optimal Experimental Design for Gene Regulatory Networks in the Presence of Uncertainty , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[7]  D. Tromans ELASTIC ANISOTROPY OF HCP METAL CRYSTALS AND POLYCRYSTALS , 2011 .

[8]  R. Arróyave,et al.  Effect of ternary additions to structural properties of NiTi alloys , 2016 .

[9]  R. Hill The Elastic Behaviour of a Crystalline Aggregate , 1952 .

[10]  Blöchl,et al.  Improved tetrahedron method for Brillouin-zone integrations. , 1994, Physical review. B, Condensed matter.

[11]  R DoughertyEdward,et al.  Optimal experimental design for gene regulatory networks in the presence of uncertainty , 2015 .

[12]  Xiaoning Qian,et al.  Bayesian module identification from multiple noisy networks , 2016, EURASIP J. Bioinform. Syst. Biol..

[13]  Adrian E. Raftery,et al.  Bayesian Model Averaging: A Tutorial , 2016 .

[14]  Large difference in the elastic properties of fcc and hcp hard-sphere crystals. , 2003, Physical review letters.

[15]  Atsuto Seko,et al.  Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids , 2013, 1310.1546.

[16]  Paxton,et al.  High-precision sampling for Brillouin-zone integration in metals. , 1989, Physical review. B, Condensed matter.

[17]  Siamak Zamani Dadaneh,et al.  BNP-Seq: Bayesian Nonparametric Differential Expression Analysis of Sequencing Count Data , 2016, 1608.03991.

[18]  Adam D. Bull,et al.  Convergence Rates of Efficient Global Optimization Algorithms , 2011, J. Mach. Learn. Res..

[19]  Ridwan Sakidja,et al.  A genomic approach to the stability, elastic, and electronic properties of the MAX phases , 2014 .

[20]  Wang,et al.  Accurate and simple analytic representation of the electron-gas correlation energy. , 1992, Physical review. B, Condensed matter.

[21]  Edward R. Dougherty,et al.  What should be expected from feature selection in small-sample settings , 2006, Bioinform..

[22]  Krishna Rajan,et al.  Informatics derived materials databases for multifunctional properties , 2015, Science and technology of advanced materials.

[23]  Zhengming Sun,et al.  Progress in research and development on MAX phases: a family of layered ternary compounds , 2011 .

[24]  Warren B. Powell,et al.  A Knowledge-Gradient Policy for Sequential Information Collection , 2008, SIAM J. Control. Optim..

[25]  R. Arróyave,et al.  High-throughput combinatorial study of the effect of M site alloying on the solid solution behavior of M 2 AlC MAX phases , 2016 .

[26]  Wasserman,et al.  Bayesian Model Selection and Model Averaging. , 2000, Journal of mathematical psychology.

[27]  Burke,et al.  Generalized Gradient Approximation Made Simple. , 1996, Physical review letters.

[28]  Christopher K. I. Williams,et al.  Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) , 2005 .

[29]  G. Kresse,et al.  Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set , 1996 .

[30]  David Fletcher,et al.  Bayesian Model Averaging , 2018 .

[31]  W. Kohn,et al.  Self-Consistent Equations Including Exchange and Correlation Effects , 1965 .

[32]  Michel W. Barsoum,et al.  Elastic and Mechanical Properties of the MAX Phases , 2011 .

[33]  M. Barsoum,et al.  MAX phases : Bridging the gap between metals and ceramics MAX phases : Bridging the gap between metals and ceramics , 2013 .

[34]  Ichiro Takeuchi,et al.  Bayesian-Driven First-Principles Calculations for Accelerating Exploration of Fast Ion Conductors for Rechargeable Battery Application , 2018, Scientific Reports.

[35]  Warren B. Powell,et al.  The Knowledge-Gradient Policy for Correlated Normal Beliefs , 2009, INFORMS J. Comput..

[36]  Edward R. Dougherty,et al.  Is cross-validation valid for small-sample microarray classification? , 2004, Bioinform..

[37]  D. Chadi,et al.  Special points for Brillouin-zone integrations , 1977 .

[38]  Paul Saxe,et al.  Symmetry-general least-squares extraction of elastic data for strained materials from ab initio calculations of stress , 2002 .

[39]  Nando de Freitas,et al.  Bayesian Optimization in a Billion Dimensions via Random Embeddings , 2013, J. Artif. Intell. Res..

[40]  Hirokazu Sato,et al.  Determination of electrons per atom ratio for transition metal compounds studied by FLAPW-Fourier calculations , 2016 .

[41]  Michael T. M. Emmerich,et al.  Hypervolume-based expected improvement: Monotonicity properties and exact computation , 2011, 2011 IEEE Congress of Evolutionary Computation (CEC).

[42]  Edward R. Dougherty,et al.  Incorporating biological prior knowledge for Bayesian learning via maximal knowledge-driven information priors , 2017, BMC Bioinformatics.

[43]  Surya R. Kalidindi,et al.  Materials Data Science: Current Status and Future Outlook , 2015 .

[44]  Edward R. Dougherty,et al.  Quantifying the Objective Cost of Uncertainty in Complex Dynamical Systems , 2013, IEEE Transactions on Signal Processing.

[45]  Edward R. Dougherty,et al.  Experimental Design via Generalized Mean Objective Cost of Uncertainty , 2018, IEEE Access.

[46]  Junichiro Shiomi,et al.  Designing Nanostructures for Phonon Transport via Bayesian Optimization , 2016, 1609.04972.

[47]  Nando de Freitas,et al.  Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[48]  Edward R. Dougherty,et al.  Intrinsically Bayesian robust classifier for single-cell gene expression trajectories in gene regulatory networks , 2018, BMC Systems Biology.

[49]  M. Barsoum,et al.  Mechanical Properties of the MAX Phases , 2004 .

[50]  D. Goldberg,et al.  BOA: the Bayesian optimization algorithm , 1999 .

[51]  Ichiro Takeuchi,et al.  Fulfilling the promise of the materials genome initiative with high-throughput experimental methodologies , 2017 .

[52]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[53]  Y. Makino,et al.  Estimation of bulk moduli of compounds by empirical relations between bulk modulus and interatomic distance , 2000 .

[54]  Kirthevasan Kandasamy,et al.  High Dimensional Bayesian Optimisation and Bandits via Additive Models , 2015, ICML.

[55]  Marco Buongiorno Nardelli,et al.  The high-throughput highway to computational materials design. , 2013, Nature materials.

[56]  Edward R. Dougherty,et al.  Intrinsically Bayesian robust Karhunen-Loève compression , 2018, Signal Process..

[57]  J. Rumble CRC Handbook of Chemistry and Physics , 2019 .

[58]  A. Choudhary,et al.  Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science , 2016 .

[59]  Krishna Rajan,et al.  Combinatorial and high-throughput screening of materials libraries: review of state of the art. , 2011, ACS combinatorial science.

[60]  James Theiler,et al.  Adaptive Strategies for Materials Design using Uncertainties , 2016, Scientific Reports.

[61]  James Theiler,et al.  Accelerated search for materials with targeted properties by adaptive design , 2016, Nature Communications.

[62]  Charles H. Ward Materials Genome Initiative for Global Competitiveness , 2012 .

[63]  Edward R. Dougherty,et al.  Optimal experimental design for materials discovery , 2017 .

[64]  Vijay Kumar,et al.  The Role of Valence Electron Concentration in Tuning the Structure, Stability, and Electronic Properties of Mo6S9–xIx Nanowires , 2015 .

[65]  Donald R. Jones,et al.  Efficient Global Optimization of Expensive Black-Box Functions , 1998, J. Glob. Optim..

[66]  Kresse,et al.  Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. , 1996, Physical review. B, Condensed matter.

[67]  Rahul Rao,et al.  Autonomy in materials research: a case study in carbon nanotube growth , 2016 .

[68]  R. Arróyave,et al.  Ab-initio aprroach to the electronic, structural, elastic, and finite-temperature thermodynamic properties of Ti2AX (A = Al or Ga and X = C or N) , 2011 .

[69]  K. Fujimura,et al.  Accelerated Materials Design of Lithium Superionic Conductors Based on First‐Principles Calculations and Machine Learning Algorithms , 2013 .

[70]  S. Suram,et al.  Generating information-rich high-throughput experimental materials genomes using functional clustering via multitree genetic programming and information theory. , 2015, ACS combinatorial science.

[71]  Michel W. Barsoum,et al.  The MN+1AXN phases: A new class of solids , 2000 .

[72]  L. Györfi,et al.  A Distribution-Free Theory of Nonparametric Regression (Springer Series in Statistics) , 2002 .

[73]  Tom Schaul,et al.  Building Machines that Learn and Think for Themselves: Commentary on Lake et al., Behavioral and Brain Sciences, 2017 , 2017, 1711.08378.

[74]  J. Hogden,et al.  Statistical inference and adaptive design for materials discovery , 2017 .

[75]  D. Basak,et al.  Support Vector Regression , 2008 .