On-the-fly active learning of interpretable Bayesian force fields for atomistic rare events

Machine learned force fields typically require manual construction of training sets consisting of thousands of first principles calculations, which can result in low training efficiency and unpredictable errors when applied to structures not represented in the training set of the model. This severely limits the practical application of these models in systems with dynamics governed by important rare events, such as chemical reactions and diffusion. We present an adaptive Bayesian inference method for automating the training of interpretable, low-dimensional, and multi-element interatomic force fields using structures drawn on the fly from molecular dynamics simulations. Within an active learning framework, the internal uncertainty of a Gaussian process regression model is used to decide whether to accept the model prediction or to perform a first principles calculation to augment the training set of the model. The method is applied to a range of single- and multi-element systems and shown to achieve a favorable balance of accuracy and computational efficiency, while requiring a minimal amount of ab initio training data. We provide a fully open-source implementation of our method, as well as a procedure to map trained models to computationally efficient tabulated force fields.

[1]  Alexander V. Shapeev,et al.  Moment Tensor Potentials: A Class of Systematically Improvable Interatomic Potentials , 2015, Multiscale Model. Simul..

[2]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[3]  Gisbert Schneider,et al.  Machine Learning Estimates of Natural Product Conformational Energies , 2014, PLoS Comput. Biol..

[4]  Priya Vashishta,et al.  Structural Transitions in Superionic Conductors , 1983 .

[5]  S. Hull,et al.  Superionics: crystal structures and conduction processes , 2004 .

[6]  Otfried Madelung,et al.  II-VI and I-VII compounds, semimagnetic compounds , 1999 .

[7]  Ju Li,et al.  AtomEye: an efficient atomistic configuration viewer , 2003 .

[8]  Michael Gastegger,et al.  Machine learning molecular dynamics for the simulation of infrared spectra† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c7sc02267k , 2017, Chemical science.

[9]  Adrian E. Roitberg,et al.  Less is more: sampling chemical space with active learning , 2018, The Journal of chemical physics.

[10]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[11]  Gábor Csányi,et al.  Accuracy and transferability of Gaussian approximation potential models for tungsten , 2014 .

[12]  Noam Bernstein,et al.  Machine Learning a General-Purpose Interatomic Potential for Silicon , 2018, Physical Review X.

[13]  Noam Bernstein,et al.  De novo exploration and self-guided learning of potential-energy surfaces , 2019, npj Computational Materials.

[14]  Nicola Marzari,et al.  Dynamical structure, bonding, and thermodynamics of the superionic sublattice in alpha-AgI. , 2006, Physical review letters.

[15]  Aldo Glielmo,et al.  Efficient nonparametric n -body force fields from machine learning , 2018, 1801.04823.

[16]  Peter Sollich,et al.  Accurate interatomic force fields via machine learning with covariant kernels , 2016, 1611.03877.

[17]  Ankit Mishra,et al.  Multiobjective genetic training and uncertainty quantification of reactive force fields , 2018, npj Computational Materials.

[18]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[19]  M. Kramer,et al.  Highly optimized embedded-atom-method potentials for fourteen fcc metals , 2011 .

[20]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[21]  Georg Kresse,et al.  On-the-fly machine learning force field generation: Application to melting points , 2019, Physical Review B.

[22]  J. Behler Neural network potential-energy surfaces in chemistry: a tool for large-scale simulations. , 2011, Physical chemistry chemical physics : PCCP.

[23]  Klaus-Robert Müller,et al.  Machine learning of accurate energy-conserving molecular force fields , 2016, Science Advances.

[24]  H. Kulik,et al.  A Quantitative Uncertainty Metric Controls Error in Neural Network-Driven Chemical Discovery , 2019 .

[25]  E Weinan,et al.  Deep Potential Molecular Dynamics: a scalable model with the accuracy of quantum mechanics , 2017, Physical review letters.

[26]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[27]  Mordechai Kornbluth,et al.  A fast neural network approach for direct covariant forces prediction in complex multi-element extended systems , 2019, Nature Machine Intelligence.

[28]  G. Kirk,et al.  The greenhouse gas impacts of converting food production in England and Wales to organic methods , 2019, Nature Communications.

[29]  Klaus-Robert Müller,et al.  SchNet: A continuous-filter convolutional neural network for modeling quantum interactions , 2017, NIPS.

[30]  Michele Ceriotti,et al.  Fast and Accurate Uncertainty Estimation in Chemical Machine Learning. , 2018, Journal of chemical theory and computation.

[31]  Zhenwei Li On-the-fly machine learning of quantum mechanical forces and its potential applications for large scale molecular dynamics , 2014 .

[32]  Alessandro De Vita,et al.  Building Nonparametric n-Body Force Fields Using Gaussian Process Regression , 2019, Machine Learning Meets Quantum Physics.

[33]  Scheffler,et al.  Theory of self-diffusion at and growth of Al(111). , 1994, Physical review letters.

[34]  R. Kondor,et al.  Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. , 2009, Physical review letters.

[35]  Volker L. Deringer,et al.  Machine learning based interatomic potential for amorphous carbon , 2016, 1611.03277.

[36]  R. Kondor,et al.  On representing chemical environments , 2012, 1209.3140.

[37]  Kipton Barros,et al.  Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning , 2019, Nature Communications.

[38]  Weber,et al.  Computer simulation of local order in condensed phases of silicon. , 1985, Physical review. B, Condensed matter.

[39]  Michael Walter,et al.  The atomic simulation environment-a Python library for working with atoms. , 2017, Journal of physics. Condensed matter : an Institute of Physics journal.

[40]  Gus L. W. Hart,et al.  Accelerating high-throughput searches for new alloys with active learning of interatomic potentials , 2018, Computational Materials Science.

[41]  Gábor Csányi,et al.  Comparing molecules and solids across structural and alchemical space. , 2015, Physical chemistry chemical physics : PCCP.

[42]  Alexander V. Shapeev,et al.  Accelerating crystal structure prediction by machine-learning interatomic potentials with active learning , 2018, Physical Review B.

[43]  Zhenwei Li,et al.  Molecular dynamics with on-the-fly machine learning of quantum-mechanical forces. , 2015, Physical review letters.

[44]  Alexander V. Shapeev,et al.  Active learning of linearly parametrized interatomic potentials , 2016, 1611.09346.

[45]  E Weinan,et al.  End-to-end Symmetry Preserving Inter-atomic Potential Energy Model for Finite and Extended Systems , 2018, NeurIPS.

[46]  Gábor Csányi,et al.  Gaussian approximation potentials: A brief tutorial introduction , 2015, 1502.01366.

[47]  Noam Bernstein,et al.  Research data supporting "De novo exploration and self-guided learning of potential-energy surfaces" , 2019 .

[48]  Volker L. Deringer,et al.  Data-Driven Learning of Total and Local Energies in Elemental Boron. , 2017, Physical review letters.

[49]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[50]  Christian Trott,et al.  Spectral neighbor analysis method for automated generation of quantum-accurate interatomic potentials , 2014, J. Comput. Phys..

[51]  Roy Tärneberg,et al.  Self-diffusion of Silver Ions in the Cubic High Temperature Modification of Silver Iodide , 1970 .

[52]  Konstantin Gubaev,et al.  Machine learning of molecular properties: Locality and active learning. , 2017, The Journal of chemical physics.

[53]  Rampi Ramprasad,et al.  Machine Learning Force Fields: Construction, Validation, and Outlook , 2016, 1610.02098.

[54]  Siu Kwan Lam,et al.  Numba: a LLVM-based Python JIT compiler , 2015, LLVM '15.

[55]  Georg Kresse,et al.  Phase Transitions of Hybrid Perovskites Simulated by Machine-Learning Force Fields Trained on the Fly with Bayesian Inference. , 2019, Physical review letters.

[56]  Scheffler,et al.  Ab initio calculations of energies and self-diffusion on flat and stepped surfaces of Al and their implications on crystal growth. , 1996, Physical review. B, Condensed matter.

[57]  Elena Uteva,et al.  Active learning in Gaussian process interpolation of potential energy surfaces. , 2018, The Journal of chemical physics.

[58]  Carl E. Rasmussen,et al.  A Unifying View of Sparse Approximate Gaussian Process Regression , 2005, J. Mach. Learn. Res..

[59]  E Weinan,et al.  Active Learning of Uniformly Accurate Inter-atomic Potentials for Materials Simulation , 2018, Physical Review Materials.

[60]  M. Kunitski,et al.  Double-slit photoelectron interference in strong-field ionization of the neon dimer , 2018, Nature Communications.

[61]  J. Behler Atom-centered symmetry functions for constructing high-dimensional neural network potentials. , 2011, The Journal of chemical physics.

[62]  Stefano de Gironcoli,et al.  QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials , 2009, Journal of physics. Condensed matter : an Institute of Physics journal.