Adaptive machine learning framework to accelerate ab initio molecular dynamics

Quantum mechanics-based ab initio molecular dynamics (MD) simulation schemes offer an accurate and direct means to monitor the time evolution of materials. Nevertheless, the expensive and repetitive energy and force computations required in such simulations lead to significant bottlenecks. Here, we lay the foundations for an accelerated ab initio MD approach integrated with a machine learning framework. The proposed algorithm learns from previously visited configurations in a continuous and adaptive manner on-the-fly, and predicts (with chemical accuracy) the energies and atomic forces of a new configuration at a minuscule fraction of the time taken by conventional ab initio methods. Key elements of this new accelerated ab initio MD paradigm include representations of atomic configurations by numerical fingerprints, a learning algorithm to map the fingerprints to the properties, a decision engine that guides the choice of the prediction scheme, and requisite amount of ab initio data. The performance of each aspect of the proposed scheme is critically evaluated for Al in several different chemical environments. This work has enormous implications beyond ab initio MD acceleration. It can also lead to accelerated structure and property prediction schemes, and accurate force fields. V C 2014 Wiley Periodicals, Inc.

[1]  Ghanshyam Pilania,et al.  Rational design of all organic polymer dielectrics , 2014, Nature Communications.

[2]  Jörg Neugebauer,et al.  Density functional theory in materials science , 2013, Wiley interdisciplinary reviews. Computational molecular science.

[3]  Gerbrand Ceder,et al.  Predicting crystal structure by merging data mining with quantum mechanics , 2006, Nature materials.

[4]  Anubhav Jain,et al.  Finding Nature’s Missing Ternary Oxide Compounds Using Machine Learning and Density Functional Theory , 2010 .

[5]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[6]  Abhijit Chatterjee,et al.  An overview of spatial microscopic and accelerated kinetic Monte Carlo methods , 2007 .

[7]  R. Kondor,et al.  Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. , 2009, Physical review letters.

[8]  A. Voter,et al.  Temperature-accelerated dynamics for simulation of infrequent events , 2000 .

[9]  Krishna Rajan,et al.  Data-Driven Model for Estimation of Friction Coefficient Via Informatics Methods , 2012, Tribology Letters.

[10]  Karsten Wedel Jacobsen,et al.  Designing rules and probabilistic weighting for fast materials discovery in the Perovskite structure , 2014 .

[11]  Ian M. Torrens,et al.  CHAPTER IV – EMPIRICAL INTERATOMIC POTENTIALS , 1972 .

[12]  Sanguthevar Rajasekaran,et al.  Accelerating materials property predictions using machine learning , 2013, Scientific Reports.

[13]  A. Becke Perspective: Fifty years of density-functional theory in chemical physics. , 2014, The Journal of chemical physics.

[14]  R. Ramprasad,et al.  Machine Learning in Materials Science , 2016 .

[15]  K. Müller,et al.  Fast and accurate modeling of molecular atomization energies with machine learning. , 2011, Physical review letters.

[16]  I. Takeuchi,et al.  Rapid structural mapping of ternary metallic alloy systems using the combinatorial approach and cluster analysis. , 2007, The Review of scientific instruments.

[17]  Wang,et al.  Generalized gradient approximation for the exchange-correlation hole of a many-electron system. , 1996, Physical review. B, Condensed matter.

[18]  R. Kondor,et al.  On representing chemical environments , 2012, 1209.3140.

[19]  Robert P. Sheridan,et al.  Using Random Forest To Model the Domain Applicability of Another Random Forest Model , 2013, J. Chem. Inf. Model..

[20]  Hiromasa Kaneko,et al.  Applicability Domain Based on Ensemble Learning in Classification and Regression Analyses , 2014, J. Chem. Inf. Model..

[21]  Ian Witten,et al.  Data Mining , 2000 .

[22]  Liping Yu,et al.  Sorting Stable versus Unstable Hypothetical Compounds: The Case of Multi‐Functional ABX Half‐Heusler Filled Tetrahedral Structures , 2012 .

[23]  A. Voter,et al.  Extending the Time Scale in Atomistic Simulation of Materials Annual Re-views in Materials Research , 2002 .

[24]  Mario Valle,et al.  Evolutionary Crystal Structure Prediction as a Method for the Discovery of Minerals and Materials , 2010 .

[25]  Gerbrand Ceder,et al.  World changing ideas. 10 ways science may jazz up our gadgets, help to solve our most intractable problems and save lives. , 2013, Scientific American.

[26]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[27]  Anubhav Jain,et al.  From the computer to the laboratory: materials discovery and design using first-principles calculations , 2012, Journal of Materials Science.

[28]  S. Goedecker Minima hopping: an efficient search method for the global minimum of the potential energy surface of complex molecular systems. , 2004, The Journal of chemical physics.

[29]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[30]  Klaus-Robert Müller,et al.  Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies. , 2013, Journal of chemical theory and computation.

[31]  Manh Cuong Nguyen,et al.  On-the-fly machine-learning for high-throughput experiments: search for rare-earth-free permanent magnets , 2014, Scientific Reports.

[32]  Lusann Yang,et al.  Proposed definition of crystal substructure and substructural similarity , 2014 .

[33]  Krishna Rajan,et al.  Identifying the ‘inorganic gene’ for high-temperature piezoelectric perovskites through statistical learning , 2011, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[34]  Alex Smola,et al.  Kernel methods in machine learning , 2007, math/0701907.

[35]  A. Voter Hyperdynamics: Accelerated Molecular Dynamics of Infrequent Events , 1997 .

[36]  J. Mongan,et al.  Accelerated molecular dynamics: a promising and efficient simulation method for biomolecules. , 2004, The Journal of chemical physics.

[37]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[38]  J. Behler Atom-centered symmetry functions for constructing high-dimensional neural network potentials. , 2011, The Journal of chemical physics.

[39]  G. Kresse,et al.  From ultrasoft pseudopotentials to the projector augmented-wave method , 1999 .

[40]  J. Elliott Novel approaches to multiscale modelling in materials science , 2011 .

[41]  Blöchl,et al.  Projector augmented-wave method. , 1994, Physical review. B, Condensed matter.

[42]  A. Laio,et al.  Metadynamics: a method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science , 2008 .

[43]  Arthur F. Voter,et al.  Accelerating Atomistic Simulations of Defect Dynamics: Hyperdynamics, Parallel Replica Dynamics, and Temperature-Accelerated Dynamics , 1998 .

[44]  Kresse,et al.  Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. , 1996, Physical review. B, Condensed matter.

[45]  G. Henkelman,et al.  Long time scale kinetic Monte Carlo simulations without lattice approximation and predefined event table , 2001 .

[46]  Peter Delves,et al.  Encyclopedia of life sciences , 2009 .

[47]  Krishna Rajan,et al.  “Property Phase Diagrams” for Compound Semiconductors through Data Mining , 2013, Materials.

[48]  Stefano Curtarolo,et al.  High-throughput and data mining with ab initio methods , 2004 .

[49]  A. Voter A method for accelerating the molecular dynamics simulation of infrequent events , 1997 .

[50]  M. Rupp,et al.  Machine learning of molecular electronic properties in chemical compound space , 2013, 1305.7074.

[51]  Ferran Sanz,et al.  Applicability Domain Analysis (ADAN): A Robust Method for Assessing the Reliability of Drug Property Predictions , 2014, J. Chem. Inf. Model..

[52]  Kristof T. Schütt,et al.  How to represent crystal structures for machine learning: Towards fast prediction of electronic properties , 2013, 1307.1266.

[53]  J. Doye,et al.  Global Optimization by Basin-Hopping and the Lowest Energy Structures of Lennard-Jones Clusters Containing up to 110 Atoms , 1997, cond-mat/9803344.

[54]  J. Behler Neural network potential-energy surfaces in chemistry: a tool for large-scale simulations. , 2011, Physical chemistry chemical physics : PCCP.