Optimizing many-body atomic descriptors for enhanced computational performance of machine learning based interatomic potentials

We explore different ways to simplify the evaluation of the smooth overlap of atomic positions (SOAP) many-body atomic descriptor [Bart\'ok et al., Phys. Rev. B 87, 184115 (2013).]. Our aim is to improve the computational efficiency of SOAP-based similarity kernel construction. While these improved atomic descriptors can be used for general characterization and interpolation of atomic properties, their main target application is accelerated evaluation of machine-learning-based interatomic potentials within the Gaussian approximation potential (GAP) framework [Bart\'ok et al., Phys. Rev. Lett. 104, 136403 (2010)]. We achieve this objective by expressing the atomic densities in an approximate separable form, which decouples the radial and angular channels. We then express the elements of the SOAP descriptor (i.e., the expansion coefficients for the atomic densities) in analytical form given a particular choice of radial basis set. Finally, we derive recursion formulas for the expansion coefficients. This new SOAP-based descriptor allows for tenfold speedups compared to previous implementations, while improving the stability of the radial expansion for distant atomic neighbors, without degradation of the interpolation power of GAP models.

[1]  Gábor Csányi,et al.  Comparing molecules and solids across structural and alchemical space. , 2015, Physical chemistry chemical physics : PCCP.

[2]  Michele Parrinello,et al.  Generalized neural-network representation of high-dimensional potential-energy surfaces. , 2007, Physical review letters.

[3]  I. Sobol Uniformly distributed sequences with an additional uniform property , 1976 .

[4]  R. Kondor,et al.  On representing chemical environments , 2012, 1209.3140.

[5]  Volker L. Deringer,et al.  Machine learning based interatomic potential for amorphous carbon , 2016, 1611.03277.

[6]  Michele Ceriotti,et al.  A Data-Driven Construction of the Periodic Table of the Elements , 2018, 1807.00236.

[7]  Gábor Csányi,et al.  Gaussian approximation potentials: A brief tutorial introduction , 2015, 1502.01366.

[8]  J. Behler Atom-centered symmetry functions for constructing high-dimensional neural network potentials. , 2011, The Journal of chemical physics.

[9]  Single-centre expansion of Gaussian basis functions and the angular decomposition of their overlap integrals , 1989 .

[10]  Volker L. Deringer,et al.  Growth Mechanism and Origin of High sp^{3} Content in Tetrahedral Amorphous Carbon. , 2018, Physical review letters.

[11]  Noam Bernstein,et al.  Realistic Atomistic Structure of Amorphous Silicon from Machine-Learning-Driven Molecular Dynamics. , 2018, The journal of physical chemistry letters.

[12]  G. Arfken Mathematical Methods for Physicists , 1967 .

[13]  Michele Ceriotti,et al.  Atom-density representations for machine learning. , 2018, The Journal of chemical physics.

[14]  Matthias Rupp,et al.  Unified representation of molecules and crystals for machine learning , 2017, Mach. Learn. Sci. Technol..

[15]  G. Arfken,et al.  Mathematical methods for physicists 6th ed. , 1996 .

[16]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[17]  Tomi Laurila,et al.  Reactivity of Amorphous Carbon Surfaces: Rationalizing the Role of Structural Motifs in Functionalization Using Machine Learning , 2018, Chemistry of materials : a publication of the American Chemical Society.

[18]  R. Kondor,et al.  Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. , 2009, Physical review letters.

[19]  Adam S. Foster,et al.  Machine learning hydrogen adsorption on nanoclusters through structural descriptors , 2018, npj Computational Materials.

[20]  Filippo Federici Canova,et al.  DScribe: Library of Descriptors for Machine Learning in Materials Science , 2019, Comput. Phys. Commun..