A smooth basis for atomistic machine learning

Machine learning frameworks based on correlations of interatomic positions begin with a discretized description of the density of other atoms in the neighborhood of each atom in the system. Symmetry considerations support the use of spherical harmonics to expand the angular dependence of this density, but there is, as of yet, no clear rationale to choose one radial basis over another. Here, we investigate the basis that results from the solution of the Laplacian eigenvalue problem within a sphere around the atom of interest. We show that this generates a basis of controllable smoothness within the sphere (in the same sense as plane waves provide a basis with controllable smoothness for a problem with periodic boundaries) and that a tensor product of Laplacian eigenstates also provides a smooth basis for expanding any higher-order correlation of the atomic density within the appropriate hypersphere. We consider several unsupervised metrics of the quality of a basis for a given dataset and show that the Laplacian eigenstate basis has a performance that is much better than some widely used basis sets and competitive with data-driven bases that numerically optimize each metric. Finally, we investigate the role of the basis in building models of the potential energy. In these tests, we find that a combination of the Laplacian eigenstate basis and target-oriented heuristics leads to equal or improved regression performance when compared to both heuristic and data-driven bases in the literature. We conclude that the smoothness of the basis functions is a key aspect of successful atomic density representations.

[1]  Michael J. Willatt,et al.  Comment on "Manifolds of quasi-constant SOAP and ACSF fingerprints and the resulting failure to machine learn four-body interactions" [J. Chem. Phys. 156, 034302 (2022)]. , 2022, Journal of Chemical Physics.

[2]  Jigyasa Nigam,et al.  Unified theory of atom-centered representations and message-passing machine-learning schemes. , 2022, The Journal of chemical physics.

[3]  Anders S. Christensen,et al.  Informing geometric deep learning with electronic interactions to accelerate quantum chemistry , 2021, Proceedings of the National Academy of Sciences of the United States of America.

[4]  S. Goedecker,et al.  Manifolds of quasi-constant SOAP and ACSF fingerprints and the resulting failure to machine learn four-body interactions. , 2021, The Journal of chemical physics.

[5]  Jonathan P. Mailoa,et al.  E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials , 2021, Nature Communications.

[6]  Cas van der Oord,et al.  Atomic cluster expansion: Completeness, efficiency and stability , 2019, J. Comput. Phys..

[7]  Cas van der Oord,et al.  Linear Atomic Cluster Expansion Force Fields for Organic Molecules: Beyond RMSE , 2021, Journal of chemical theory and computation.

[8]  Gábor Csányi,et al.  Local invertibility and sensitivity of atomic structure-feature mappings , 2021, Open research Europe.

[9]  Michele Ceriotti,et al.  Optimal radial basis for density-based atomic representations , 2021, The Journal of chemical physics.

[10]  Michael J. Willatt,et al.  Efficient implementation of atom-density representations. , 2021, The Journal of chemical physics.

[11]  Gábor Csányi,et al.  Physics-Inspired Structural Representations for Molecules and Materials. , 2021, Chemical reviews.

[12]  Michele Ceriotti,et al.  Improving sample and feature selection with principal covariates regression , 2020, Mach. Learn. Sci. Technol..

[13]  Michele Ceriotti,et al.  The role of feature space in atomistic learning , 2020, Mach. Learn. Sci. Technol..

[14]  Stefan Goedecker,et al.  An assessment of the structural resolution of various fingerprints commonly used in machine learning , 2020, Mach. Learn. Sci. Technol..

[15]  Alexander V. Shapeev,et al.  The MLIP package: moment tensor potentials with MPI and active learning , 2020, Mach. Learn. Sci. Technol..

[16]  O. Anatole von Lilienfeld,et al.  On the role of gradients for machine learning of molecular energies and forces , 2020, Mach. Learn. Sci. Technol..

[17]  Jigyasa Nigam,et al.  Recursive evaluation and iterative contraction of N-body equivariant features. , 2020, The Journal of chemical physics.

[18]  Christoph Ortner,et al.  Sensitivity and dimensionality of atomic environment representations used for machine learning interatomic potentials. , 2020, The Journal of chemical physics.

[19]  C. Pickard AIRSS data for carbon at 10GPa and the C+N+H+O system at 1GPa , 2020 .

[20]  Cas van der Oord,et al.  Regularised atomic body-ordered permutation-invariant polynomials for the construction of interatomic potentials , 2019, Mach. Learn. Sci. Technol..

[21]  Filippo Federici Canova,et al.  DScribe: Library of Descriptors for Machine Learning in Materials Science , 2019, Comput. Phys. Commun..

[22]  Risi Kondor,et al.  Cormorant: Covariant Molecular Neural Networks , 2019, NeurIPS.

[23]  Ralf Drautz,et al.  Atomic cluster expansion for accurate and transferable interatomic potentials , 2019, Physical Review B.

[24]  Michele Ceriotti,et al.  A Data-Driven Construction of the Periodic Table of the Elements , 2018, 1807.00236.

[25]  Jörg Behler,et al.  Automatic selection of atomic fingerprints and reference configurations for machine-learning potentials. , 2018, The Journal of chemical physics.

[26]  Aidan P Thompson,et al.  Extending the accuracy of the SNAP interatomic potential form. , 2017, The Journal of chemical physics.

[27]  O. A. von Lilienfeld,et al.  Communication: Understanding molecular representations in machine learning: The role of uniqueness and target similarity. , 2016, The Journal of chemical physics.

[28]  Alexander V. Shapeev,et al.  Moment Tensor Potentials: A Class of Systematically Improvable Interatomic Potentials , 2015, Multiscale Model. Simul..

[29]  Gábor Csányi,et al.  Gaussian approximation potentials: A brief tutorial introduction , 2015, 1502.01366.

[30]  R. Kondor,et al.  On representing chemical environments , 2012, 1209.3140.

[31]  J. Behler Atom-centered symmetry functions for constructing high-dimensional neural network potentials. , 2011, The Journal of chemical physics.

[32]  Chris J Pickard,et al.  Ab initio random structure searching , 2011, Journal of physics. Condensed matter : an Institute of Physics journal.

[33]  R. Kondor,et al.  Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. , 2009, Physical review letters.

[34]  Michele Parrinello,et al.  Generalized neural-network representation of high-dimensional potential-energy surfaces. , 2007, Physical review letters.

[35]  P.-L. Chau,et al.  A new order parameter for tetrahedral configurations , 1998 .