Accurate and transferable multitask prediction of chemical properties with an atoms-in-molecules neural network

We introduce a modular, chemically inspired deep neural network model for prediction of several atomic and molecular properties. Atomic and molecular properties could be evaluated from the fundamental Schrodinger’s equation and therefore represent different modalities of the same quantum phenomena. Here, we present AIMNet, a modular and chemically inspired deep neural network potential. We used AIMNet with multitarget training to learn multiple modalities of the state of the atom in a molecular system. The resulting model shows on several benchmark datasets state-of-the-art accuracy, comparable to the results of orders of magnitude more expensive DFT methods. It can simultaneously predict several atomic and molecular properties without an increase in the computational cost. With AIMNet, we show a new dimension of transferability: the ability to learn new targets using multimodal information from previous training. The model can learn implicit solvation energy (SMD method) using only a fraction of the original training data and an archive median absolute deviation error of 1.1 kcal/mol compared to experimental solvation free energies in the MNSol database.

[1]  Quanquan Gu,et al.  Exploring the use of adaptive gradient methods in effective deep learning systems , 2018, 2018 Systems and Information Engineering Design Symposium (SIEDS).

[2]  Nancy Forbes,et al.  Imitation of Life: How Biology Is Inspiring Computing , 2004 .

[3]  R. Bader Atoms in molecules : a quantum theory , 1990 .

[4]  Michele Parrinello,et al.  Generalized neural-network representation of high-dimensional potential-energy surfaces. , 2007, Physical review letters.

[5]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[6]  Louis-Philippe Morency,et al.  Multimodal Machine Learning: A Survey and Taxonomy , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[8]  Frank Neese,et al.  An overlap fitted chain of spheres exchange method. , 2011, The Journal of chemical physics.

[9]  Alexander V. Shapeev,et al.  Active learning of linearly parametrized interatomic potentials , 2016, 1611.09346.

[10]  Heather J Kulik,et al.  Accelerating Chemical Discovery with Machine Learning: Simulated Evolution of Spin Crossover Complexes with an Artificial Neural Network. , 2018, The journal of physical chemistry letters.

[11]  T. Manz Comment on "Minimal Basis Iterative Stockholder: Atoms in Molecules for Force-Field Development" , 2017, 1701.01714.

[12]  Manoj K. Kesharwani,et al.  The S66x8 benchmark for noncovalent interactions revisited: explicitly correlated ab initio methods and density functional theory. , 2016, Physical chemistry chemical physics : PCCP.

[13]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[14]  Thomas F. Miller,et al.  Transferability in Machine Learning for Electronic Structure via the Molecular Orbital Basis. , 2018, Journal of chemical theory and computation.

[15]  L. Rulíšek,et al.  Toward Accurate Conformational Energies of Smaller Peptides and Medium-Sized Macrocycles: MPCONF196 Benchmark Energy Data Set. , 2018, Journal of chemical theory and computation.

[16]  Klaus-Robert Müller,et al.  Machine learning of accurate energy-conserving molecular force fields , 2016, Science Advances.

[17]  K-R Müller,et al.  SchNet - A deep learning architecture for molecules and materials. , 2017, The Journal of chemical physics.

[18]  Jorge Luis Rodriguez,et al.  The Open Science Grid , 2005 .

[19]  K. Müller,et al.  Fast and accurate modeling of molecular atomization energies with machine learning. , 2011, Physical review letters.

[20]  Louis B. Rall,et al.  Automatic differentiation , 1981 .

[21]  Friedemann Pulvermüller,et al.  Brain mechanisms linking language and action , 2005, Nature Reviews Neuroscience.

[22]  Kipton Barros,et al.  Discovering a Transferable Charge Assignment Model Using Machine Learning. , 2018, The journal of physical chemistry letters.

[23]  Nancy Forbes Imitation of Life , 2004 .

[24]  Alán Aspuru-Guzik,et al.  Inverse molecular design using machine learning: Generative models for matter engineering , 2018, Science.

[25]  Michael Gastegger,et al.  Machine learning molecular dynamics for the simulation of infrared spectra† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c7sc02267k , 2017, Chemical science.

[26]  David W Toth,et al.  The TensorMol-0.1 model chemistry: a neural network augmented with long-range physics , 2017, Chemical science.

[27]  Adrian E. Roitberg,et al.  Less is more: sampling chemical space with active learning , 2018, The Journal of chemical physics.

[28]  Daniel W. Davies,et al.  Machine learning for molecular and materials science , 2018, Nature.

[29]  Geoffrey J. Gordon,et al.  A Density Functional Tight Binding Layer for Deep Learning of Chemical Hamiltonians. , 2018, Journal of chemical theory and computation.

[30]  Heather J Kulik,et al.  Predicting electronic structure properties of transition metal complexes with neural networks† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c7sc01247k , 2017, Chemical science.

[31]  Jan H. Jensen,et al.  Improving solvation energy predictions using the SMD solvation method and semiempirical electronic structure methods. , 2018, The Journal of chemical physics.

[32]  Samuel S. Schoenholz,et al.  Neural Message Passing for Quantum Chemistry , 2017, ICML.

[33]  J. Behler First Principles Neural Network Potentials for Reactive Simulations of Large Molecular and Condensed Systems. , 2017, Angewandte Chemie.

[34]  Benjamin D. Sellers,et al.  A Comparison of Quantum and Molecular Mechanical Methods to Estimate Strain Energy in Druglike Fragments , 2017, J. Chem. Inf. Model..

[35]  J S Smith,et al.  ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost , 2016, Chemical science.

[36]  Kipton Barros,et al.  Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning , 2019, Nature Communications.

[37]  P. Ayers,et al.  Minimal Basis Iterative Stockholder: Atoms in Molecules for Force-Field Development. , 2016, Journal of chemical theory and computation.

[38]  Mike Preuss,et al.  Planning chemical syntheses with deep neural networks and symbolic AI , 2017, Nature.

[39]  Igor Sfiligoi,et al.  The Pilot Way to Grid Resources Using glideinWMS , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[40]  Frank Neese,et al.  The ORCA program system , 2012 .