Training atomic neural networks using fragment-based data generated in virtual reality.

The ability to understand and engineer molecular structures relies on having accurate descriptions of the energy as a function of atomic coordinates. Here, we outline a new paradigm for deriving energy functions of hyperdimensional molecular systems, which involves generating data for low-dimensional systems in virtual reality (VR) to then efficiently train atomic neural networks (ANNs). This generates high-quality data for specific areas of interest within the hyperdimensional space that characterizes a molecule's potential energy surface (PES). We demonstrate the utility of this approach by gathering data within VR to train ANNs on chemical reactions involving fewer than eight heavy atoms. This strategy enables us to predict the energies of much higher-dimensional systems, e.g., containing nearly 100 atoms. Training on datasets containing only 15k geometries, this approach generates mean absolute errors around 2 kcal mol-1. This represents one of the first times that an ANN-PES for a large reactive radical has been generated using such a small dataset. Our results suggest that VR enables the intelligent curation of high-quality data, which accelerates the learning process.

[1]  Klaus-Robert Müller,et al.  Machine learning of accurate energy-conserving molecular force fields , 2016, Science Advances.

[2]  Moritz P. Haag,et al.  Real‐time quantum chemistry , 2012, 1208.3717.

[3]  J. Behler Atom-centered symmetry functions for constructing high-dimensional neural network potentials. , 2011, The Journal of chemical physics.

[4]  Michael A. Collins,et al.  Molecular potential-energy surfaces for chemical reaction dynamics , 2002 .

[5]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[6]  Marwin H. S. Segler,et al.  Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction. , 2017, Chemistry.

[7]  G. G. Maisuradze,et al.  Interpolating moving least-squares methods for fitting potential energy surfaces: Illustrative approaches and applications , 2003 .

[8]  David W Toth,et al.  The TensorMol-0.1 model chemistry: a neural network augmented with long-range physics , 2017, Chemical science.

[9]  J S Smith,et al.  ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost , 2016, Chemical science.

[10]  Jörg Behler,et al.  Neural network potential-energy surfaces for atomistic simulations , 2010 .

[11]  Robert T. McGibbon,et al.  Osprey: Hyperparameter Optimization for Machine Learning , 2016, J. Open Source Softw..

[12]  David R Glowacki,et al.  Non-equilibrium reaction and relaxation dynamics in a strongly interacting explicit solvent: F + CD3CN treated with a parallel multi-state EVB model. , 2014, The Journal of chemical physics.

[13]  Markus Reiher,et al.  Real‐time feedback from iterative electronic structure calculations , 2015, J. Comput. Chem..

[14]  Arieh Warshel,et al.  An empirical valence bond approach for comparing reactions in solutions and in enzymes , 1980 .

[15]  Oussama Metatla,et al.  Sampling molecular conformations and dynamics in a multiuser virtual reality framework , 2018, Science Advances.

[16]  Joel M. Bowman,et al.  Permutationally invariant potential energy surfaces in high dimensionality , 2009 .

[17]  Gao Shang,et al.  Cubic spline interpolation on new condition , 2011 .

[18]  David R. Glowacki,et al.  Training neural nets to learn reactive potential energy surfaces using interactive quantum chemistry in virtual reality , 2019, The journal of physical chemistry. A.

[19]  Petra Schneider,et al.  Generative Recurrent Networks for De Novo Drug Design , 2017, Molecular informatics.

[20]  Walter Thiel,et al.  Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels. , 2017, The Journal of chemical physics.

[21]  Markus Meuwly,et al.  Toolkit for the Construction of Reproducing Kernel-Based Representations of Data: Application to Multidimensional Potential Energy Surfaces , 2017, J. Chem. Inf. Model..