Multi-task learning for electronic structure to predict and explore molecular potential energy surfaces

We refine the OrbNet model to accurately predict energy, forces, and other response properties for molecules using a graph neural-network architecture based on features from low-cost approximated quantum operators in the symmetry-adapted atomic orbital basis. The model is end-to-end differentiable due to the derivation of analytic gradients for all electronic structure terms, and is shown to be transferable across chemical space due to the use of domain-specific features. The learning efficiency is improved by incorporating physically motivated constraints on the electronic structure through multi-task learning. The model outperforms existing methods on energy prediction tasks for the QM9 dataset and for molecular geometry optimizations on conformer datasets, at a computational cost that is thousand-fold or more reduced compared to conventional quantum-chemistry calculations (such as density functional theory) that offer similar accuracy.

[1]  J. Magnus On Differentiating Eigenvalues and Eigenvectors , 1985, Econometric Theory.

[2]  Frederick R. Manby,et al.  Fast Hartree–Fock theory using local density fitting approximations , 2004 .

[3]  F. Weigend,et al.  Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy. , 2005, Physical chemistry chemical physics : PCCP.

[4]  Florian Weigend,et al.  Hartree–Fock exchange fitting basis sets for H to Rn † , 2008, J. Comput. Chem..

[5]  Jeng-Da Chai,et al.  Long-Range Corrected Hybrid Density Functionals with Improved Dispersion Corrections. , 2012, Journal of chemical theory and computation.

[6]  Pavlo O. Dral,et al.  Quantum chemistry structures and properties of 134 kilo molecules , 2014, Scientific Data.

[7]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[8]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Lee-Ping Wang,et al.  Geometry optimization made simple with translation and rotation coordinates. , 2016, The Journal of chemical physics.

[10]  Kaiming He,et al.  Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour , 2017, ArXiv.

[11]  Stefan Grimme,et al.  A Robust and Accurate Tight-Binding Quantum Chemical Method for Structures, Vibrational Frequencies, and Noncovalent Interactions of Large Molecular Systems Parametrized for All spd-Block Elements (Z = 1-86). , 2017, Journal of chemical theory and computation.

[12]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[13]  C. Bannwarth,et al.  B97-3c: A revised low-cost variant of the B97-D density functional method. , 2018, The Journal of chemical physics.

[14]  Zhao Chen,et al.  GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks , 2017, ICML.

[15]  Frederick R. Manby,et al.  entos: A Quantum Molecular Simulation Package , 2019 .

[16]  Yee Whye Teh,et al.  Set Transformer , 2018, ICML.

[17]  Frederick R. Manby,et al.  OrbNet: Deep Learning for Quantum Chemistry Using Symmetry-Adapted Atomic-Orbital Features , 2020, The Journal of chemical physics.