Acceleration of High Angular Momentum Electron Repulsion Integrals and Integral Derivatives on Graphics Processing Units.

We present an efficient implementation of ab initio self-consistent field (SCF) energy and gradient calculations that run on Compute Unified Device Architecture (CUDA) enabled graphical processing units (GPUs) using recurrence relations. We first discuss the machine-generated code that calculates the electron-repulsion integrals (ERIs) for different ERI types. Next we describe the porting of the SCF gradient calculation to GPUs, which results in an acceleration of the computation of the first-order derivative of the ERIs. However, only s, p, and d ERIs and s and p derivatives could be executed simultaneously on GPUs using the current version of CUDA and generation of NVidia GPUs using a previously described algorithm [Miao and Merz J. Chem. Theory Comput. 2013, 9, 965-976.]. Hence, we developed an algorithm to compute f type ERIs and d type ERI derivatives on GPUs. Our benchmarks shows the performance GPU enable ERI and ERI derivative computation yielded speedups of 10-18 times relative to traditional CPU execution. An accuracy analysis using double-precision calculations demonstrates that the overall accuracy is satisfactory for most applications.

[1]  Kenneth M Merz,et al.  Divide-and-Conquer Hartree-Fock Calculations on Proteins. , 2010, Journal of chemical theory and computation.

[2]  Koji Yasuda,et al.  Two‐electron integral evaluation on the graphics processor unit , 2008, J. Comput. Chem..

[3]  Christine M Isborn,et al.  Electronic Absorption Spectra from MM and ab initio QM/MM Molecular Dynamics: Environmental Effects on the Absorption Spectrum of Photoactive Yellow Protein. , 2012, Journal of chemical theory and computation.

[4]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[5]  Todd J. Martínez,et al.  Generating Efficient Quantum Chemistry Codes for Novel Architectures. , 2013, Journal of chemical theory and computation.

[6]  A Eugene DePrince,et al.  Coupled Cluster Theory on Graphics Processing Units I. The Coupled Cluster Doubles Method. , 2011, Journal of chemical theory and computation.

[7]  E. Davidson,et al.  One- and two-electron integrals over cartesian gaussian functions , 1978 .

[8]  Karl A. Wilkinson,et al.  Acceleration of the GAMESS‐UK electronic structure package on graphical processing units , 2011, J. Comput. Chem..

[9]  Yihan Shao,et al.  Accelerating resolution-of-the-identity second-order Møller-Plesset quantum chemistry calculations with graphical processing units. , 2008, The journal of physical chemistry. A.

[10]  Kazuhiro Ishida,et al.  Efficient determination and characterization of transition states using ab-initio methods , 1977 .

[11]  Gustavo E. Scuseria,et al.  A quantitative study of the scaling properties of the Hartree–Fock method , 1995 .

[12]  Shigeru Obara,et al.  General recurrence formulas for molecular integrals over Cartesian Gaussian functions , 1988 .

[13]  Ivan S. Ufimtsev,et al.  Dynamic Precision for Electron Repulsion Integral Evaluation on Graphical Processing Units (GPUs). , 2011, Journal of chemical theory and computation.

[14]  Todd J. Martínez,et al.  Charge Transfer and Polarization in Solvated Proteins from Ab Initio Molecular Dynamics , 2011 .

[15]  Ivan S Ufimtsev,et al.  Quantum Chemistry on Graphical Processing Units. 2. Direct Self-Consistent-Field Implementation. , 2009, Journal of chemical theory and computation.

[16]  Ivan S Ufimtsev,et al.  Quantum Chemistry on Graphical Processing Units. 1. Strategies for Two-Electron Integral Evaluation. , 2008, Journal of chemical theory and computation.

[17]  Christine M. Isborn,et al.  Excited-State Electronic Structure with Configuration Interaction Singles and Tamm–Dancoff Time-Dependent Density Functional Theory on Graphical Processing Units , 2011, Journal of chemical theory and computation.

[18]  Andrey Asadchev,et al.  Fast and Flexible Coupled Cluster Implementation. , 2013, Journal of chemical theory and computation.

[19]  Duncan Poole,et al.  Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized Born , 2012, Journal of chemical theory and computation.

[20]  Bobby G. Sumpter,et al.  Density-fitted singles and doubles coupled cluster on graphics processing units , 2014 .

[21]  Martin Head-Gordon,et al.  A method for two-electron Gaussian integral and integral derivative evaluation using recurrence relations , 1988 .

[22]  Brett M. Bode,et al.  Uncontracted Rys Quadrature Implementation of up to G Functions on Graphical Processing Units. , 2010, Journal of chemical theory and computation.

[23]  Heather J Kulik,et al.  Ab initio quantum chemistry for protein structures. , 2012, The journal of physical chemistry. B.

[24]  Duncan Poole,et al.  Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 2. Explicit Solvent Particle Mesh Ewald. , 2013, Journal of chemical theory and computation.

[25]  Koji Yasuda,et al.  Accelerating Density Functional Calculations with Graphics Processing Unit. , 2008, Journal of chemical theory and computation.

[26]  Ivan S Ufimtsev,et al.  Quantum Chemistry on Graphical Processing Units. 3. Analytical Energy Gradients, Geometry Optimization, and First Principles Molecular Dynamics. , 2009, Journal of chemical theory and computation.

[27]  Peter M. W. Gill,et al.  Molecular integrals Over Gaussian Basis Functions , 1994 .

[28]  G. Fletcher Recursion formula for electron repulsion integrals over Hermite polynomials , 2006 .

[29]  Sriram Krishnamoorthy,et al.  GPU-Based Implementations of the Noniterative Regularized-CCSD(T) Corrections: Applications to Strongly Correlated Systems. , 2011, Journal of chemical theory and computation.

[30]  Huub J. J. Van Dam,et al.  Starting SCF calculations by superposition of atomic densities , 2006, J. Comput. Chem..

[31]  Timo Aila,et al.  Megakernels considered harmful: wavefront path tracing on GPUs , 2013, HPG '13.

[32]  S. F. Boys Electronic wave functions - I. A general method of calculation for the stationary states of any molecular system , 1950, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.

[33]  Mark S. Gordon,et al.  New Multithreaded Hybrid CPU/GPU Approach to Hartree-Fock. , 2012, Journal of chemical theory and computation.

[34]  Oreste Villa,et al.  Noniterative Multireference Coupled Cluster Methods on Heterogeneous CPU-GPU Systems. , 2013, Journal of chemical theory and computation.

[35]  Kenneth M Merz,et al.  Acceleration of Electron Repulsion Integral Evaluation on Graphics Processing Units via Use of Recurrence Relations. , 2013, Journal of chemical theory and computation.

[36]  Michel Dupuis,et al.  Computation of electron repulsion integrals using the rys quadrature method , 1983 .