GPU Linear Algebra Libraries and GPGPU Programming for Accelerating MOPAC Semiempirical Quantum Chemistry Calculations.

In this study, we present some modifications in the semiempirical quantum chemistry MOPAC2009 code that accelerate single-point energy calculations (1SCF) of medium-size (up to 2500 atoms) molecular systems using GPU coprocessors and multithreaded shared-memory CPUs. Our modifications consisted of using a combination of highly optimized linear algebra libraries for both CPU (LAPACK and BLAS from Intel MKL) and GPU (MAGMA and CUBLAS) to hasten time-consuming parts of MOPAC such as the pseudodiagonalization, full diagonalization, and density matrix assembling. We have shown that it is possible to obtain large speedups just by using CPU serial linear algebra libraries in the MOPAC code. As a special case, we show a speedup of up to 14 times for a methanol simulation box containing 2400 atoms and 4800 basis functions, with even greater gains in performance when using multithreaded CPUs (2.1 times in relation to the single-threaded CPU code using linear algebra libraries) and GPUs (3.8 times). This degree of acceleration opens new perspectives for modeling larger structures which appear in inorganic chemistry (such as zeolites and MOFs), biochemistry (such as polysaccharides, small proteins, and DNA fragments), and materials science (such as nanotubes and fullerenes). In addition, we believe that this parallel (GPU-GPU) MOPAC code will make it feasible to use semiempirical methods in lengthy molecular simulations using both hybrid QM/MM and QM/QM potentials.

[1]  Johannes Grotendorst,et al.  Modern methods and algorithms of quantum chemistry , 2000 .

[2]  Alán Aspuru-Guzik,et al.  Accelerating Correlated Quantum Chemistry Calculations Using Graphical Processing Units , 2010, Computing in Science & Engineering.

[3]  K Fukushima,et al.  An insight into the general relationship between the three dimensional structures of enzymes and their electronic wave functions: Implication for the prediction of functional sites of enzymes , 2008, Proteins.

[4]  Ivan S Ufimtsev,et al.  Quantum Chemistry on Graphical Processing Units. 2. Direct Self-Consistent-Field Implementation. , 2009, Journal of chemical theory and computation.

[5]  B. McConkey,et al.  Discrimination of native protein structures using atom–atom contact scoring , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  A. Klamt,et al.  COSMO : a new approach to dielectric screening in solvents with explicit expressions for the screening energy and its gradient , 1993 .

[7]  Lorena A. Barba,et al.  How to obtain efficient GPU kernels: An illustration using FMM & FGT algorithms , 2010, Comput. Phys. Commun..

[8]  Rolf Seeger,et al.  Parallel processing on minicomputers: A powerful tool for quantum chemistry , 1981 .

[9]  James J. P. Stewart,et al.  Application of localized molecular orbitals to the solution of semiempirical self‐consistent field equations , 1996 .

[10]  Yihan Shao,et al.  Accelerating resolution-of-the-identity second-order Møller-Plesset quantum chemistry calculations with graphical processing units. , 2008, The journal of physical chemistry. A.

[11]  Alfredo Mayall Simas,et al.  RM1: A reparameterization of AM1 for H, C, N, O, P, S, F, Cl, Br, and I , 2006, J. Comput. Chem..

[12]  Alistair P. Rendell,et al.  Computational chemistry on Fujitsu vector-parallel processors: Development and performance of applications software , 2000, Parallel Comput..

[13]  A Eugene DePrince,et al.  Coupled Cluster Theory on Graphics Processing Units I. The Coupled Cluster Doubles Method. , 2011, Journal of chemical theory and computation.

[14]  Christopher J. Woods,et al.  A massively multicore parallelization of the Kohn‐Sham energy gradients , 2010, J. Comput. Chem..

[15]  Elena F. Sheka,et al.  NANOPACK: Parallel codes for semiempirical quantum chemical calculations of large systems in the sp‐ and spd‐basis , 2002 .

[16]  Sriram Krishnamoorthy,et al.  GPU-Based Implementations of the Noniterative Regularized-CCSD(T) Corrections: Applications to Strongly Correlated Systems. , 2011, Journal of chemical theory and computation.

[17]  Jack Dongarra,et al.  LAPACK Users' Guide, 3rd ed. , 1999 .

[18]  Roland Lindh,et al.  Utilizing high performance computing for chemistry: parallel computational chemistry. , 2010, Physical chemistry chemical physics : PCCP.

[19]  James J. P. Stewart,et al.  Fast semiempirical calculations , 1982 .

[20]  Joshua A. Anderson,et al.  General purpose molecular dynamics simulations fully implemented on graphics processing units , 2008, J. Comput. Phys..

[21]  Andreas W. Götz,et al.  Quantum Chemistry on Graphics Processing Units , 2010 .

[22]  Karl A. Wilkinson,et al.  Acceleration of the GAMESS‐UK electronic structure package on graphical processing units , 2011, J. Comput. Chem..

[23]  M. Karplus,et al.  Effective energy functions for protein structure prediction. , 2000, Current opinion in structural biology.

[24]  Robert J. Brunner,et al.  High-Performance Computing with Accelerators , 2010, Comput. Sci. Eng..

[25]  Kenneth M. Merz,et al.  Fully Quantum Mechanical Description of Proteins in Solution. Combining Linear Scaling Quantum Mechanical Methodologies with the Poisson−Boltzmann Equation , 1999 .

[26]  Kenneth M Merz,et al.  Assessment of Semiempirical Quantum Mechanical Methods for the Evaluation of Protein Structures. , 2007, Journal of chemical theory and computation.

[27]  Ryo Maezono,et al.  Acceleration of a QM/MM‐QMC simulation using GPU , 2010, J. Comput. Chem..

[28]  John E. Stone,et al.  Fast analysis of molecular dynamics trajectories with graphics processing units - Radial distribution function histogramming , 2011, J. Comput. Phys..

[29]  Ivan S. Ufimtsev,et al.  Dynamic Precision for Electron Repulsion Integral Evaluation on Graphical Processing Units (GPUs). , 2011, Journal of chemical theory and computation.

[30]  Kim K. Baldridge,et al.  Parallel implementation of semiempirical quantum methods for the Intel platforms , 1996 .

[31]  Ivan S Ufimtsev,et al.  Quantum Chemistry on Graphical Processing Units. 3. Analytical Energy Gradients, Geometry Optimization, and First Principles Molecular Dynamics. , 2009, Journal of chemical theory and computation.

[32]  Alán Aspuru-Guzik,et al.  Accelerating Correlated Quantum Chemistry Calculations Using Graphical Processing Units , 2010, Computing in Science & Engineering.

[33]  Volodymyr Kindratenko,et al.  Porting Optimized GPU Kernels to a Multi-core CPU: Computational Quantum Chemistry Application Example , 2011, 2011 Symposium on Application Accelerators in High-Performance Computing.

[35]  Tseng-Hui Lin Parallelizing legacy applications using message passing programming model and the example of MOPAC , 2000 .

[36]  Eamonn F. Healy,et al.  Development and use of quantum mechanical molecular models. 76. AM1: a new general purpose quantum mechanical molecular model , 1985 .

[37]  Vijay S. Pande,et al.  Accelerating molecular dynamic simulation on graphics processing units , 2009, J. Comput. Chem..

[38]  Martin Korth,et al.  Third-Generation Hydrogen-Bonding Corrections for Semiempirical QM Methods and Force Fields , 2010 .

[39]  V. Hornak,et al.  Comparison of multiple Amber force fields and development of improved protein backbone parameters , 2006, Proteins.

[40]  Klaus Schulten,et al.  GPU-accelerated molecular modeling coming of age. , 2010, Journal of molecular graphics & modelling.

[41]  Kazuo Kitaura,et al.  The Fragment Molecular Orbital Method: Practical Applications to Large Molecular Systems , 2009 .

[42]  Errol Lewars,et al.  Computational chemistry , 2003 .

[43]  Walter Thiel,et al.  Parallelisation in quantum chemistry: the MNDO code , 1995, HPCN Europe.

[44]  Xin Wu,et al.  Semiempirical Quantum Chemical Calculations Accelerated on a Hybrid Multicore CPU-GPU Computing Platform. , 2012, Journal of chemical theory and computation.

[45]  Robert Zaleśny,et al.  Linear-Scaling Techniques in Computational Chemistry and Physics , 2011 .

[46]  M. Levitt,et al.  Energy functions that discriminate X-ray and near native folds from well-constructed decoys. , 1996, Journal of molecular biology.

[47]  Walter Thiel,et al.  MNDO Study of Large Carbon Clusters , 1991 .