A Performance Study of Quantum ESPRESSO's PWscf Code on Multi-core and GPU Systems

We describe the porting of PWscf (Plane-Wave Self Consistent Field), a key component of the Quantum ESPRESSO open-source suite of codes for materials modeling, to GPU systems using CUDA Fortran. Kernel loop directives (CUF kernels) have been extensively used in order to have a single source code for both CPU and GPU implementations. The results of the GPU version have been carefully validated and the performance of the code on several GPU systems (both x86 and POWER8 based) has been compared with traditional Intel multi-core (CPU only) systems. This current GPU version can reduce the time-to-solution by an average factor of 2–3 running two different input cases widely used as benchmarks on small and large high performance computing systems.

[1]  Jack Dongarra,et al.  ScaLAPACK user's guide , 1997 .

[2]  Johnson,et al.  Modified Broyden's method for accelerating convergence in self-consistent calculations. , 1988, Physical review. B, Condensed matter.

[3]  Stefano de Gironcoli,et al.  QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials , 2009, Journal of physics. Condensed matter : an Institute of Physics journal.

[4]  Warren E. Pickett,et al.  Pseudopotential methods in condensed matter applications , 1989 .

[5]  Jack J. Dongarra,et al.  Accelerating Numerical Dense Linear Algebra Calculations with GPUs , 2014, Numerical Computations with GPUs.

[6]  David G. Weinman Fortran for Scientists and Engineers , 1989 .

[7]  Lukas Krämer,et al.  Parallel solution of partial symmetric eigenvalue problems from electronic structure calculations , 2011, Parallel Comput..

[8]  Massimiliano Fatica,et al.  CUDA Fortran for Scientists and Engineers , 2012 .

[9]  Filippo Spiga,et al.  phiGEMM: A CPU-GPU Library for Porting Quantum ESPRESSO on Hybrid Systems , 2012, 2012 20th Euromicro International Conference on Parallel, Distributed and Network-based Processing.

[10]  S. Froyen,et al.  Brillouin-zone integration by Fourier quadrature: Special points for superlattice and supercell calculations. , 1989, Physical review. B, Condensed matter.

[11]  A Marek,et al.  The ELPA library: scalable parallel eigenvalue solutions for electronic structure theory and computational science , 2014, Journal of physics. Condensed matter : an Institute of Physics journal.

[12]  Walter Kohn,et al.  Fundamentals of density functional theory , 1998 .