Towards Highly scalable Ab Initio Molecular Dynamics (AIMD) Simulations on the Intel Knights Landing Manycore Processor

The Ab Initio Molecular Dynamics (AIMD) method allows scientists to treat the dynamics of molecular and condensed phase systems while retaining a first-principles-based description of their interactions. This extremely important method has tremendous computational requirements, because the electronic Schrodinger equation, approximated using Kohn-Sham Density Functional Theory (DFT), is solved at every time step. With the advent of manycore architectures, application developers have a significant amount of processing power within each compute node that can only be exploited through massive parallelism. A compute intensive application such as AIMD forms a good candidate to leverage this processing power. In this paper, we focus on adding thread level parallelism to the plane wave DFT methodology implemented in NWChem. Through a careful optimization of tall-skinny matrix products, which are at the heart of the Lagrange Multiplier and non-local pseudopotential kernels, as well as 3D FFTs, our OpenMP implementation delivers excellent strong scaling on the latest Intel Knights Landing (KNL) processor. We assess the efficiency of our Lagrange multipliers kernels by building a Roofline model of the platform, and verify that our implementation is close to the roofline for various problem sizes. Finally, we present strong scaling results on the complete AIMD simulation for a 64 water molecules test case, that scales up to all 68 cores of the Knights Landing processor.

[1]  Johannes Grotendorst,et al.  Modern methods and algorithms of quantum chemistry , 2000 .

[2]  Samuel Williams,et al.  Applying the Roofline Performance Model to the Intel Xeon Phi Knights Landing Processor , 2016, ISC Workshops.

[3]  Eric J. Bylaska,et al.  Large‐Scale Plane‐Wave‐Based Density Functional Theory: Formalism, Parallelization, and Applications , 2011 .

[4]  Eric J. Bylaska,et al.  First Principles Estimation of Geochemically Important Transition Metal Oxide Properties , 2016 .

[5]  Car,et al.  Unified approach for molecular dynamics and density-functional theory. , 1985, Physical review letters.

[6]  James Demmel,et al.  Communication-Optimal Parallel Recursive Rectangular Matrix Multiplication , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[7]  François Gygi,et al.  Architecture of Qbox: A scalable first-principles molecular dynamics code , 2008, IBM J. Res. Dev..

[8]  Nino Boccara,et al.  Simple molecular systems at very high density , 1989 .

[9]  Scott B. Baden,et al.  Hard scaling challenges for ab initio molecular dynamics capabilities in NWChem: Using 100,000 CPUs per second , 2009 .

[10]  Eric J. Bylaska,et al.  Importance of Counteranions on the Hydration Structure of the Curium Ion , 2013 .

[11]  D. Remler,et al.  Molecular dynamics without effective potentials via the Car-Parrinello approach , 1990 .

[12]  Eric J. Bylaska,et al.  Parallel Implementation of the Projector Augmented Plane Wave Method for Charged Systems , 2002 .

[13]  T. Arias,et al.  Iterative minimization techniques for ab initio total energy calculations: molecular dynamics and co , 1992 .

[14]  Samuel Williams,et al.  Roofline: an insightful visual performance model for multicore architectures , 2009, CACM.

[15]  Wibe A de Jong,et al.  Coordination and hydrolysis of plutonium ions in aqueous solution using Car-Parrinello molecular dynamics free energy simulations. , 2013, The journal of physical chemistry. A.

[16]  Roland Lindh,et al.  Utilizing high performance computing for chemistry: parallel computational chemistry. , 2010, Physical chemistry chemical physics : PCCP.

[17]  Leiba Rodman,et al.  Algebraic Riccati equations , 1995 .

[18]  Kresse,et al.  Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. , 1996, Physical review. B, Condensed matter.

[19]  Tjerk P. Straatsma,et al.  NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations , 2010, Comput. Phys. Commun..