Combining Task-based Parallelism and Adaptive Mesh Refinement Techniques in Molecular Dynamics Simulations

Modern parallel architectures require applications to generate massive parallelism so as to feed their large number of cores and their wide vector units. We revisit the extensively studied classical Molecular Dynamics N-body problem in the light of these hardware constraints. We use Adaptive Mesh Refinement techniques to store particles in memory, and to optimize the force computation loop using multi-threading and vectorization-friendly data structures. Our design is guided by the need for load balancing and adaptivity raised by highly dynamic particle sets, as typically observed in simulations of strong shocks resulting in material micro-jetting. We analyze performance results on several simulation scenarios, over nodes equipped by Intel Xeon Phi Knights Landing (KNL) or Intel Xeon Skylake (SKL) processors. Performance obtained with our OpenMP implementation outperforms state-of-the-art implementations (LAMMPS) on both steady and micro-jetting particles simulations. In the latter case, our implementation is 4.7 times faster on KNL, and 2 times faster on SKL.

[1]  S. Meloni,et al.  Efficient particle labeling in atomistic simulations. , 2007, The Journal of chemical physics.

[2]  Dennis Gannon,et al.  Strategies for cache and local memory management by global program transformation , 1988, J. Parallel Distributed Comput..

[3]  Mark F. Adams,et al.  Chombo Software Package for AMR Applications Design Document , 2014 .

[4]  Brian van Straalen,et al.  A survey of high level frameworks in block-structured adaptive mesh refinement packages , 2014, J. Parallel Distributed Comput..

[5]  Ralf Meyer,et al.  Efficient parallelization of molecular dynamics simulations with short-ranged forces , 2014 .

[6]  R. Meyer,et al.  Efficient Parallelization of Short-Range Molecular Dynamics Simulations on Many-Core Systems , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  C. Brooks Computer simulation of liquids , 1989 .

[8]  Devin W. Silvia,et al.  ENZO: AN ADAPTIVE MESH REFINEMENT CODE FOR ASTROPHYSICS , 2013, J. Open Source Softw..

[9]  William Smith,et al.  The DL POLY Classic User Manual , 2010 .

[10]  Laurent Colombet,et al.  ExaStamp: A Parallel Framework for Molecular Dynamics on Heterogeneous Clusters , 2014, Euro-Par Workshops.

[11]  Benedict Leimkuhler,et al.  Integration Methods for Molecular Dynamics , 1996 .

[12]  Efraim Rotem,et al.  Inside 6th-Generation Intel Core: New Microarchitecture Code-Named Skylake , 2017, IEEE Micro.

[13]  Sandia Report,et al.  Improving Performance via Mini-applications , 2009 .

[14]  M. Berger,et al.  Adaptive mesh refinement for hyperbolic partial differential equations , 1982 .

[15]  L. Richardson The Approximate Arithmetical Solution by Finite Differences of Physical Problems Involving Differential Equations, with an Application to the Stresses in a Masonry Dam , 1911 .

[16]  Changjun Hu,et al.  Kernel optimization for short-range molecular dynamics , 2017, Comput. Phys. Commun..

[17]  Chris M. Mangiardi,et al.  A hybrid algorithm for parallel molecular dynamics simulations , 2017, Comput. Phys. Commun..

[18]  Andres More,et al.  Intel Xeon Phi Coprocessor High Performance Programming , 2013 .

[19]  E. A. Mason Transport Properties of Gases Obeying a Modified Buckingham (Exp‐Six) Potential , 1954 .

[20]  Philip M. Morse,et al.  Diatomic Molecules According to the Wave Mechanics I: Electronic Levels of the Hydrogen Molecular Ion , 1929 .

[21]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[22]  D. van der Spoel,et al.  GROMACS: A message-passing parallel molecular dynamics implementation , 1995 .

[23]  Avinash Sodani,et al.  Knights landing (KNL): 2nd Generation Intel® Xeon Phi processor , 2015, 2015 IEEE Hot Chips 27 Symposium (HCS).

[24]  Shahid H. Bokhari,et al.  A Partitioning Strategy for Nonuniform Problems on Multiprocessors , 1987, IEEE Transactions on Computers.

[25]  O. Durand,et al.  Comparative simulations of microjetting using atomistic and continuous approaches in the presence of viscosity and surface tension , 2017 .

[26]  L. Verlet Computer "Experiments" on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules , 1967 .

[27]  Janet E. Jones On the determination of molecular fields. —II. From the equation of state of a gas , 1924 .

[28]  Steven J. Plimpton,et al.  Optimizing legacy molecular dynamics software with directive-based offload , 2015, Comput. Phys. Commun..

[29]  François Faure,et al.  A Packed Memory Array to Keep Moving Particles Sorted , 2012, VRIPHYS.

[30]  Stephen R. Schach,et al.  Max-Min Tree Partitioning , 1981, JACM.

[31]  P. Colella,et al.  Local adaptive mesh refinement for shock hydrodynamics , 1989 .

[32]  D. Wolff,et al.  Tabulated potentials in molecular dynamics simulations , 1999 .