A domain decomposition strategy for hybrid parallelization of moving particle semi-implicit (MPS) method for computer cluster

The MPS is a particle method developed to simulate incompressible flows with free surfaces that has several applications in nonlinear hydrodynamics. Much effort has been done to handle the large amount of particles required for simulating practical problems with desired refinement. However, the efficient use of the currently available computational resources, such as computer cluster, remains as a challenge. The present paper proposes a new strategy to parallelize the MPS method for fully distributed computing in cluster, which enables to simulate models with hundreds of millions of particles and keeps the required runtime within reasonable limits, as shown by the analysis of scalability and performance. The proposed strategy uses a non-geometric dynamic domain decomposition method that provides homogeneous load balancing and for very large models the scalability is supra-linear. Also, the domain decomposition (DD) is carried out only in the initial setup. As a result, the DD method is based on renumbering of particles using an original fully distributed sorting algorithm. Moreover, unlike the usual strategies, none of the processors require access to global data of the particles on any time step. Therefore, the limit for the maximum size of the model depends more on the total memory of the allocated nodes than the quantity of the local memory of each node. Thus, by extending the application of MPS method to very large models, this study contributes to consolidating the method as a practical tool to investigate complex engineering problems.

[1]  Liang-Yee Cheng,et al.  Particle Based Numerical Analysis of Green Water on FPSO Deck , 2013 .

[2]  S. Meloni,et al.  Efficient particle labeling in atomistic simulations. , 2007, The Journal of chemical physics.

[3]  Hitoshi Gotoh,et al.  Key issues in the particle method for computation of wave breaking , 2006 .

[4]  Min Cheng,et al.  Improved O(N) neighbor list method using domain decomposition and data sorting , 2004 .

[5]  Seiichi KOSHIZUKA,et al.  Numerical Analysis of Jet Injection Behavior for Fuel-Coolant Interaction using Particle Method , 2001 .

[6]  Bin Chen,et al.  Comparison of parallel solvers for Moving Particle Semi-Implicit method , 2015 .

[7]  Satoshi Tanaka,et al.  Development of Hierarchical Domain Decomposition Explicit MPS Method and Application to Large-scale Tsunami Analysis with Floating Objects , 2014 .

[8]  Matthias Teschner,et al.  A Parallel SPH Implementation on Multi‐Core CPUs , 2011, Comput. Graph. Forum.

[9]  Donald E. Knuth,et al.  The art of computer programming: sorting and searching (volume 3) , 1973 .

[10]  Luca Formaggia,et al.  Domain Decomposition Techniques , 2006 .

[11]  L. M. Sato,et al.  EXPLICIT MOVING PARTICLE SIMULATION METHOD ON GPU CLUSTERS , 2012 .

[12]  Seiichi Koshizuka,et al.  Reduction of Communication in Parallel Computing of Particle Method for Flow Simulation of Seaside Areas , 2010 .

[13]  Abbas Khayyer,et al.  5. 3D-CMPS METHOD FOR IMPROVEMENT OF WATER SURFACE TRACKING IN BREAKING WAVES , 2009 .

[14]  Hesham El-Rewini,et al.  Message Passing Interface (MPI) , 2005 .

[15]  Zbigniew J. Czech,et al.  Introduction to Parallel Computing , 2017 .

[16]  Donald E. Knuth,et al.  The Art of Computer Programming: Volume 3: Sorting and Searching , 1998 .

[17]  Takeshi Iwashita,et al.  Parallel processing of 3-D eddy current analysis with moving conductor using parallelized ICCG solver with renumbering process , 2000 .

[18]  Hitoshi Gotoh,et al.  GPU-acceleration for Moving Particle Semi-Implicit method , 2011 .

[19]  S. Koshizuka A particle method for incompressible viscous flow with fluid fragmentation , 1995 .

[20]  Mohammad Piri,et al.  Multi-GPU acceleration of direct pore-scale modeling of fluid flow in natural porous media , 2012, Comput. Phys. Commun..

[21]  Lei Wang,et al.  Achieving Scalable Parallel Molecular Dynamics Using Dynamic Spatial Domain Decomposition Techniques , 1997, J. Parallel Distributed Comput..

[22]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .

[23]  Donald E. Knuth,et al.  The art of computer programming, volume 3: (2nd ed.) sorting and searching , 1998 .

[24]  Hiroshi Kawai,et al.  Seismic Response Analysis of Nuclear Pressure Vessel Model with ADVENTRUE System on the Earth Simulator , 2005 .

[25]  Yoshiaki Oka,et al.  A particle method for calculating splashing of incompressible viscous fluid , 1995 .

[26]  D. Marr,et al.  Hyper-Threading Technology Architecture and MIcroarchitecture , 2002 .