Towards Informed Partitioning for Load Balancing: a Proof-of-Concept

Most parallel applications suffer from load imbalance, a crucial performance degradation factor. In particle simulations, this is mainly due to the migration of particles between processing elements, which eventually gather unevenly and create workload imbalance. Dynamic load balancing is used at various iterations to mitigate load imbalance, employing a partitioning method to divide the computational space evenly while minimizing communications. In this paper, we propose a novel partitioning methodology called “informed partitioning”. It uses information based on the evolution of the computation to reduce the load balancing growth and the number of load balancing calls. In this paper, we illustrate informed partitioning by proposing a new geometric partitioning technique for particles simulations. This technique is derived from the wellknown recursive coordinate bisection and employs the velocity of the particles to guide the bisection axis. To properly compare the performance of our new method with existing partitioning techniques during application execution, we introduce an effort metric based on a theoretical model of load balanced parallel application time. We propose a proof-of-concept of informed partitioning, through a numerical study, on three N-Body simulations with various particle dynamics, and we discuss its performance against popular geometric partition∗Corresponding author Email address: anthony.boulmier@unige.ch Preprint submitted to Elsevier August 26, 2021 ar X iv :2 10 8. 11 09 9v 1 [ cs .D C ] 2 5 A ug 2 02 1 ing techniques. Moreover, we show that our effort metric can be used to rank partitioning techniques by their efficiency at any time point during the simulation. Eventually, this could be used to choose the best partitioning on the fly. In the numerical study, we report that our novel concept increases the performance of two experiments out of three by up to 76% and 15%, while being marginally slower by only 3% in one experiment. Also, we discuss the limitations of our implementation of informed partitioning and our effort metric.

[1]  Sanjay Ranka,et al.  Dynamic Load Balancing for Compressible Multiphase Turbulence , 2018, ICS.

[2]  Luiz De Rose,et al.  Detecting Application Load Imbalance on High End Massively Parallel Systems , 2007, Euro-Par.

[3]  Wolfgang E. Nagel,et al.  Highly scalable SFC-based dynamic load balancing and its application to atmospheric modeling , 2017, Future Gener. Comput. Syst..

[4]  Shinji Shimojo,et al.  A new dynamical domain decomposition method for parallel molecular dynamics simulation , 2004, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005..

[5]  Laxmikant V. Kalé,et al.  Automated Load Balancing Invocation Based on Application Characteristics , 2012, 2012 IEEE International Conference on Cluster Computing.

[6]  Dirk Pflüger,et al.  Towards Understanding Optimal Load-Balancing of Heterogeneous Short-Range Molecular Dynamics , 2016, 2016 IEEE 23rd International Conference on High Performance Computing Workshops (HiPCW).

[7]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[8]  Courtenay T. Vaughan,et al.  Zoltan data management services for parallel dynamic applications , 2002, Comput. Sci. Eng..

[9]  C. A. R. Hoare,et al.  Algorithm 65: find , 1961, Commun. ACM.

[10]  Bastien Chopard,et al.  Optimal Load Balancing and Assessment of Existing Load Balancing Criteria , 2021, ArXiv.

[11]  R J Procassini,et al.  Load Balancing of Parallel Monte Carlo Transport Calculations , 2004 .

[12]  Shang-Hua Teng,et al.  How Good is Recursive Bisection? , 1997, SIAM J. Sci. Comput..

[13]  Dirk Roose,et al.  An Improved Spectral Bisection Algorithm and its Application to Dynamic Load Balancing , 1995, EUROSIM International Conference.

[14]  Scott B. Baden,et al.  Dynamic Partitioning of Non-Uniform Structured Workloads with Spacefilling Curves , 1996, IEEE Trans. Parallel Distributed Syst..

[15]  Bastien Chopard,et al.  On the Benefits of Anticipating Load Imbalance for Performance Optimization of Parallel Applications , 2019, 2019 IEEE International Conference on Cluster Computing (CLUSTER).

[16]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[17]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[18]  Claudio Márquez,et al.  A Load Balancing Schema for Agent-based SPMD Applications , 2013 .

[19]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[20]  Mehmet Deveci,et al.  Multi-Jagged: A Scalable Parallel Spatial Partitioning Algorithm , 2016, IEEE Transactions on Parallel and Distributed Systems.

[21]  Christian Siebert Scalable and Efficient Parallel Selection , 2013, PPAM.

[22]  Godehard Sutmann,et al.  Adaptive dynamic load-balancing with irregular domain decomposition for particle simulations , 2015, Comput. Phys. Commun..

[23]  M. S. Egorova,et al.  Parallel SPH modeling using dynamic domain decomposition and load balancing displacement of Voronoi subdomains , 2018, Comput. Phys. Commun..