A Parallel Shape Optimizing Load Balancer

Load balancing is an important issue in parallel numerical simulations. However, state-of-the-art libraries addressing this problem show several deficiencies: they are hard to parallelize, focus on small edge-cuts rather than few boundary vertices, and often produce disconnected partitions. We present a distributed implementation of a load balancing heuristic for parallel adaptive FEM simulations. It is based on a disturbed diffusion scheme embedded in a learning framework. This approach offers a high degree of exploitable parallelism and computes well-shaped partitions, as shown in previous publications. Our focus lies on improving the condition of the involved matrix and on solving the resulting linear systems with local accuracy. This avoids unnecessary computations and allows the domain decomposition to be replaced by an alternative data distribution scheme that reduces the communication overhead, as shown by experiments with our new MPI-based implementation.
