Simultaneous Mesh Generation and Partitioning for Delaunay Meshes

In this paper, we present a new approach for the parallel generation and partitioning of unstructured 3D Delaunay meshes. The new approach couples the mesh generation and partitioning problems into a single optimization problem. Traditionally, these two problems are solved separately, first generating the mesh (usually sequentially) and then partitioning the mesh, either sequentially or in parallel. In the traditional approach, the overheads due to I/O and data movement exceed 50% of the total execution time. Even if parallel partitioning schemes are employed, data movement, synchronization, and data structure translation overheads are high; for applications which require frequent remeshing (e.g. crack growth simulations), these overheads are prohibitive. We present a method for solving the mesh partitioning and placement problem simultaneously with the mesh generation problem. By eliminating unnecessary and redundant cache, local, and remote memory accesses, we can speed up the generation and redistribution process, for very large meshes, by almost an order of magnitude compared to traditional approaches. Our results show that we can achieve nearly perfect equi-distribution of mesh elements over the processors, while maintaining reasonably good separator size, all while improving the quality of the mesh by eliminating many of the problems inherent in traditional parallel constrained mesh generation.

[1]  Rupak Biswas,et al.  HARP: A Dynamic Spectral Partitioner , 1998, J. Parallel Distributed Comput..

[2]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[3]  Roy D. Williams,et al.  Performance of dynamic load balancing algorithms for unstructured mesh calculations , 1991, Concurr. Pract. Exp..

[4]  D. F. Watson Computing the n-Dimensional Delaunay Tesselation with Application to Voronoi Polytopes , 1981, Comput. J..

[5]  John R. Rice,et al.  Automatic load balanced paritioning strategies for PDE computations , 1989, ICS '89.

[6]  Alfio Quarteroni,et al.  Domain Decomposition Methods for Partial Differential Equations , 1999 .

[7]  Adrian Bowyer,et al.  Computing Dirichlet Tessellations , 1981, Comput. J..

[8]  Horst D. Simon,et al.  Partitioning of unstructured problems for parallel processing , 1991 .

[9]  Franco P. Preparata,et al.  Computational Geometry , 1985, Texts and Monographs in Computer Science.

[10]  Bruce Hendrickson,et al.  The Chaco user`s guide. Version 1.0 , 1993 .

[11]  G. C. Fox,et al.  Solving Problems on Concurrent Processors , 1988 .

[12]  Brian Kernighan,et al.  An efficient heuristic for partitioning graphs , 1970 .

[13]  Barry Smith,et al.  Domain Decomposition Methods for Partial Differential Equations , 1997 .

[14]  Chris Hawblitzel,et al.  The Mobile Object Layer: A Run-Time Substrate for Mobile Adaptive Computations , 1998, ISCOPE.

[15]  M. Shephard,et al.  Load balancing for the parallel adaptive solution of partial differential equations , 1994 .

[16]  John R. Rice,et al.  Mapping Algorithms and Software Environment for Data Parallel PDE Iterative Solvers , 1994, J. Parallel Distributed Comput..

[17]  George Cybenko,et al.  Dynamic Load Balancing for Distributed Memory Multiprocessors , 1989, J. Parallel Distributed Comput..

[18]  Martin Berzins,et al.  Dynamic load-balancing for PDE solvers on adaptive unstructured meshes , 1995, Concurr. Pract. Exp..

[19]  Geoffrey C. Fox,et al.  A comparison of optimization heuristics for the data mapping problem , 1997 .

[20]  Kenneth Steiglitz,et al.  Combinatorial Optimization: Algorithms and Complexity , 1981 .

[21]  George Karypis,et al.  Multilevel k-way Partitioning Scheme for Irregular Graphs , 1998, J. Parallel Distributed Comput..

[22]  Bruce Hendrickson,et al.  A Multi-Level Algorithm For Partitioning Graphs , 1995, Proceedings of the IEEE/ACM SC95 Conference.

[23]  Vipin Kumar,et al.  Multilevel Diffusion Schemes for Repartitioning of Adaptive Meshes , 1997, J. Parallel Distributed Comput..

[24]  Kevin J. Barker,et al.  Mobile object layer: a runtime substrate for parallel adaptive and irregular computations , 2000 .

[25]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[26]  C. Hawblitzel,et al.  Data Migration Substrate for the Load Balancing of Parallel Adaptive Unstructured Mesh Computations , 1998 .