Parallel Hierarchical Solvers and Preconditioners for Boundary Element Methods

The method of moments is an important tool for solving boundary integral equations arising in a variety of applications. It transforms the physical problem into a dense linear system. Due to the large number of variables and the associated computational requirements, these systems are solved iteratively using methods such as GMRES and CG and its variants. The core operation of these iterative solvers is the application of the system matrix to a vector. This requires $\theta(n^2)$ operations and memory using accurate dense methods. The computational complexity can be reduced to $O(n \log n)$ and the memory requirement to $\Theta(n)$ using hierarchical approximation techniques. The algorithmic speedup from approximation can be combined with parallelism to yield very fast dense solvers. In this paper, we present efficient parallel formulations of dense iterative solvers based on hierarchical approximations for solving potential integral equations of the first kind. We study the impact of various parameters on the accuracy and performance of the parallel solver. We demonstrate that our parallel formulation incurs minimal parallel processing overhead and scales up to a large number of processors. We present two preconditioning techniques for accelerating the convergence of the iterative solver. These techniques are based on an inner-outer scheme and a block-diagonal scheme based on a truncated Green's function. We present detailed experimental results on up to 256 processors of a Cray T3D. Our code achieves raw computational speeds of over 5 GFLOPS. When compared to the accurate solver, this corresponds to a speed of approximately 776 GFLOPS.

[1]  Ananth Grama,et al.  Efficient parallel formulations of hierarchical methods and their applications , 1996 .

[2]  Feng Zhao,et al.  The Parallel Multipole Method on the Connection Machine , 1991, SIAM J. Sci. Comput..

[3]  Ananth Grama,et al.  n-body simulations using message passing parallel computers. , 1995 .

[4]  Andrew W. Appel,et al.  An Efficient Program for Many-Body Simulation , 1983 .

[5]  William Gropp,et al.  A Parallel Version of the Fast Multipole Method-Invited Talk , 1987, PPSC.

[6]  John L. Volakis,et al.  Guidelines for using the fast multipole method to calculate the RCS of large objects , 1996 .

[7]  Ananth Grama,et al.  Scalable parallel formulations of the Barnes-Hut method for n-body simulations , 1994, Proceedings of Supercomputing '94.

[8]  R. Harrington Matrix methods for field problems , 1967 .

[9]  Jiming Song,et al.  Multilevel fast‐multipole algorithm for solving combined field integral equations of electromagnetic scattering , 1995 .

[10]  Piet Hut,et al.  A hierarchical O(N log N) force-calculation algorithm , 1986, Nature.

[11]  K. Schmidt,et al.  Implementing the fast multipole method in three dimensions , 1991 .

[12]  V. Rokhlin Rapid solution of integral equations of classical potential theory , 1985 .

[13]  V. Rokhlin Rapid Solution of Integral Equations of Scattering Theory , 1990 .

[14]  M. S. Warren,et al.  A parallel hashed Oct-Tree N-body algorithm , 1993, Supercomputing '93.

[15]  Jiming Song,et al.  Fast multipole method solution using parametric geometry , 1994 .

[16]  Sanjay Ranka,et al.  Many-to-many personalized communication with bounded traffic , 1995, Proceedings Frontiers '95. The Fifth Symposium on the Frontiers of Massively Parallel Computation.

[17]  Leslie Greengard,et al.  A fast algorithm for particle simulations , 1987 .

[18]  Michael S. Warren,et al.  Astrophysical N-body simulations using hierarchical tree data structures , 1992, Proceedings Supercomputing '92.