Performance of parallel incomplete LDLt factorizations for solving acoustic wave propagation problems from industry

Parallel incomplete LDLt (ParILDLt) factorizations are used to solve highly indefinite complex-symmetric linear systems that arise from finite element discretization of acoustic wave propagation problems. The parallelization strategy is a generalized domain decomposition type approach in which adjacent subdomains have to exchange data during the construction of the incomplete factorization preconditioning matrix, as well as during each local forward and backward substitution. Comparison with the SYSNOISE (LMS International NV) direct solver, and the finite element tearing and interconnecting method for the Helmholtz equation (FETI-H), is done in terms of execution time and memory usage. Challenging industrial problems are tested, showing that high performance is achieved with ParlLDLt. Copyright © 2004 John Wiley & Sons, Ltd.

[1]  Y. Saad,et al.  GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems , 1986 .

[2]  Bruce Hendrickson,et al.  An Improved Spectral Graph Partitioning Algorithm for Mapping Parallel Computations , 1995, SIAM J. Sci. Comput..

[3]  Olof B. Widlund,et al.  A Domain Decomposition Method with Lagrange Multipliers and Inexact Solvers for Linear Elasticity , 2000, SIAM J. Sci. Comput..

[4]  Takumi Washio,et al.  Ordering strategies and related techniques to overcome the trade-off between parallelism and convergence in incomplete factorizations , 1999, Parallel Comput..

[5]  Alex Pothen,et al.  A Scalable Parallel Algorithm for Incomplete Factor Preconditioning , 2000, SIAM J. Sci. Comput..

[6]  Mardochée Magolu monga Made,et al.  Incomplete factorization-based preconditionings for solving the Helmholtz equation , 2001 .

[7]  Walter Hoffmann,et al.  Iterative algorithms for Gram-Schmidt orthogonalization , 1989, Computing.

[8]  Henk A. van der Vorst,et al.  Bi-CGSTAB: A Fast and Smoothly Converging Variant of Bi-CG for the Solution of Nonsymmetric Linear Systems , 1992, SIAM J. Sci. Comput..

[9]  Dianne P. O'Leary,et al.  A Multigrid Method Enhanced by Krylov Subspace Iteration for Discrete Helmholtz Equations , 2001, SIAM J. Sci. Comput..

[10]  Chris Walshaw,et al.  Mesh Partitioning: A Multilevel Balancing and Refinement Algorithm , 2000, SIAM J. Sci. Comput..

[11]  J. Meijerink,et al.  Guidelines for the usage of incomplete decompositions in solving sets of linear equations as they occur in practical problems , 1981 .

[12]  Charbel Farhat,et al.  A non Overlapping Domain Decomposition Method for the Exterior Helmholtz Problem , 1997 .

[13]  Roland W. Freund,et al.  QMRPACK: a package of QMR algorithms , 1996, TOMS.

[14]  S. Teng Fast Nested Dissection for Finite Element Meshes , 1997 .

[15]  C. Farhat,et al.  Two-level domain decomposition methods with Lagrange multipliers for the fast iterative solution of acoustic scattering problems , 2000 .

[16]  D. R. Fokkema,et al.  BiCGstab(ell) for Linear Equations involving Unsymmetric Matrices with Complex Spectrum , 1993 .

[17]  Henk A. van der Vorst,et al.  ParIC: A Family of Parallel Incomplete Cholesky Preconditioners , 2000, HPCN Europe.

[18]  Gundolf Haase Parallel Incomplete Cholesky Preconditioners Based on the Non-Overlapping Data Distribution , 1998, Parallel Comput..

[19]  Yvan Notay,et al.  An Efficient Parallel Discrete PDE Solver , 1995, Parallel Comput..

[20]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .

[21]  J. Meijerink,et al.  An iterative solution method for linear systems of which the coefficient matrix is a symmetric -matrix , 1977 .

[22]  Roland W. Freund,et al.  Conjugate Gradient-Type Methods for Linear Systems with Complex Symmetric Coefficient Matrices , 1992, SIAM J. Sci. Comput..

[23]  Henk A. van der Vorst,et al.  Spectral analysis of parallel incomplete factorizations with implicit pseudo‐overlap , 2002, Numer. Linear Algebra Appl..

[24]  Charbel Farhat,et al.  A two-level domain decomposition method for the iterative solution of high frequency exterior Helmholtz problems , 2000, Numerische Mathematik.

[25]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[26]  Robert Beauwens,et al.  Preconditioning of discrete Helmholtz operators perturbed by a diagonal complex matrix , 2000 .

[27]  Henk A. van der Vorst,et al.  Parallel incomplete factorizations with pseudo-overlapped subdomains , 2001, Parallel Comput..

[28]  R. Freund,et al.  QMR: a quasi-minimal residual method for non-Hermitian linear systems , 1991 .

[29]  Roland W. Freund,et al.  An Implementation of the QMR Method Based on Coupled Two-Term Recurrences , 1994, SIAM J. Sci. Comput..

[30]  Chris Walshaw,et al.  Parallel optimisation algorithms for multilevel mesh partitioning , 2000, Parallel Comput..

[31]  Mardochée Magolu monga Made Implementation of parallel block preconditionings on a transputer-based multiprocessor , 1995, Future Gener. Comput. Syst..

[32]  Karl Meerbergen,et al.  Application of a domain decomposition method with Lagrange multipliers to acoustic problems arising , 2000 .

[33]  Henk A. van der Vorst,et al.  A generalized domain decomposition paradigm for parallel incomplete LU factorization preconditionings , 2001, Future Gener. Comput. Syst..

[34]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[35]  I. Duff,et al.  The Use of multiple fronts in Gaussian elimination , 1994 .