Comparison Criteria for Parallel Orderings in ILU Preconditioning

This paper introduces block red-black ordering in a general three-dimensional form for parallel incomplete LU (ILU) preconditioning. This parallel ordering is designed to attain fast convergence while reducing synchronization among processors in the parallelized forward and backward substitutions. The grid nodes are divided into blocks, and red-black ordering is applied to the blocks. Since blocks of the same color have no data dependencies on one another, the blocks of each color can be processed in parallel. Moreover, in order to compare parallel orderings, we propose a new tool for investigating the effect of an ordering on convergence. The analytic index for convergence, which is based on the remainder matrix, is easily computed and has a unique value for a fixed ordering. Nodes are classified into seven groups, and the effect of each group on convergence is estimated. Numerical tests using a three-dimensional problem confirm the validity of both the proposed parallel ordering and the analytic convergence comparison method.
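The block coloring described above can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a regular nx × ny × nz grid, a 7-point stencil (each node coupled only to its six axis neighbors), cubic blocks of edge length `bs`, and a checkerboard coloring of the blocks. The function names `block_color`, `block_red_black_order`, and `same_color_blocks_independent` are hypothetical and chosen only for this illustration.

```python
from itertools import product


def block_color(i, j, k, bs):
    """Color of the block containing node (i, j, k): 0 = red, 1 = black.

    Assumption: cubic blocks of edge bs, colored like a 3D checkerboard.
    """
    return (i // bs + j // bs + k // bs) % 2


def block_red_black_order(nx, ny, nz, bs):
    """List all nodes with red blocks first, then black blocks.

    Nodes inside a block, and blocks of one color, stay in lexicographic order.
    """
    nodes = product(range(nx), range(ny), range(nz))

    def key(node):
        i, j, k = node
        block = (i // bs, j // bs, k // bs)
        return (block_color(i, j, k, bs), block, node)

    return sorted(nodes, key=key)


def same_color_blocks_independent(nx, ny, nz, bs):
    """Check that no 7-point-stencil edge joins two different blocks of one color."""
    for i, j, k in product(range(nx), range(ny), range(nz)):
        for di, dj, dk in ((1, 0, 0), (0, 1, 0), (0, 0, 1)):
            ni, nj, nk = i + di, j + dj, k + dk
            if ni >= nx or nj >= ny or nk >= nz:
                continue  # neighbor lies outside the grid
            same_block = (i // bs, j // bs, k // bs) == (ni // bs, nj // bs, nk // bs)
            if not same_block and block_color(i, j, k, bs) == block_color(ni, nj, nk, bs):
                return False
    return True


order = block_red_black_order(8, 8, 8, 2)
independent = same_color_blocks_independent(8, 8, 8, 2)
```

Under these assumptions, every coupling that crosses a block boundary joins a red block to a black one, so the substitutions can sweep all red blocks concurrently and then all black blocks, with one synchronization point between the two colors. Larger blocks keep more of the sequential ordering inside each block, which is the trade-off between convergence and parallelism that the ordering targets.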
