Paper information - The improved BiCGStab method for large and sparse unsymmetric linear systems on parallel distributed memory architectures

This paper proposes an improved version of the BiCGStab method (IBiCGStab) for solving large, sparse linear systems of equations with unsymmetric coefficient matrices. The method combines elements of numerical stability and parallel algorithm design without increasing the computational cost. The algorithm is derived so that all inner products within a single iteration step are independent, and the communication time required for the inner products can be overlapped efficiently with the computation time of the vector updates. The cost of global communication, which is the bottleneck for parallel performance, can therefore be reduced significantly. The resulting IBiCGStab algorithm retains the favorable properties of the original method. A data distribution suitable for both irregularly and regularly structured matrices, based on an analysis of the nonzero matrix elements, is presented. The communication scheme overlaps computation with communication to reduce waiting times. The efficiency of the method is demonstrated by numerical experiments carried out on a massively parallel distributed memory system.
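The benefit of independent inner products can be illustrated with a minimal sketch. It is not the IBiCGStab algorithm from the paper. It only simulates, in a single NumPy process, how several inner products that do not depend on one another can share one global reduction instead of needing one reduction each. The names `split_rows` and `fused_dots` and the block-row distribution are assumptions made for illustration.

```python
import numpy as np


def split_rows(v, nprocs):
    """Split a vector into contiguous blocks, one per simulated process."""
    return np.array_split(v, nprocs)


def fused_dots(pairs, nprocs):
    """Compute several independent inner products with one simulated reduction.

    pairs  : list of (x, y) vector pairs whose dot products are independent.
    nprocs : number of simulated processes (hypothetical block-row layout).
    Returns (global dot products, number of reductions performed).
    """
    # Each simulated process computes a partial sum for every pair
    # using only its own block of the vectors (no communication yet).
    partials = np.zeros((nprocs, len(pairs)))
    for k, (x, y) in enumerate(pairs):
        blocks = zip(split_rows(x, nprocs), split_rows(y, nprocs))
        for p, (xb, yb) in enumerate(blocks):
            partials[p, k] = xb @ yb

    # A single sum over the process axis combines all partial results.
    # In a real distributed code this would be one all-reduce on a short
    # vector, rather than one all-reduce per inner product.
    reductions = 1
    return partials.sum(axis=0), reductions


# Example: three independent inner products reduced together.
rng = np.random.default_rng(0)
r, s, t = (rng.standard_normal(1000) for _ in range(3))
dots, n_reductions = fused_dots([(r, r), (r, s), (s, t)], nprocs=4)
```

In a real distributed-memory code, a non-blocking reduction would be started on the short vector of partial sums, and the local vector updates would proceed while it completes. That is the kind of overlap the abstract describes, but it is not shown in this single-process sketch.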

L.T. Yang | R.P. Brent

[1] H. Martin Bücker, et al. A Variant of the Biconjugate Gradient Method Suitable for Massively Parallel Computing, 1997, IRREGULAR.

[2] Laurence T. Yang, et al. The improved conjugate gradient squared (ICGS) method on parallel distributed memory architectures, 2001, Proceedings International Conference on Parallel Processing Workshops.

[3] Laurence T. Yang, et al. The improved BiCG method for large and sparse linear systems on parallel distributed memory architectures, 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[4] H. Martin Bücker, et al. A Parallel Version of the Quasi-Minimal Residual Method, Based on Coupled Two-Term Recurrences, 1996, PARA.

[5] Jack J. Dongarra, et al. Solving linear systems on vector and shared memory computers, 1990.

[6] Henk A. van der Vorst, et al. Bi-CGSTAB: A Fast and Smoothly Converging Variant of Bi-CG for the Solution of Nonsymmetric Linear Systems, 1992, SIAM J. Sci. Comput.

[7] R. Fletcher. Conjugate gradient methods for indefinite systems, 1976.

[8] C. Lanczos. Solution of Systems of Linear Equations by Minimized Iterations, 1952.

[9] G. Golub, et al. Iterative solution of linear systems, 1991, Acta Numerica.

[10] Christian Vollaire, et al. Electromagnetic Scattering with the Boundary Integral Method on MIMD Systems, 1999, HPCN Europe.

[11] E. Sturler. A Parallel Variant of GMRES(m), 1991.

[12] Laurence Tianruo Yang, et al. Quantitative performance analysis of the improved quasi-minimal residual method on massively distributed memory computers, 2002.

[13] Wanlei Zhou. Proceedings Fifth International Conference on Algorithms and Architectures for Parallel Processing, 2002.

[14] Achim Basermann, et al. Preconditioned CG Methods for Sparse Matrices on Massively Parallel Machines, 1997, Parallel Comput.

[15] Claude Pommerell, et al. Solution of large unsymmetric systems of linear equations, 1992.

[16] H. V. D. Vorst, et al. Reducing the effect of global communication in GMRES(m) and CG on parallel distributed memory computers, 1995.