An Efficient Parallel Algorithm for Extreme Eigenvalues of Sparse Nonsymmetric Matrices

Main memory accesses for shared-memory systems or global communications (synchronizations) in message passing systems decrease the computation speed. In this paper, the standard Arnoldi algorithm for approxi mating a small number of eigenvalues, with largest (or smallest) real parts for nonsymmetric large sparse ma trices, is restructured so that only one synchronization point is required; that is, one global communication in a message passing distributed-memory machine or one global memory sweep in a shared-memory ma chine per each iteration is required. We also introduce an s-step Arnoldi method for finding a few eigenvalues of nonsymmetric large sparse matrices. This method generates reduction matrices that are similar to those generated by the standard method. One iteration of the s-step Arnoldi algorithm corresponds to s itera tions of the standard Arnoldi algorithm. The s-step method has improved data locality, minimized global communication, and superior parallel properties. These algorithms are implemented on a 64-node NCUBE/7 Hypercube and a CRAY-2, and performance results are presented.

[1]  Anthony T. Chronopoulos,et al.  A class of Lanczos-like algorithms implemented on parallel computers , 1991, Parallel Comput..

[2]  Gérard Meurant Multitasking the conjugate gradient method on the CRAY X-MP/48 , 1987, Parallel Comput..

[3]  Y. Saad Variations on Arnoldi's method for computing eigenelements of large unsymmetric matrices , 1980 .

[4]  P. Sadayappan,et al.  Iterative Algorithms for Solution of Large Sparse Systems of Linear Equations on Hypercubes , 1988, IEEE Trans. Computers.

[5]  J. H. Wilkinson The algebraic eigenvalue problem , 1966 .

[6]  Robert E. Benner,et al.  Development of Parallel Methods for a $1024$-Processor Hypercube , 1988 .

[7]  Iain S. Duff,et al.  Sparse matrix calculations C R A Y2 * on the , 2002 .

[8]  Jack Dongarra,et al.  Linear algebra on high performance computers , 1986 .

[9]  Chung-Ta King,et al.  Parallel Algorithm Design Considerations for Hypercube Multiprocessors , 1987, ICPP.

[10]  Y. Saad,et al.  Chebyshev acceleration techniques for solving nonsymmetric eigenvalue problems , 1984 .

[11]  Yousef Saad Partial Eigensolutions of Large Nonsymmetric Matrices. , 1985 .

[12]  Gene H. Golub,et al.  Matrix computations , 1983 .

[13]  P. Saylor,et al.  Leapfrog variants of iterative methods for linear algebraic equations , 1988 .

[14]  Axel Ruhe The two-sided arnoldi algorithm for nonsymmetric eigenvalue problems , 1983 .

[15]  Oliver A. McBryan,et al.  Matrix and vector operations on hypercube parallel processors , 1987, Parallel Comput..

[16]  Anthony T. Chronopoulos,et al.  On the efficient implementation of preconditioned s-step conjugate gradient methods on multiprocessors with memory hierarchy , 1989, Parallel Comput..

[17]  Anthony T. Chronopoulos,et al.  s-step iterative methods for symmetric linear systems , 1989 .