Abstract This paper describes a prototype parallel algorithm for approximating eigenvalues of a dense nonsymmetric matrix on a linear, synchronous processor array. The algorithm is parallel implementation of the explicitly-shifted QR, employing n distributed-memory processors to deliver all eigenvalues in O ( n 2 ) time. The algorithm uses Givens rotations to generate a series of unitary similarity transformations. The rotations are passed between neibouring processors and applied, in pipeline fashion, to columns of the matrix. The rotations are also accumulated in a unitary transformation matrix, enabling the solution of eigenvectors via back-substitution and back-transformation. The algorithm involves only local communication, and confronts the problems of convergence, splitting and updating the shift in a pipelined scheme. The algorithm is implemented on a hypercube, using a ring of processors to simulate a systolic array. Experimental results on the NCUBE/seven hypercube show O ( n ) speedup over competing sequential codes, despite the overhead of interprocessor communication. Speedup and efficiency are estimated by comparing with EISPACK performance.
[1]
Daniel Boley,et al.
Solving the generalized eigenvalue problem on a synchronous linear processor array
,
1986,
Parallel Comput..
[2]
Ahmed Sameh,et al.
On some parallel algorithms on a ring of processors
,
1985
.
[3]
Michael J. Quinn,et al.
Designing Efficient Algorithms for Parallel Computers
,
1987
.
[4]
G. Stewart.
Introduction to matrix computations
,
1973
.
[5]
Michael T. Heath.
Hypercube multiprocessors 1987
,
1987
.
[6]
G. W. Stewart,et al.
A parallel implementation of the QR-algorithm
,
1987,
Parallel Comput..
[7]
William J. Miceli,et al.
Real Time Signal Processing
,
1985
.
[8]
Jack J. Dongarra,et al.
Implementation of some concurrent algorithms for matrix factorization
,
1986,
Parallel Comput..