Parallel Algorithms on the CEDAR System

While significant progress has been made in developing efficient algorithms for vector machines, the same cannot be said of parallel processors which provide concurrency and vectorization along with a complex memory system. The three aspects of these architectures must be addressed simultaneously and are often contradictory; increasing vector length may destroy data locality and thereby lead to inefficient use of the hierarchical memory system. In this paper, we consider the tradeoffs involved in designing algorithms for such architectures and present some results for a single cluster of the CEDAR machine. 22 refs., 14 figs.

[1]  J. H. Wilkinson The algebraic eigenvalue problem , 1966 .

[2]  A. Sameh On Jacobi and Jacobi-I ike Algorithms for a Parallel Computer , 2010 .

[3]  David J. Kuck,et al.  Parallel Computation of Eigenvalues of Real Matrices , 1971, IFIP Congress.

[4]  O. E. Brønlund,et al.  QR-factorization of partitioned matrices: Solution of large systems of linear equations with non-definite coefficient matrices , 1974 .

[5]  James Hardy Wilkinson,et al.  On the stability of Gauss-Jordan elimination with pivoting , 1975, CACM.

[6]  G. Dietrich A new formulation of the hypermatrix Householder-QR decomposition , 1976 .

[7]  Brian T. Smith,et al.  Matrix Eigensystem Routines — EISPACK Guide , 1974, Lecture Notes in Computer Science.

[8]  David J. Kuck,et al.  Practical Parallel Band Triangular System Solvers , 1978, TOMS.

[9]  Franklin T. Luk,et al.  Computing the Singular-Value Decomposition on the ILLIAC IV , 1980, TOMS.

[10]  V. Klema LINPACK user's guide , 1980 .

[11]  Jack J. Dongarra,et al.  A proposal for an extended set of Fortran Basic Linear Algebra Subprograms , 1985, SGNM.

[12]  Jack J. Dongarra,et al.  A fully parallel algorithm for the symmetric eigenvalue problem , 1985, PPSC.

[13]  Christian H. Bischof,et al.  The WY representation for products of householder matrices , 1985, PPSC.

[14]  L. Kaufman,et al.  Squeezing the most out of eigenvalue solvers on high-performance computers , 1986 .

[15]  William Jalby,et al.  Optimizing matrix operations on a parallel multiprocessor with a memory hierarchy , 1986 .

[16]  Jack Dongarra,et al.  Linear algebra on high performance computers , 1986 .

[17]  Jack Dongarra,et al.  On The Implementation Of A Fully Parallel Algorithm For The Symmetric Eigenvalue Problem , 1986, Optics & Photonics.