Implementation of Out-of-Core Cholesky and QR Factorizations with POOCLAPACK

In this paper parallel implementation of out-of-core Cholesky factorization is used to introduce the Parallel Out-of-Core Linear Algebra Package (POOCLAPACK), a flexible infrastructure for parallel implementation of out-of-core linear algebra operations. POOCLAPACK builds on the Parallel Linear Algebra Package (PLAPACK) for in-core parallel dense linear algebra computation. Despite the extreme simplicity of POOCLAPACK, the out-of-core Cholesky factorization implementation is shown to achieve in excess of 80% of peak performance on a 64 node configuration of the Cray T3E-600. The insights gained from examining the Cholesky factorization have been applied to the much more difficult and important QR factorization operation. Preliminary results for parallel implementation of the resulting OOC QR factorization algorithm are included.

[1]  Robert A. van de Geijn,et al.  Using PLAPACK - parallel linear algebra package , 1997 .

[2]  Christian H. Bischof,et al.  The WY representation for products of householder matrices , 1985, PPSC.

[3]  Tom Cwik,et al.  Application of massively parallel computation to integral equation models of electromagnetic scattering , 1994 .

[4]  Jack J. Dongarra,et al.  Solving linear systems on vector and shared memory computers , 1990 .

[5]  D. S. Scott Out of core dense solvers on Intel parallel supercomputers , 1992, [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation.

[6]  R. V. D. Geijn,et al.  A fast solution method for three‐dimensional many‐particle problems of linear elasticity , 1998 .

[7]  J. T. Oden,et al.  Massively parallel computation for acoustical scattering problems using boundary element methods , 1996 .

[8]  D. Sorensen,et al.  Block reduction of matrices to condensed forms for eigenvalue computations , 1990 .

[9]  James Demmel,et al.  LAPACK Users' Guide, Third Edition , 1999, Software, Environments and Tools.

[10]  Wesley C. Reiley Efficient Parallel Out-of-Core Implementation of the Cholesky Factorization , 1999 .

[11]  Sivan Toledo,et al.  The design and implementation of SOLAR, a portable library for scalable out-of-core linear algebra computations , 1996, IOPADS '96.

[12]  Robert A. van de Geijn,et al.  Fast Parallel Kernels for Selected Problems in Control Theory , 1999, PPSC.

[13]  Jack Dongarra,et al.  The Design and Implementation of the Parallel Out-of-coreScaLAPACK LU, QR, and Cholesky Factorization Routines , 1997 .

[14]  Robert A. van de Geijn,et al.  POOCLAPACK: Parallel Out-of-Core Linear Algebra Package , 1999 .

[15]  Jack Dongarra,et al.  ScaLAPACK: a scalable linear algebra library for distributed memory concurrent computers , 1992, [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation.

[16]  S. Lennart Johnsson,et al.  Load-Balanced LU and QR Factor and Solve Routines for Scalable Processors with Scalable I/O , 1994 .

[17]  Robert A. van de Geijn,et al.  A flexible class of parallel matrix multiplication algorithms , 1998, Proceedings of the First Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing.

[18]  Leszek Demkowicz,et al.  Solution of elastic scattering problems in linear acoustics using h-p boundary element method , 1992 .

[19]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.