Efficient Parallel Out-of-Core Implementation of the Cholesky Factorization

In this paper we describe two efficient parallel out-of-core implementations of the Cholesky factorization. We Use the Parallel Out-of-Core Linear Algebra Package (POOCLAPACK) as an extension to the Parallel Linear AIgebra Package (PLAPACK) to implement our out-of-core algorithms. The first algorithm uses in-core kernels with additional code to manage the I/0. This is the classical approach to out-of-core implementations of the Cholesky factorization. Our second algorithm adds an out-of-core implementation of the triangular solve with multiple right hand sides, which doesn''t simply bring code in-core and run the in-core algorithm. This algorithm has the added benefit of requiring fewer copies of the matrix to be in-core at one time, thus allowing more of the matrix to be in-core at one time. Despite the extreme simplicity of POOCLAPACK and our out-of-core algorithm, the out-of-core Cholesky factorization implementation is shown to achieve in excess of 80% of peak performance on a 64 node configuration of the Cray T3E-600.

[1]  Tom Cwik,et al.  Application of massively parallel computation to integral equation models of electromagnetic scattering , 1994 .

[2]  Jack J. Dongarra,et al.  A set of level 3 basic linear algebra subprograms , 1990, TOMS.

[3]  S. Lennart Johnsson,et al.  Load-Balanced LU and QR Factor and Solve Routines for Scalable Processors with Scalable I/O , 1994 .

[4]  James Demmel,et al.  LAPACK Users' Guide, Third Edition , 1999, Software, Environments and Tools.

[5]  Robert A. van de Geijn,et al.  Using PLAPACK - parallel linear algebra package , 1997 .

[6]  Jack J. Dongarra,et al.  An extended set of FORTRAN basic linear algebra subprograms , 1988, TOMS.

[7]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[8]  D. S. Scott Out of core dense solvers on Intel parallel supercomputers , 1992, [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation.

[9]  Sivan Toledo,et al.  The design and implementation of SOLAR, a portable library for scalable out-of-core linear algebra computations , 1996, IOPADS '96.

[10]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[11]  J. T. Oden,et al.  Massively parallel computation for acoustical scattering problems using boundary element methods , 1996 .

[12]  R. V. D. Geijn,et al.  A fast solution method for three‐dimensional many‐particle problems of linear elasticity , 1998 .

[13]  Charles L. Lawson,et al.  Basic Linear Algebra Subprograms for Fortran Usage , 1979, TOMS.

[14]  Jack Dongarra,et al.  ScaLAPACK: a scalable linear algebra library for distributed memory concurrent computers , 1992, [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation.

[15]  Robert A. van de Geijn,et al.  POOCLAPACK: Parallel Out-of-Core Linear Algebra Package , 1999 .

[16]  Leszek Demkowicz,et al.  Solution of elastic scattering problems in linear acoustics using h-p boundary element method , 1992 .

[17]  Jack Dongarra,et al.  The Design and Implementation of the Parallel Out-of-coreScaLAPACK LU, QR, and Cholesky Factorization Routines , 1997 .