Crpc Research Into Linear Algebra Software for High Performance Computers

In this paper we look at a number of approaches being investigated in the Center for Research on Parallel Computation (CRPC) to develop linear algebra soft ware for high-performance computers. These ap proaches are exemplified by the LAPACK, templates, and ARPACK projects. LAPACK is a software library for performing dense and banded linear algebra computa tions, and was designed to run efficiently on high-per formance computers. We focus on the design of the distributed-memory version of LAPACK, and on an ob ject-oriented interface to LAPACK.

[1]  J. Hess,et al.  Calculation of potential flow about arbitrary bodies , 1967 .

[2]  Charles L. Lawson,et al.  Basic Linear Algebra Subprograms for Fortran Usage , 1979, TOMS.

[3]  G. C. Fox,et al.  Solving Problems on Concurrent Processors , 1988 .

[4]  Jack J. Dongarra,et al.  An extended set of FORTRAN basic linear algebra subprograms , 1988, TOMS.

[5]  Jack J. Dongarra,et al.  A set of level 3 basic linear algebra subprograms , 1990, TOMS.

[6]  J.J.H. Wang Generalised moment methods in electromagnetics , 1990 .

[7]  Eric F. van de Velde Data redistribution and concurrency , 1990, Parallel Comput..

[8]  J. Hess Panel Methods in Computational Fluid Dynamics , 1990 .

[9]  J. Nocedal Algorithms and software for large scale optimization. Progress report, August 1, 1989--May 5, 1990 , 1990 .

[10]  R. Harrington Origin and development of the method of moments for field computation , 1990, IEEE Antennas and Propagation Magazine.

[11]  Jack Dongarra,et al.  LAPACK Working Note 24: LAPACK Block Factorization Algorithms on the INtel iPSC/860 , 1990 .

[12]  D. Sorensen,et al.  On the orthogonality of eigenvectors computed by divide-and-conquer techniques , 1991 .

[13]  Jack Dongarra LAPACK Working Note 34: Workshop on the BLACS , 1991 .

[14]  P. Mccarty,et al.  A Cometabolic Biotransformation Model for Halogenated Aliphatic Compounds Exhibiting Product Toxicity , 1991 .

[15]  R. van de Geijn,et al.  A look at scalable dense linear algebra libraries , 1992, Proceedings Scalable High Performance Computing Conference SHPCC-92..

[16]  Danny C. Sorensen,et al.  Implicit Application of Polynomial Filters in a k-Step Arnoldi Method , 1992, SIAM J. Matrix Anal. Appl..

[17]  Jack Dongarra,et al.  ScaLAPACK: a scalable linear algebra library for distributed memory concurrent computers , 1992, [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation.

[18]  D. W. Walker,et al.  Standards for message-passing in a distributed memory environment , 1992 .

[19]  Jack Dongarra,et al.  Level 3 BLAS for distributed memory concurrent computers , 1993 .

[20]  Jack Dongarra,et al.  A Proposal for a User-Level, Message-Passing Interface in a Distributed Memory Environment , 1993 .

[21]  Document for a Standard Message-Passing Interface , 1993 .

[22]  L. Quartapelle The incompressible Navier—Stokes equations , 1993 .

[23]  Corporate The MPI Forum,et al.  MPI: a message passing interface , 1993, Supercomputing '93.

[24]  Alan Edelman,et al.  Large Dense Numerical Linear Algebra in 1993: the Parallel Computing Influence , 1993, Int. J. High Perform. Comput. Appl..

[25]  D. W. Walker,et al.  LAPACK++: a design overview of object-oriented extensions for high performance linear algebra , 1993, Supercomputing '93.

[26]  Robert A. van de Geijn,et al.  Two Dimensional Basic Linear Algebra Communication Subprograms , 1993, PPSC.

[27]  D. Sorensen,et al.  A lanczos‐based technique for exact vibration analysis of skeletal structures , 1993 .

[28]  S. Lennart Johnsson,et al.  Block-Cyclic Dense Linear Algebra , 1993, SIAM J. Sci. Comput..

[29]  W. S. Edwards,et al.  Krylov methods for the incompressible Navier-Stokes equations , 1994 .

[30]  Jaeyoung Choi,et al.  The design of scalable software libraries for distributed memory concurrent computers , 1994, Proceedings of 8th International Parallel Processing Symposium.

[31]  Design of a , 1994 .

[32]  David W. Walker,et al.  The Design of a Standard Message Passing Interface for Distributed Memory Concurrent Computers , 1994, Parallel Comput..

[33]  Jaeyoung Choi,et al.  Pumma: Parallel universal matrix multiplication algorithms on distributed memory concurrent computers , 1994, Concurr. Pract. Exp..

[34]  E. F. Hayes,et al.  Scalable algorithms for three-dimensional reactive scattering: evaluation of a new algorithm for obtaining surface functions , 1994 .

[35]  Robert A. van de Geijn,et al.  Scalability Issues Affecting the Design of a Dense Linear Algebra Library , 1994, J. Parallel Distributed Comput..

[36]  Richard Barrett,et al.  Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods , 1994, Other Titles in Applied Mathematics.

[37]  H. V. D. Vorst,et al.  Application of the Implicitly Updated Arnoldi Method with a Complex Shift-and-Invert Strategy in MHD , 1995 .