Paper Information - CRPC Research into Linear Algebra Software for High Performance Computers

CRPC Research into Linear Algebra Software for High Performance Computers

In this paper we look at a number of approaches being investigated in the Center for Research on Parallel Computation (CRPC) to develop linear algebra software for high-performance computers. These approaches are exemplified by the LAPACK, templates, and ARPACK projects. LAPACK is a software library for performing dense and banded linear algebra computations, and was designed to run efficiently on high-performance computers. We focus on the design of the distributed-memory version of LAPACK, and on an object-oriented interface to LAPACK.
