A set of level 3 basic linear algebra subprograms

This paper describes an extension to the set of Basic Linear Algebra Subprograms (BLAS). The extensions target matrix-matrix operations and are intended to support efficient and portable implementations of algorithms on high-performance computers.
