Formal Methods for High-Performance Linear Algebra Libraries

A colleague of ours, Dr. Timothy Mattson of Intel, once made the following observation: “Literature professors read literature. Computer Science professors should at least occasionally read code.” His point was that to write superior prose one needs to read good (and bad) literature. Analogously, it is our thesis that exposure to elegant (and ugly) programs tends to yield the insights needed to produce consistently well-written code.
