Two Different Data-Parallel Implementations of the BLAS

Massively parallel computer systems, having thousands of identical processors operating in SIMD mode, hold the promise of delivering cost effective computing alternatives for many important problems in scientific computing. Computational linear algebra is of fundamental importance to a large class of compute intensive algorithms. This paper discusses the implementation and performance of the computational BLAS kernels in a data-parallel setting. Two different programming languages are compared and several compiler issues are discussed.

[1]  John R. Gilbert,et al.  Automatic array alignment in data-parallel programs , 1993, POPL '93.

[2]  John R. Gilbert,et al.  Optimal evaluation of array expressions on massively parallel machines , 1995, TOPL.

[3]  Charles L. Lawson,et al.  A proposal for standard linear algebra subprograms , 1973 .

[4]  Rice UniversityCORPORATE,et al.  High performance Fortran language specification , 1993 .

[5]  C. Lawson,et al.  Improving the efficiency of portable software for linear algebra , 1973, SGNM.

[6]  Lynn Elliot Cannon,et al.  A cellular computer to implement the kalman filter algorithm , 1969 .

[7]  Jack J. Dongarra,et al.  A set of level 3 basic linear algebra subprograms , 1990, TOMS.

[8]  Charles F. Van Eecient Matrix Multiplication on Simd Computers , 1992 .

[9]  Petter E. Bjørstad,et al.  Efficient Matrix Multiplication on SIMD Computers , 1992, SIAM J. Matrix Anal. Appl..

[10]  Charles L. Lawson,et al.  Basic Linear Algebra Subprograms for Fortran Usage , 1979, TOMS.

[11]  Jack J. Dongarra,et al.  An extended set of FORTRAN basic linear algebra subprograms , 1988, TOMS.

[12]  Petter E. Bjørstad,et al.  Data-Parallel BLAS as a Basis for LAPACK on Massively Parallel Computers , 1993 .

[13]  Jack Dongarra,et al.  Corrigenda: “An Extended Set of FORTRAN Basic Linear Algebra Subprograms” , 1988, TOMS.

[14]  Jack J. Dongarra,et al.  Algorithm 679: A set of level 3 basic linear algebra subprograms: model implementation and test programs , 1990, TOMS.