Two Different Data-Parallel Implementations of the BLAS