Scalable and Modular Algorithms for Floating-Point Matrix Multiplication on Reconfigurable Computing Systems
暂无分享,去创建一个
[1] T. El-Ghazawi,et al. Comparative Analysis of High Level Programming for Reconfigurable Computers: Methodology and Empirical Study , 2007, 2007 3rd Southern Conference on Programmable Logic.
[2] Lynn Elliot Cannon,et al. A cellular computer to implement the kalman filter algorithm , 1969 .
[3] Viktor K. Prasanna,et al. Sparse Matrix-Vector multiplication on FPGAs , 2005, FPGA '05.
[4] Viktor K. Prasanna,et al. Energy-Efficient Matrix Multiplication on FPGAs , 2002, FPL.
[5] Viktor K. Prasanna,et al. High Performance Linear Algebra Operations on Reconfigurable Systems , 2005, ACM/IEEE SC 2005 Conference (SC'05).
[6] H. T. Kung,et al. I/O complexity: The red-blue pebble game , 1981, STOC '81.
[7] Yong Dou,et al. 64-bit floating-point FPGA matrix multiplication , 2005, FPGA '05.
[8] Mohamed Taher,et al. Reconfigurable computers: an empirical analysis (abstract only) , 2005, FPGA '05.
[9] Ansi Ieee,et al. IEEE Standard for Binary Floating Point Arithmetic , 1985 .
[10] IEEE Transactions on Parallel and Distributed Systems, Vol. 13 , 2002 .
[11] Abbes Amira,et al. An FPGA based parameterizable system for matrix product implementation , 2002, IEEE Workshop on Signal Processing Systems.
[12] Scott McMillan,et al. A re-evaluation of the practicality of floating-point operations on FPGAs , 1998, Proceedings. IEEE Symposium on FPGAs for Custom Computing Machines (Cat. No.98TB100251).
[13] Karl S. Hemmert,et al. Closing the gap: CPU and FPGA trends in sustainable floating-point BLAS performance , 2004, 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines.
[14] Victor Y. Pan,et al. Parallel matrix multiplication on a linear array with a reconfigurable pipelined bus system , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.
[15] Robert A. van de Geijn,et al. SUMMA: scalable universal matrix multiplication algorithm , 1995, Concurr. Pract. Exp..
[16] Viktor K. Prasanna,et al. A Library of Parameterizable Floating-Point Cores for FPGAs and Their Application to Scientific Computing , 2005, ERSA.
[17] Jaeyoung Choi,et al. Pumma: Parallel universal matrix multiplication algorithms on distributed memory concurrent computers , 1994, Concurr. Pract. Exp..
[18] Viktor K. Prasanna,et al. Area and time efficient implementations of matrix multiplication on FPGAs , 2002, 2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings..
[19] Geoffrey C. Fox,et al. Matrix algorithms on a hypercube I: Matrix multiplication , 1987, Parallel Comput..
[20] Viktor K. Prasanna,et al. Scalable and modular algorithms for floating-point matrix multiplication on FPGAs , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..