Portable and scalable FPGA-based acceleration of a direct linear system solver
[1] Endong Wang, et al. Intel Math Kernel Library, 2014.
[2] Michael J. Flynn, et al. PAM-Blox: high performance FPGA design for adaptive computing, 1998, Proceedings. IEEE Symposium on FPGAs for Custom Computing Machines (Cat. No.98TB100251).
[3] James Demmel, et al. Benchmarking GPUs to tune dense linear algebra, 2008, HiPC 2008.
[4] Jack Dongarra, et al. Numerical Linear Algebra for High-Performance Computers, 1998.
[5] R. Stephenson. A and V, 1962, The British journal of ophthalmology.
[6] Laurie A. Smith King, et al. Vforce: An Extensible Framework for Reconfigurable Supercomputing, 2007, Computer.
[7] Jack S. N. Jean, et al. Mapping of generalized template matching onto reconfigurable computers, 2003, IEEE Trans. Very Large Scale Integr. Syst.
[8] James Demmel, et al. Benchmarking GPUs to tune dense linear algebra, 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.
[9] J. Demmel, et al. Sun Microsystems, 1996.
[10] Viktor K. Prasanna, et al. High-Performance and Parameterized Matrix Factorization on FPGAs, 2006, 2006 International Conference on Field Programmable Logic and Applications.
[11] Viktor K. Prasanna, et al. Efficient Floating-point Based Block LU Decomposition on FPGAs, 2004, ERSA.
[12] Viktor K. Prasanna, et al. Sparse Matrix-Vector multiplication on FPGAs, 2005, FPGA '05.
[13] André DeHon, et al. Floating-point sparse matrix-vector multiply for FPGAs, 2005, FPGA '05.
[14] Viktor K. Prasanna, et al. Sparse Matrix Computations on Reconfigurable Hardware, 2007, Computer.
[15] Karl S. Hemmert, et al. Closing the gap: CPU and FPGA trends in sustainable floating-point BLAS performance, 2004, 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines.
[16] George A. Constantinides, et al. A High Throughput FPGA-based Floating Point Conjugate Gradient Implementation, 2008, ARC.
[17] Ed Anderson, et al. LAPACK Users' Guide, 1995.
[18] Wei Zhang, et al. Portable and scalable FPGA-based acceleration of a direct linear system solver, 2008, 2008 International Conference on Field-Programmable Technology.
[19] Viktor K. Prasanna, et al. High-Performance Designs for Linear Algebra Operations on Reconfigurable Hardware, 2008, IEEE Transactions on Computers.
[20] Karl S. Hemmert, et al. Embedded floating-point units in FPGAs, 2006, FPGA '06.
[21] Gregory D. Peterson, et al. High-Performance Mixed-Precision Linear Solver for FPGAs, 2008, IEEE Transactions on Computers.
[22] W. Hager. Applied Numerical Linear Algebra, 1987.