Optimizing memory bandwidth use and performance for matrix-vector multiplication in iterative methods
[1] Louis O. Hertzberger, et al. Time complexity of a parallel conjugate gradient solver for light scattering simulations: theory and SPMD implementation, 1992.
[2] Viktor K. Prasanna, et al. High-Performance Reduction Circuits Using Deeply Pipelined Operators on FPGAs, 2007, IEEE Transactions on Parallel and Distributed Systems.
[3] Gene H. Golub, et al. Matrix computations (3rd ed.), 1996.
[4] A. Mercer. Numerical Solution of Ordinary and Partial Differential Equations, 1963.
[5] George A. Constantinides, et al. An FPGA-based implementation of the MINRES algorithm, 2008, 2008 International Conference on Field Programmable Logic and Applications.
[6] Granville Sewell, et al. Initial Value Ordinary Differential Equations, 1988.
[7] Robert H. Halstead, et al. Matrix Computations, 2011, Encyclopedia of Parallel Computing.
[8] Richard Barrett, et al. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 1994, Other Titles in Applied Mathematics.
[9] Viktor K. Prasanna, et al. Sparse Matrix-Vector multiplication on FPGAs, 2005, FPGA '05.
[10] George A. Constantinides, et al. Optimising Memory Bandwidth Use for Matrix-Vector Multiplication in Iterative Methods, 2010, ARC.
[11] Viktor K. Prasanna, et al. A Hybrid Approach for Mapping Conjugate Gradient onto an FPGA-Augmented Reconfigurable Supercomputer, 2006, 2006 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines.
[12] Wayne L. Winston. Introduction to Mathematical Programming: Applications and Algorithms, 1990.
[13] Warren J. Gross, et al. Sparse Matrix-Vector Multiplication for Finite Element Method Matrices on FPGAs, 2006, 2006 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines.
[14] Wei Zhang, et al. Portable and scalable FPGA-based acceleration of a direct linear system solver, 2008, 2008 International Conference on Field-Programmable Technology.
[15] Michael T. Heath, et al. Scientific Computing, 2018.
[16] André DeHon, et al. Floating-point sparse matrix-vector multiply for FPGAs, 2005, FPGA '05.
[17] Eric C. Kerrigan, et al. A floating-point solver for band structured linear equations, 2008, 2008 International Conference on Field-Programmable Technology.
[18] George A. Constantinides, et al. A High Throughput FPGA-based Floating Point Conjugate Gradient Implementation, 2008, ARC.
[19] Viktor K. Prasanna, et al. High-Performance and Parameterized Matrix Factorization on FPGAs, 2006, 2006 International Conference on Field Programmable Logic and Applications.
[20] Jack Poulson, et al. Scientific computing, 2013, XRDS.