Mapping Floating-Point Kernels onto High Performance Reconfigurable Computers
暂无分享,去创建一个
[1] Khalid H. Abed,et al. Achieving true parallelism on a High Performance Heterogeneous Computer via a threaded programming model , 2011, 2011 Proceedings of IEEE Southeastcon.
[2] Viktor K. Prasanna,et al. High-Performance Reduction Circuits Using Deeply Pipelined Operators on FPGAs , 2007, IEEE Transactions on Parallel and Distributed Systems.
[3] David A. Patterson,et al. Computer Architecture: A Quantitative Approach , 1969 .
[4] Khalid H. Abed,et al. Design Heuristics for Mapping Floating-Point Scientific Computational Kernels onto High Performance Reconfigurable Computers , 2009, J. Comput..
[5] Khalid H. Abed,et al. Integrating Quartus Wizard-based VHDL floating-point components into a high performance heterogeneous computing environment , 2011, 2011 Proceedings of IEEE Southeastcon.
[6] Martin C. Herbordt,et al. Achieving High Performance with FPGA-Based Computing , 2007, Computer.
[7] Khalid H. Abed,et al. Mapping Hierarchical Multiple File VHDL Kernels onto an SRC-7 High Performance Reconfigurable Computer , 2010, 2010 DoD High Performance Computing Modernization Program Users Group Conference.
[8] D Fountain. Algotronix: the first custom computer , 1991 .
[9] Viktor K. Prasanna,et al. High-performance FPGA-based general reduction methods , 2005, 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM'05).
[10] Tarek A. El-Ghazawi,et al. Performance of sorting algorithms on the SRC 6 reconfigurable computer , 2005, Proceedings. 2005 IEEE International Conference on Field-Programmable Technology, 2005..
[11] A. Folkesson. Analysis of numerical methods , 2011 .
[12] Viktor K. Prasanna,et al. High Performance Linear Algebra Operations on Reconfigurable Systems , 2005, ACM/IEEE SC 2005 Conference (SC'05).
[13] Viktor K. Prasanna,et al. An FPGA-Based Application-Specific Processor for Efficient Reduction of Multiple Variable-Length Floating-Point Data Sets , 2006, IEEE 17th International Conference on Application-specific Systems, Architectures and Processors (ASAP'06).
[14] Khalid H. Abed,et al. FPGA-based implementation of Horner's rule on a high performance heterogeneous computer , 2011, 2011 Proceedings of IEEE Southeastcon.
[15] Khalid H. Abed,et al. Improving Performance of Codes with Large/Irregular Stride Memory Access Patterns via High Performance Reconfigurable Computers , 2009, 2009 DoD High Performance Computing Modernization Program Users Group Conference.
[16] Itamar Arel,et al. A condensation-based application of Cramer's rule for solving large-scale linear systems , 2012, J. Discrete Algorithms.
[17] Youcef Saad,et al. A Basic Tool Kit for Sparse Matrix Computations , 1990 .
[18] Viktor K. Prasanna,et al. Design tradeoffs for BLAS operations on reconfigurable hardware , 2005, 2005 International Conference on Parallel Processing (ICPP'05).
[19] Khalid H. Abed,et al. Accelerating a Sparse Matrix Iterative Solver Using a High Performance Reconfigurable Computer , 2010, 2010 DoD High Performance Computing Modernization Program Users Group Conference.
[20] Gerald Estrin,et al. Organization of computer systems: the fixed plus variable structure computer , 1960, IRE-AIEE-ACM '60 (Western).
[21] David A. Patterson,et al. Computer Architecture - A Quantitative Approach (4. ed.) , 2007 .
[22] Viktor K. Prasanna,et al. Sparse Matrix Computations on Reconfigurable Hardware , 2007, Computer.
[23] Ray Morrison,et al. The Company History , 2003 .