Mapping Floating-Point Kernels onto High Performance Reconfigurable Computers

Contemporary field programmable gate arrays (FPGAs) combine the fine-grained design capability of the traditional lookup table with the speed of medium-scale and large-scale logic components such as RAM blocks or DSP blocks to provide for significant computational capability from a single FPGA. High performance reconfigurable computers, which typically use FPGAs as computational elements, have been commercially used to accelerate computational kernels. However, the deep pipelines and extensive parallelism needed for FPGAs to compete with GHz-scale general purpose processors make mapping of floating-point kernels a challenging research area. In this paper, we describe some of the progress that has been made towards solving some of these mapping challenges.

[1]  Khalid H. Abed,et al.  Achieving true parallelism on a High Performance Heterogeneous Computer via a threaded programming model , 2011, 2011 Proceedings of IEEE Southeastcon.

[2]  Viktor K. Prasanna,et al.  High-Performance Reduction Circuits Using Deeply Pipelined Operators on FPGAs , 2007, IEEE Transactions on Parallel and Distributed Systems.

[3]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[4]  Khalid H. Abed,et al.  Design Heuristics for Mapping Floating-Point Scientific Computational Kernels onto High Performance Reconfigurable Computers , 2009, J. Comput..

[5]  Khalid H. Abed,et al.  Integrating Quartus Wizard-based VHDL floating-point components into a high performance heterogeneous computing environment , 2011, 2011 Proceedings of IEEE Southeastcon.

[6]  Martin C. Herbordt,et al.  Achieving High Performance with FPGA-Based Computing , 2007, Computer.

[7]  Khalid H. Abed,et al.  Mapping Hierarchical Multiple File VHDL Kernels onto an SRC-7 High Performance Reconfigurable Computer , 2010, 2010 DoD High Performance Computing Modernization Program Users Group Conference.

[8]  D Fountain Algotronix: the first custom computer , 1991 .

[9]  Viktor K. Prasanna,et al.  High-performance FPGA-based general reduction methods , 2005, 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM'05).

[10]  Tarek A. El-Ghazawi,et al.  Performance of sorting algorithms on the SRC 6 reconfigurable computer , 2005, Proceedings. 2005 IEEE International Conference on Field-Programmable Technology, 2005..

[11]  A. Folkesson Analysis of numerical methods , 2011 .

[12]  Viktor K. Prasanna,et al.  High Performance Linear Algebra Operations on Reconfigurable Systems , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[13]  Viktor K. Prasanna,et al.  An FPGA-Based Application-Specific Processor for Efficient Reduction of Multiple Variable-Length Floating-Point Data Sets , 2006, IEEE 17th International Conference on Application-specific Systems, Architectures and Processors (ASAP'06).

[14]  Khalid H. Abed,et al.  FPGA-based implementation of Horner's rule on a high performance heterogeneous computer , 2011, 2011 Proceedings of IEEE Southeastcon.

[15]  Khalid H. Abed,et al.  Improving Performance of Codes with Large/Irregular Stride Memory Access Patterns via High Performance Reconfigurable Computers , 2009, 2009 DoD High Performance Computing Modernization Program Users Group Conference.

[16]  Itamar Arel,et al.  A condensation-based application of Cramer's rule for solving large-scale linear systems , 2012, J. Discrete Algorithms.

[17]  Youcef Saad,et al.  A Basic Tool Kit for Sparse Matrix Computations , 1990 .

[18]  Viktor K. Prasanna,et al.  Design tradeoffs for BLAS operations on reconfigurable hardware , 2005, 2005 International Conference on Parallel Processing (ICPP'05).

[19]  Khalid H. Abed,et al.  Accelerating a Sparse Matrix Iterative Solver Using a High Performance Reconfigurable Computer , 2010, 2010 DoD High Performance Computing Modernization Program Users Group Conference.

[20]  Gerald Estrin,et al.  Organization of computer systems: the fixed plus variable structure computer , 1960, IRE-AIEE-ACM '60 (Western).

[21]  David A. Patterson,et al.  Computer Architecture - A Quantitative Approach (4. ed.) , 2007 .

[22]  Viktor K. Prasanna,et al.  Sparse Matrix Computations on Reconfigurable Hardware , 2007, Computer.

[23]  Ray Morrison,et al.  The Company History , 2003 .