Definition and SIMD Implementation of a Multi-Processing Architecture Approach on FPGA

In a context of high performance, low technology access cost and application code reusability objectives, this paper presents an "architectured FPGA" approach that consists in the definition of a general frame for embedded system application implementations. Addressing image processing as a first application domain, a FPGA architecture implementation based on that approach is presented. Built around SIMD architecture, the "Ter@Core" FPGA implementation illustrates the competitiveness of the approach compared to off-the-shelf processors and to usual FPGA approach. The presented implementation gathers 128 processing elements on a single FPGA providing 19.2 GOPS performance and very high application development productivity.

[1]  Alex K. Jones,et al.  A 64-way VLIW/SIMD FPGA architecture and design flow , 2004, Proceedings of the 2004 11th IEEE International Conference on Electronics, Circuits and Systems, 2004. ICECS 2004..

[2]  Shen Chih Tung,et al.  An 88-way multiprocessor within an FPGA with customizable instructions , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[3]  A. de Luca,et al.  SIMD architecture for image segmentation using Sobel operators implemented in FPGA technology , 2005, 2005 2nd International Conference on Electrical and Electronics Engineering.

[4]  Ed F. Deprettere,et al.  Efficient Automated Synthesis, Programing, and Implementation of Multi-Processor Platforms on FPGA Chips , 2006, 2006 International Conference on Field Programmable Logic and Applications.

[5]  Gordon J. Brebner,et al.  Micro-Coded Datapaths: Populating the Space Between Finite State Machine and Processor , 2006, 2006 International Conference on Field Programmable Logic and Applications.

[6]  Paul Wielage,et al.  XETAL-II: A 107 GOPS, 600mW Massively-Parallel Processor for Video Scene Analysis , 2007, 2007 IEEE International Solid-State Circuits Conference. Digest of Technical Papers.

[7]  R.P. Kleihorst,et al.  Xetal-II: A 107 GOPS, 600 mW Massively Parallel Processor for Video Scene Analysis , 2008, IEEE Journal of Solid-State Circuits.