Parameterized Hardware Design on Reconfigurable Computers: An Image Processing Case Study

Reconfigurable Computers (RCs) with hardware (FPGA) co-processors can achieve significant performance improvements over traditional microprocessor (µP)-based computers for many scientific applications. The achievable speedup depends on the intrinsic parallelism of the target application as well as the characteristics of the target platform. In this work, we use image processing applications as a case study to demonstrate how hardware designs are parameterized by the co-processor architecture, particularly the data I/O, i.e., the local memory of the FPGA device and the interconnect between the FPGA and the µP. Applications that access data randomly must use the local memory; image registration is a typical example. In contrast, an application such as edge detection can read data directly through the interconnect in a sequential fashion. Two image registration algorithms, the exhaustive search algorithm and the Discrete Wavelet Transform (DWT)-based search algorithm, are implemented in hardware on the Xilinx Virtex-II Pro 50 FPGA of the Cray XD1 reconfigurable computer. The hardware implementations achieve speedups of 10× and 2×, respectively. For the category of applications that read directly from the interconnect, the hardware implementation of Canny edge detection achieves a 544× speedup.
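To make the access-pattern distinction concrete, the following is a minimal software sketch of exhaustive-search registration, not the hardware design described here. It assumes a translation-only search space and a mean-squared-difference score, neither of which is specified above; the names `shift_score`, `exhaustive_search`, and `make_shifted_pair` are hypothetical. The point it illustrates is that every candidate transform re-reads the target image at offset addresses, which is why such an algorithm would need the whole image held in local memory rather than streamed once through the interconnect.

```python
import random


def shift_score(ref, tgt, dy, dx):
    """Mean squared difference between ref[y][x] and tgt[y - dy][x - dx],
    computed over the region where both pixels exist."""
    h, w = len(ref), len(ref[0])
    total, count = 0, 0
    for y in range(max(0, dy), min(h, h + dy)):
        for x in range(max(0, dx), min(w, w + dx)):
            # The target is read at a transformed address. For richer
            # transforms (e.g. rotation) these reads are not sequential.
            diff = ref[y][x] - tgt[y - dy][x - dx]
            total += diff * diff
            count += 1
    return total / count if count else float("inf")


def exhaustive_search(ref, tgt, max_shift):
    """Try every (dy, dx) in [-max_shift, max_shift]^2 and return the best.

    Assumption: translation-only search; each candidate re-reads the
    whole overlap region of both images.
    """
    best = None
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            score = shift_score(ref, tgt, dy, dx)
            if best is None or score < best[0]:
                best = (score, dy, dx)
    return best[1], best[2]


def make_shifted_pair(h, w, dy, dx, seed=0):
    """Build a random target image and a reference that is the target
    shifted by (dy, dx); pixels with no source are set to 0."""
    rng = random.Random(seed)
    tgt = [[rng.randrange(256) for _ in range(w)] for _ in range(h)]
    ref = [[tgt[y - dy][x - dx] if 0 <= y - dy < h and 0 <= x - dx < w else 0
            for x in range(w)] for y in range(h)]
    return ref, tgt
```

Calling `exhaustive_search(ref, tgt, 4)` on a pair built with `make_shifted_pair(16, 16, 2, -3)` visits 81 candidate shifts, each of which scans the overlap region again. A streaming operator such as edge detection, by comparison, touches each input pixel once in raster order.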
