Accelerating matrix product on reconfigurable hardware for image processing applications

Matrix multiplication is very important in many types of applications including image and signal processing. The suitability of reconfigurable hardware devices, in the form of field programmable gate arrays (FPGAs), is investigated as a low-cost solution for implementing two matrix multipliers for 3-D affine transformations and colour space conversion. A first solution based on processing large matrix multiplication, for large 3-D models, and for the evaluation of the Celoxica fixed-point library and Xilinx CoreGen performance has been reported. A novel architecture for efficient implementation of a colour space converter (CSC) based on distributed arithmetic (DA) principles has been presented. The two multipliers have been developed and implemented on the RC1000-PP Celoxica board-based development platform. Results show that the FPGA-based first parallel multiplier can achieve the performance of a graphics card when performing 3-D affine transformations, while the second multiplier, which is fully pipelined and platform-independent, has a low latency (8 cycles) and is capable of a sustained data rate of over 234 mega-conversions per second.

[1]  Mark A. Willis 3-D computer graphics , 1983 .

[2]  Satnam Singh,et al.  Virtual hardware for graphics applications using FPGAs , 1994, Proceedings of IEEE Workshop on FPGA's for Custom Computing Machines.

[3]  Abbes Amira,et al.  An FPGA based coprocessor for large matrix product implementation , 2003, Proceedings. 2003 IEEE International Conference on Field-Programmable Technology (FPT) (IEEE Cat. No.03EX798).

[4]  F. Bensaali,et al.  An FPGA implementation of 3D affine transformations , 2003, 10th IEEE International Conference on Electronics, Circuits and Systems, 2003. ICECS 2003. Proceedings of the 2003.

[5]  Abbes Amira,et al.  Novel FPGA implementations of Walsh-Hadamard transforms for signal processing , 2001 .

[6]  David Eadie,et al.  Correction of geometric image distortion using FPGAs , 2003, SPIE OPTO-Ireland.

[7]  Wayne Luk,et al.  Customising graphics applications: techniques and programming interface , 2000, Proceedings 2000 IEEE Symposium on Field-Programmable Custom Computing Machines (Cat. No.PR00871).

[8]  Joan L. Mitchell,et al.  MPEG Video: Compression Standard , 1996 .

[9]  Abbes Amira,et al.  Design of efficient architectures for discrete orthogonal transforms using bit level systolic structures , 2002 .

[10]  S. David,et al.  Human Face Detection and Tracking using Skin Color Modeling and Connected Component Operators , 2002 .