Heterogeneity exploration for multiple 2D filter designs

Many image processing applications require fast convolution of an image with a set of large 2D filters. Field-programmable gate arrays (FPGAs) are often used to achieve this goal due to their fine grain parallelism and reconfigurability. This paper presents a novel algorithm for the class of designs that implement a convolution with a set of 2D filters. Firstly, it explores the heterogeneous nature of modern reconfigurable devices using a singular value decomposition based algorithm, which orders the coefficients according to their impact to the filters' approximation. Secondly, it exploits any redundancy that exists within each filter and between different filters in the set, leading to designs with minimized area. Experiments with real filter sets from computer vision applications demonstrate up to 60% reduction in the required area.

[1]  Peter Y. K. Cheung,et al.  Migrating functionality from ROMs to embedded multipliers , 2004, 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines.

[2]  Christos-Savvas Bouganis,et al.  A novel 2D filter design methodology for heterogeneous devices , 2005, 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM'05).

[3]  G. Strang Introduction to Linear Algebra , 1993 .

[4]  Amnon Shashua,et al.  Linear image coding for regression and classification using the tensor-rank principle , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5]  William H. Press,et al.  Numerical recipes in C , 2002 .

[6]  Wayne Luk,et al.  Wordlength optimization for linear digital signal processing , 2003, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[7]  Christos-Savvas Bouganis,et al.  A novel 2D filter design methodology , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[8]  Shaogang Gong,et al.  Dynamic Vision - From Images to Face Recognition , 2000 .

[9]  Johan Håstad,et al.  Tensor Rank is NP-Complete , 1989, ICALP.

[10]  Joseph JáJá Optimal Evaluation of Pairs of Bilinear Forms , 1978, STOC.

[11]  Steven J. E. Wilton,et al.  SMAP: heterogeneous technology mapping for area reduction in FPGAs with embedded memory arrays , 1998, FPGA '98.

[12]  A. Dempster,et al.  Use of minimum-adder multiplier blocks in FIR digital filters , 1995 .

[13]  J. Ja' Optimal evaluation of pairs of bilinear forms , 1978, STOC 1978.

[14]  Christos-Savvas Bouganis,et al.  A Steerable Complex Wavelet Construction and Its Implementation on FPGA , 2004, FPL.

[15]  D. Kodek Design of optimal finite wordlength FIR digital filters using integer programming techniques , 1980 .

[16]  Israel Koren Computer arithmetic algorithms , 1993 .