A Modified Sliding Window Architecture for Efficient BRAM Resource Utilization

Sliding window is one of the most commonly used techniques in image processing algorithms. Implementing it in hardware requires buffering image rows on-chip to exploit data locality and avoid redundant off-chip pixel transfers. However, scaling this architecture to large window sizes or high resolutions linearly increases on-chip memory utilization. This imposes limitations on porting many image processing algorithms into hardware efficiently. In this paper, we propose a new sliding window architecture that utilizes less on-chip memory resources while maintaining performance as compared to the traditional method. The proposed architecture exploits that most natural images have smooth color variations with fine details in between these variations to compress images. It decomposes non-zero image pixels into their wavelet components and represents each wavelet coefficient with a minimum number of bits. The architecture is also flexible to use lossless or lossy compression based on a configurable threshold value. The FPGA implementation of our proposed architecture shows memory saving of 25-70% compared to the traditional architecture using lossless compression, and for lossy compression with up to a mean square error of 5 achieves up to 84% in memory savings.

[1]  Bolei Zhou,et al.  Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[2]  Y. Dong,et al.  A Design Space Exploration Algorithm Incompiling Window Operation onto Reconfigurable Hardware , 2008 .

[3]  Oana Boncalo,et al.  An FPGA sliding window-based architecture harris corner detector , 2014, 2014 24th International Conference on Field Programmable Logic and Applications (FPL).

[4]  Frank Vahid,et al.  Scalable object detection accelerators on FPGAs using custom design space exploration , 2011, 2011 IEEE 9th Symposium on Application Specific Processors (SASP).

[5]  Donald G. Bailey,et al.  Exploring the implementation of JPEG compression on FPGA , 2012, 2012 6th International Conference on Signal Processing and Communication Systems.

[6]  Greg Brown,et al.  A performance and energy comparison of FPGAs, GPUs, and multicores for sliding-window applications , 2012, FPGA '12.

[7]  Liu Jian,et al.  Integer Haar wavelet for remote sensing image compression , 2002, 6th International Conference on Signal Processing, 2002..

[9]  Greg Stitt,et al.  A Parallel Sliding-Window Generator for High-Performance Digital-Signal Processing on FPGAs , 2016, ACM Trans. Reconfigurable Technol. Syst..

[10]  Miriam Leeser,et al.  Automatic Sliding Window Operation Optimization for FPGA-Based , 2006, 2006 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines.

[11]  Sudipta Mahapatra,et al.  Efficient FPGA implementation of DWT and modified SPIHT for lossless image compression , 2007, J. Syst. Archit..

[12]  M. Leeser,et al.  Optimizing data intensive window-based image processing on reconfigurable hardware boards , 2005, IEEE Workshop on Signal Processing Systems Design and Implementation, 2005..

[13]  Wei Hai,et al.  FPGA architecture for real-time barrel distortion correction of colour images , 2011, 2011 IEEE International Conference on Multimedia and Expo.