An efficient architecture solution for low-power real-time background subtraction

Embedded vision is a rapidly growing market with a host of challenging algorithms. Among vision algorithms, Mixture of Gaussian (MoG) background subtraction is a frequently used kernel involving massive computation and communication. Tremendous challenges need to be reolved to provide MoG's high computation and communication demands with minimal power consumption allowing its embedded deployment. This paper proposes a customized architecture for power-efficient realization of MoG background subtraction operating at Full-HD resolution. Our design process benefits from system-level design principles. An SLDL-captured specification (result of high-level explorations) serves as a specification for architecture realization and hand-crafted RTL design. To optimize the architecture, this paper employs a set of optimization techniques including parallelism extraction, algorithm tuning, operation width sizing and deep pipelining. The final MoG implementation consists of 77 pipeline stages operating at 148.5 MHz implemented on a Zynq-7000 SoC. Furthermore, our background subtraction solution is flexible allowing end users to adjust algorithm parameters according to scene complexity. Our results demonstrate a very high efficiency for both indoor and outdoor scenes with 145 mW on-chip power consumption and more than 600× speedup over software execution on ARM Cortex A9 core.

[1]  Vamsi Boppana,et al.  Xilinx Zynq-7000 EPP: An extensible processing platform family , 2011, 2011 IEEE Hot Chips 23 Symposium (HCS).

[2]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[3]  J. Bier Implementing Vision Capabilities in Embedded Sysstems , 2011 .

[4]  Aishy Amer,et al.  Embedded architecture for noise-adaptive video object detection using parameter-compressed background modeling , 2014, Journal of Real-Time Image Processing.

[5]  Gunar Schirner,et al.  Power-efficient real-time solution for adaptive vision algorithms , 2015, IET Comput. Digit. Tech..

[6]  Christopher Dyken,et al.  State-of-the-art in heterogeneous computing , 2010, Sci. Program..

[7]  Marek Wojcikowski,et al.  FPGA-Based Real-Time Implementation of Detection Algorithm for Automatic Traffic Surveillance Sensor Network , 2012, J. Signal Process. Syst..

[8]  Chandrika Kamath,et al.  Robust techniques for background subtraction in urban traffic video , 2004, IS&T/SPIE Electronic Imaging.

[9]  Gunar Schirner,et al.  Algorithm and architecture co-design of Mixture of Gaussian (MoG) background subtraction for embedded vision , 2013, 2013 Asilomar Conference on Signals, Systems and Computers.

[10]  Andrew Hunter,et al.  A single-chip FPGA implementation of real-time adaptive background model , 2005, Proceedings. 2005 IEEE International Conference on Field-Programmable Technology, 2005..

[11]  Ahmed Hemani,et al.  Parallel distributed scalable runtime address generation scheme for a coarse grain reconfigurable computation and storage fabric , 2014, Microprocess. Microsystems.

[12]  Juanjo Noguera,et al.  Zynq-7000 All Programmable SoC Accelerator for Floating-Point Matrix Multiplication using Vivado HLS , 2013 .

[13]  Chandrika Kamath,et al.  Robust Background Subtraction with Foreground Validation for Urban Traffic Video , 2005, EURASIP J. Adv. Signal Process..

[14]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[15]  Jason Schlessman,et al.  Heterogeneous MPSoC Architectures for Embedded Computer Vision , 2007, 2007 IEEE International Conference on Multimedia and Expo.