Embedded architecture for noise-adaptive video object detection using parameter-compressed background modeling

Video processing algorithms are computationally intensive and place stringent requirements on performance and efficiency of memory bandwidth and capacity. As such, efficient hardware accelerations are inevitable for fast video processing systems. In this paper, we propose resource- and power-optimized FPGA-based configurable architecture for video object detection by integrating noise estimation, Mixture-of-Gaussian background modeling, motion detection, and thresholding. Due to large amount of background modeling parameters, we propose a novel Gaussian parameter compression technique suitable for resource- and power-constraint embedded video systems. The proposed architecture is simulated, synthesized and verified for its functionality, accuracy and performance on a Virtex-5 FPGA-based embedded platform by directly interfacing to a digital video input. Intentional exploitation of heterogeneous resources in FPGAs, and advanced design techniques such as heavy pipelining and data parallelism yield real-time processing of HD-1080p video streams at 30 frames per second. Objective and subjective evaluations to existing hardware-based methods show that the proposed architecture obtains orders of magnitude performance improvements, while utilizing minimal hardware resources. This work is an early attempt to devise a complete video surveillance system onto a stand-alone resource-constraint FPGA-based smart camera.

[1]  Kentaro Toyama,et al.  Wallflower: principles and practice of background maintenance , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[2]  Yin-Tien Wang,et al.  Moving Object Detection Using Monocular Vision , 2012, IAS.

[3]  G.M. Aly,et al.  JPEG encoder for low-cost FPGAs , 2007, 2007 International Conference on Computer Engineering & Systems.

[4]  Jason Cong,et al.  FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs , 2009, 2009 IEEE 7th Symposium on Application Specific Processors.

[5]  Larry S. Davis,et al.  Efficient Kernel Density Estimation Using the Fast Gauss Transform with Applications to Color Modeling and Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Aishy Amer,et al.  An FPGA-Based Implementation of Spatio-Temporal Object Segmentation , 2006, 2006 International Conference on Image Processing.

[7]  Wayne Luk,et al.  Performance Comparison of Graphics Processors to Reconfigurable Logic: A Case Study , 2010, IEEE Transactions on Computers.

[8]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Guojun Lu,et al.  Segmentation of moving objects in image sequence: A review , 2001 .

[10]  Viktor Öwall,et al.  An Embedded Real-Time Surveillance System: Implementation and Evaluation , 2008, J. Signal Process. Syst..

[11]  Aishy Amer,et al.  Hysteresis-based selective Gaussian-mixture model for real-time background update , 2007, Electronic Imaging.

[12]  Zoran Zivkovic,et al.  Improved adaptive Gaussian mixture model for background subtraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[13]  Ettore Napoli,et al.  FPGA-based architecture for real time segmentation and denoising of HD video , 2013, Journal of Real-Time Image Processing.

[14]  Aishy Amer,et al.  An FPGA Architecture of Stable-Sorting on a Large Data Volume : Application to Video Signals , 2007, 2007 41st Annual Conference on Information Sciences and Systems.

[15]  Sudipta Mahapatra,et al.  An FPGA-Based Implementation of Multi-Alphabet Arithmetic Coding , 2007, IEEE Transactions on Circuits and Systems I: Regular Papers.

[16]  K. P. Karmann,et al.  Moving object recognition using an adaptive background memory , 1990 .

[17]  Jason Schlessman,et al.  Heterogeneous MPSoC Architectures for Embedded Computer Vision , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[18]  Yong Peng,et al.  A New Detection Algorithm of Moving Objects Based on Human Morphology , 2012, 2012 Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[19]  Liang-Gee Chen,et al.  An UVLC encoder architecture for H.26L , 2002, 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353).

[20]  Andrew Hunter,et al.  A single-chip FPGA implementation of real-time adaptive background model , 2005, Proceedings. 2005 IEEE International Conference on Field-Programmable Technology, 2005..

[21]  Marco Platzner,et al.  A self-adaptive heterogeneous multi-core architecture for embedded real-time video object tracking , 2011, Journal of Real-Time Image Processing.

[22]  Dar-Shyang Lee,et al.  Effective Gaussian mixture learning for video background subtraction , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Qi Tian,et al.  Statistical modeling of complex backgrounds for foreground object detection , 2004, IEEE Transactions on Image Processing.

[24]  Eric Dubois,et al.  Fast and reliable structure-oriented video noise estimation , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Martin Sweeting,et al.  A New High-Level Reconfigurable Lossless Image Compression System for Space Applications , 2008, 2008 NASA/ESA Conference on Adaptive Hardware and Systems.

[26]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Raimundo Carlos Silvério Freire,et al.  FPGA architecture for static background subtraction in real time , 2006, SBCCI '06.

[28]  Aishy Amer Memory-based spatio-temporal real-time object segmentation for video surveillance , 2003, IS&T/SPIE Electronic Imaging.

[29]  Sankar K. Pal,et al.  Granulation, rough entropy and spatiotemporal moving object detection , 2013, Appl. Soft Comput..

[30]  Viktor Öwall,et al.  A Hardware Architecture for Real-Time Video Segmentation Utilizing Memory Reduction Techniques , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[31]  Paul L. Rosin Thresholding for change detection , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[32]  Massimo Piccardi,et al.  Background subtraction techniques: a review , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[33]  Chandrika Kamath,et al.  Robust techniques for background subtraction in urban traffic video , 2004, IS&T/SPIE Electronic Imaging.

[34]  Jukka Teuhola,et al.  A Compression Method for Clustered Bit-Vectors , 1978, Inf. Process. Lett..

[35]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..