Extensive Benchmark and Survey of Modeling Methods for Scene Background Initialization

Scene background initialization is the process by which a method tries to recover the background image of a video without foreground objects in it. Having a clear understanding about which approach is more robust and/or more suited to a given scenario is of great interest to many end users or practitioners. The aim of this paper is to provide an extensive survey of scene background initialization methods as well as a novel benchmarking framework. The proposed framework involves several evaluation metrics and state-of-the-art methods, as well as the largest video data set ever made for this purpose. The data set consists of several camera-captured videos that: 1) span categories focused on various background initialization challenges; 2) are obtained with different cameras of different lengths, frame rates, spatial resolutions, lighting conditions, and levels of compression; and 3) contain indoor and outdoor scenes. The wide variety of our data set prevents our analysis from favoring a certain family of background initialization methods over others. Our evaluation framework allows us to quantitatively identify solved and unsolved issues related to scene background initialization. We also identify scenarios for which state-of-the-art methods systematically fail.

[1]  Manoranjan Paul,et al.  Efficient video coding using optimal compression plane and background modelling , 2012 .

[2]  El-hadi Zahzah,et al.  Matrix and tensor completion algorithms for background model initialization: A comparative evaluation , 2017, Pattern Recognit. Lett..

[3]  Benjamin Höferlin,et al.  Evaluation of background subtraction techniques for video surveillance , 2011, CVPR 2011.

[4]  Soon Ki Jung,et al.  Motion-Aware Graph Regularized RPCA for background modeling of complex scenes , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[5]  Thierry Chateau,et al.  A Benchmark Dataset for Outdoor Foreground/Background Extraction , 2012, ACCV Workshops.

[6]  Allen R. Hanson,et al.  Improvements in Joint Domain-Range Modeling for Background Subtraction , 2012, BMVC.

[7]  Rita Cucchiara,et al.  Fast Background Initialization with Recursive Hadamard Transform , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[8]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[9]  Bob Zhang,et al.  Background modeling methods in video analysis: A review and comparative evaluation , 2016, CAAI Trans. Intell. Technol..

[10]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Yildiray Yalman,et al.  A new color image quality measure based on YUV transformation and PSNR for human vision system , 2013 .

[12]  Lucia Maddalena,et al.  The SOBS algorithm: What are the limits? , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[13]  David Suter,et al.  A Novel Robust Statistical Method for Background Initialization and Visual Surveillance , 2006, ACCV.

[14]  David Salesin,et al.  Interactive digital photomontage , 2004, SIGGRAPH 2004.

[15]  Mubarak Shah,et al.  Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[17]  Brian C. Lovell,et al.  A Low-Complexity Algorithm for Static Background Estimation from Cluttered Image Sequences in Surveillance Contexts , 2013, EURASIP J. Image Video Process..

[18]  Andrea Fusiello,et al.  Exemplar-based background model initialization , 2005, VSSN@MM.

[19]  Paul Rodríguez,et al.  Automatic vehicle counting method based on principal component pursuit background modeling , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[20]  Lucia Maddalena,et al.  Background Model Initialization for Static Cameras , 2014 .

[21]  Yaser Sheikh,et al.  Bayesian modeling of dynamic scenes for object detection , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Mubarak Shah,et al.  A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Marc Van Droogenbroeck,et al.  ViBE: A powerful random technique to estimate the background in video sequences , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[25]  Bohyung Han,et al.  SEQUENTIAL KERNEL DENSITY APPROXIMATION THROUGH MODE PROPAGATION: APPLICATIONS TO BACKGROUND MODELING , 2004 .

[26]  Yufeng Shen,et al.  Background estimation using graph cuts and inpainting , 2010, Graphics Interface.

[27]  Atsushi Shimada,et al.  Statistical Local Difference Pattern for Background Modeling , 2011, IPSJ Trans. Comput. Vis. Appl..

[28]  Fatih Murat Porikli,et al.  A Novel Video Dataset for Change Detection Benchmarking , 2014, IEEE Transactions on Image Processing.

[29]  J.M. Ferryman,et al.  PETS Metrics: On-Line Performance Evaluation Service , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[30]  Kentaro Toyama,et al.  Wallflower: principles and practice of background maintenance , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[31]  Narciso García,et al.  Detection of stationary foreground objects: A survey , 2016, Comput. Vis. Image Underst..

[32]  Thierry Bouwmans,et al.  Background Modeling using Mixture of Gaussians for Foreground Detection - A Survey , 2008 .

[33]  Lucia Maddalena,et al.  Neural Background Subtraction for Pan-Tilt-Zoom Cameras , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[34]  Atsushi Shimada,et al.  Evaluation report of integrated background modeling based on spatio-temporal features , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[35]  Shafiee Mohammad Javad,et al.  Embedded Motion Detection via Neural Response Mixture Background Modeling , 2016 .

[36]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Xianguo Zhang,et al.  Background-Modeling-Based Adaptive Prediction for Surveillance Video Coding , 2014, IEEE Transactions on Image Processing.

[38]  Allen R. Hanson,et al.  Background subtraction: separating the modeling and the inference , 2013, Machine Vision and Applications.

[39]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[40]  Soon Ki Jung,et al.  Spatiotemporal Low-Rank Modeling for Complex Scene Background Initialization , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[41]  Fatih Murat Porikli,et al.  Changedetection.net: A new change detection benchmark dataset , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[42]  Sergio A. Velastin,et al.  Automatic congestion detection system for underground platforms , 2001, Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing. ISIMP 2001 (IEEE Cat. No.01EX489).

[43]  Jen-Hui Chuang,et al.  Learning a Scene Background Model via Classification , 2009, IEEE Transactions on Signal Processing.

[44]  Hans-Peter Seidel,et al.  Background estimation from non-time sequence images , 2008, Graphics Interface.

[45]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.

[46]  Allen R. Hanson,et al.  Background modeling using adaptive pixelwise kernel variances in a hybrid feature space , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[47]  Jianping Fan,et al.  Automatic generation of privacy-protected videos using background estimation , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[48]  Gerhard Rigoll,et al.  Background segmentation with feedback: The Pixel-Based Adaptive Segmenter , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[49]  Atsushi Shimada,et al.  Background initialization based on bidirectional analysis and consensus voting , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[50]  Nizar Bouguila,et al.  A Robust Video Foreground Segmentation by Using Generalized Gaussian Mixture Modeling , 2007, Fourth Canadian Conference on Computer and Robot Vision (CRV '07).

[51]  W. Eric L. Grimson,et al.  Unsupervised Activity Perception by Hierarchical Bayesian Models , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[52]  Mohan M. Trivedi,et al.  Detecting Moving Shadows: Algorithms and Evaluation , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[53]  Scott Cohen,et al.  Background estimation as a labeling problem , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[54]  Thierry Bouwmans,et al.  Traditional and recent approaches in background modeling for foreground detection: An overview , 2014, Comput. Sci. Rev..

[55]  Brendon J. Woodford,et al.  Video background modeling: recent approaches, issues and our proposed techniques , 2013, Machine Vision and Applications.

[56]  Mario Ignacio Chacon Murguia,et al.  Evaluation of the background modeling method Auto-Adaptive Parallel Neural Network Architecture in the SBMnet dataset , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[57]  Simone Palazzo,et al.  An innovative web-based collaborative platform for video annotation , 2014, Multimedia Tools and Applications.

[58]  Zhou Wang,et al.  Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[59]  José María Martínez Sanchez,et al.  Rejection based multipath reconstruction for background estimation in video sequences with stationary objects , 2016, Comput. Vis. Image Underst..

[60]  W. Eric L. Grimson,et al.  Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[61]  Massimo De Gregorio,et al.  Background Modeling by Weightless Neural Networks , 2015, ICIAP Workshops.

[62]  Fahd Bouzaraa,et al.  CNN-based initial background estimation , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[63]  Hui Li,et al.  Scene background estimation based on temporal median filter with Gaussian filtering , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[64]  Thierry Bouwmans,et al.  Recent Advanced Statistical Background Modeling for Foreground Detection - A Systematic Survey , 2011 .

[65]  Pascal Fua,et al.  Multicamera People Tracking with a Probabilistic Occupancy Map , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[66]  Lucia Maddalena,et al.  Scene background initialization: A taxonomy , 2017, Pattern Recognit. Lett..

[67]  Andrea Fusiello,et al.  Patch-Based Background Initialization in Heavily Cluttered Video , 2010, IEEE Transactions on Image Processing.

[68]  Xun Xu,et al.  A Loopy Belief Propagation approach for robust background estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[69]  Norbert Wiener,et al.  Extrapolation, Interpolation, and Smoothing of Stationary Time Series , 1964 .

[70]  Nigel J. B. McFarlane,et al.  Segmentation and tracking of piglets in images , 1995, Machine Vision and Applications.

[71]  Thierry Bouwmans,et al.  Robust PCA via Principal Component Pursuit: A review for a comparative evaluation in video surveillance , 2014, Comput. Vis. Image Underst..

[72]  Thierry Bouwmans,et al.  Comparison of Matrix Completion Algorithms for Background Initialization in Videos , 2015, ICIAP Workshops.

[73]  James M. Keller,et al.  Linguistic summarization of video for fall detection using voxel person and fuzzy logic , 2009, Comput. Vis. Image Underst..

[74]  Rensso Mora Colque,et al.  Progressive Background Image Generation of Surveillance Traffic Videos Based on a Temporal Histogram Ruled by a Reward/Penalty Function , 2011, SIBGRAPI.

[75]  Mark E Hallenbeck,et al.  Extracting Roadway Background Image , 2006 .

[76]  Joonki Paik,et al.  Evolutionary Algorithm-Based Background Generation for Robust Object Detection , 2006, ICIC.

[77]  Marc Van Droogenbroeck,et al.  LaBGen-P: A pixel-level stationary background generation method based on LaBGen , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[78]  Gooman Park Background Initialization by Spatiotemporal Similarity , 2007 .

[79]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[80]  Lucia Maddalena,et al.  Towards Benchmarking Scene Background Initialization , 2015, ICIAP Workshops.

[81]  Fatih Murat Porikli,et al.  CDnet 2014: An Expanded Change Detection Benchmark Dataset , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[82]  Lucia Maddalena,et al.  Extracting a background image by a multi-modal scene background model , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[83]  Jin-Jang Leou,et al.  Background initialization and foreground segmentation for bootstrapping video sequences , 2013, EURASIP J. Image Video Process..