Robust Foreground Estimation via Structured Gaussian Scale Mixture Modeling

Recovering the background and foreground parts from video frames has important applications in video surveillance. Under the assumption that the background parts are stationary and the foreground are sparse, most of existing methods are based on the framework of robust principal component analysis (RPCA), i.e., modeling the background and foreground parts as a low-rank and sparse matrices, respectively. However, in realistic complex scenarios, the conventional $\ell _{1}$ norm sparse regularizer often fails to well characterize the varying sparsity of the foreground components. How to select the sparsity regularizer parameters adaptively according to the local statistics is critical to the success of the RPCA framework for background subtraction task. In this paper, we propose to model the sparse component with a Gaussian scale mixture (GSM) model. Compared with the conventional $\ell _{1}$ norm, the GSM-based sparse model has the advantages of jointly estimating the variances of the sparse coefficients (and hence the regularization parameters) and the unknown sparse coefficients, leading to significant estimation accuracy improvements. Moreover, considering that the foreground parts are highly structured, a structured extension of the GSM model is further developed. Specifically, the input frame is divided into many homogeneous regions using superpixel segmentation. By characterizing the set of sparse coefficients in each homogeneous region with the same GSM prior, the local dependencies among the sparse coefficients can be effectively exploited, leading to further improvements for background subtraction. Experimental results on several challenging scenarios show that the proposed method performs much better than most of existing background subtraction methods in terms of both performance and speed.

[1]  Marko Heikkilä,et al.  A texture-based method for modeling the background and detecting moving objects , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Ashish Ghosh,et al.  Real-Time Adaptive Histogram Min-Max Bucket (HMMB) Model for Background Subtraction , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  M. West On scale mixtures of normal distributions , 1987 .

[4]  Yi Ma,et al.  Robust principal component analysis? , 2009, JACM.

[5]  Tobias Knopp,et al.  Sensitivity Enhancement in Magnetic Particle Imaging by Background Subtraction , 2016, IEEE Transactions on Medical Imaging.

[6]  Oihana Otaegui,et al.  Adaptive Multicue Background Subtraction for Robust Vehicle Counting and Classification , 2012, IEEE Transactions on Intelligent Transportation Systems.

[7]  Yasuyuki Matsushita,et al.  Fast randomized Singular Value Thresholding for Nuclear Norm Minimization , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[9]  Guangming Shi,et al.  Image Restoration via Simultaneous Sparse Coding: Where Structured Sparsity Meets Gaussian Scale Mixture , 2015, International Journal of Computer Vision.

[10]  Ming Qin,et al.  A Background Basis Selection-Based Foreground Detection Method , 2016, IEEE Transactions on Multimedia.

[11]  Rubén Heras Evangelio,et al.  Splitting Gaussians in Mixture Models , 2012, 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance.

[12]  Xiaochun Cao,et al.  Robust Foreground Detection Using Smoothness and Arbitrariness Constraints , 2014, ECCV.

[13]  Shuicheng Yan,et al.  Online Robust PCA via Stochastic Optimization , 2013, NIPS.

[14]  Gerhard Rigoll,et al.  Background segmentation with feedback: The Pixel-Based Adaptive Segmenter , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[15]  Loong Fah Cheong,et al.  Block-Sparse RPCA for Salient Motion Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Xiaochun Cao,et al.  Motion saliency detection using low-rank and sparse decomposition , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Rui Wang,et al.  Static and Moving Object Detection Using Flux Tensor with Split Gaussian Models , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[18]  Soon Ki Jung,et al.  Online Stochastic Tensor Decomposition for Background Subtraction in Multispectral Video Sequences , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[19]  Mingliang Chen,et al.  Spatiotemporal GMM for Background Subtraction with Superpixel Hierarchy , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Alessandro Rozza,et al.  A Robust Approach for the Background Subtraction Based on Multi-Layered Self-Organizing Maps , 2016, IEEE Transactions on Image Processing.

[21]  Guoying Zhao,et al.  Background Subtraction Based on Low-Rank and Structured Sparse Decomposition , 2015, IEEE Transactions on Image Processing.

[22]  Guillaume-Alexandre Bilodeau,et al.  SuBSENSE: A Universal Change Detection Method With Local Adaptive Sensitivity , 2015, IEEE Transactions on Image Processing.

[23]  Soon Ki Jung,et al.  Improving OR-PCA via smoothed spatially-consistent low-rank modeling for background subtraction , 2017, SAC.

[24]  Tao Xiang,et al.  Background Subtraction with Dirichlet Processes , 2012, ECCV.

[25]  Pan Hui,et al.  Ubii: Physical World Interaction Through Augmented Reality , 2017, IEEE Transactions on Mobile Computing.

[26]  Soon Ki Jung,et al.  Decomposition into Low-rank plus Additive Matrices for Background/Foreground Separation: A Review for a Comparative Evaluation with a Large-Scale Dataset , 2015, Comput. Sci. Rev..

[27]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Pablo A. Parrilo,et al.  Guaranteed Minimum-Rank Solutions of Linear Matrix Equations via Nuclear Norm Minimization , 2007, SIAM Rev..

[29]  Stephen P. Boyd,et al.  Enhancing Sparsity by Reweighted ℓ1 Minimization , 2007, 0711.1612.

[30]  Soon Ki Jung,et al.  Spatiotemporal Low-Rank Modeling for Complex Scene Background Initialization , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[31]  Rama Chellappa,et al.  Entropy rate superpixel segmentation , 2011, CVPR 2011.

[32]  Li Fei-Fei,et al.  Towards total scene understanding: Classification, annotation and segmentation in an automatic framework , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Ping Li,et al.  Online optimization for max-norm regularization , 2014, Machine Learning.

[34]  Wen Gao,et al.  Background Subtraction via generalized fused lasso foreground modeling , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Yuan Xie,et al.  Moving Object Detection Using Tensor-Based Low-Rank and Saliently Fused-Sparse Decomposition , 2017, IEEE Transactions on Image Processing.

[36]  Laura Balzano,et al.  Incremental gradient on the Grassmannian for online foreground and background separation in subsampled video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  Fatih Murat Porikli,et al.  Changedetection.net: A new change detection benchmark dataset , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[38]  Z. Zivkovic Improved adaptive Gaussian mixture model for background subtraction , 2004, ICPR 2004.

[39]  HuangTao,et al.  Mixed Noise Removal via Laplacian Scale Mixture Modeling and Nonlocal Low-Rank Approximation , 2017 .

[40]  Fatih Murat Porikli,et al.  CDnet 2014: An Expanded Change Detection Benchmark Dataset , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[41]  Li Song,et al.  Foreground Estimation Based on Linear Regression Model With Fused Sparsity on Outliers , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[42]  Marc Van Droogenbroeck,et al.  ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.

[43]  Soon Ki Jung,et al.  Background–Foreground Modeling Based on Spatiotemporal Sparse Subspace Clustering , 2017, IEEE Transactions on Image Processing.

[44]  Martin J. Wainwright,et al.  Image denoising using scale mixtures of Gaussians in the wavelet domain , 2003, IEEE Trans. Image Process..

[45]  Dacheng Tao,et al.  Bilateral random projections , 2012, 2012 IEEE International Symposium on Information Theory Proceedings.

[46]  Qi Tian,et al.  Statistical modeling of complex backgrounds for foreground object detection , 2004, IEEE Transactions on Image Processing.

[47]  Jingdong Wang,et al.  A Probabilistic Approach to Robust Matrix Factorization , 2012, ECCV.

[48]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[49]  Hefeng Wu,et al.  Hierarchical Ensemble of Background Models for PTZ-Based Video Surveillance , 2015, IEEE Transactions on Cybernetics.

[50]  Andrzej Cichocki,et al.  Total Variation Regularized Tensor RPCA for Background Subtraction From Compressive Measurements , 2015, IEEE Transactions on Image Processing.

[51]  Xiaochun Cao,et al.  Total Variation Regularized RPCA for Irregularly Moving Object Detection Under Dynamic Background , 2016, IEEE Transactions on Cybernetics.

[52]  Lucia Maddalena,et al.  The SOBS algorithm: What are the limits? , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[53]  Xiaowei Zhou,et al.  Moving Object Detection by Detecting Contiguous Outliers in the Low-Rank Representation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[54]  Guoying Zhao,et al.  Background Subtraction Using Spatio-Temporal Group Sparsity Recovery , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[55]  Nicoletta Noceti,et al.  Online Space-Variant Background Modeling With Sparse Coding , 2015, IEEE Transactions on Image Processing.

[56]  Qinghua Hu,et al.  Efficient Background Modeling Based on Sparse Representation and Outlier Iterative Removal , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[57]  Bruno A. Olshausen,et al.  Group Sparse Coding with a Laplacian Scale Mixture Prior , 2010, NIPS.

[58]  Kun Li,et al.  Foreground–Background Separation From Video Clips via Motion-Assisted Matrix Restoration , 2015, IEEE Transactions on Circuits and Systems for Video Technology.