Visual-Attention-Based Background Modeling for Detecting Infrequently Moving Objects

Motion is one of the most important cues to separate foreground objects from the background in a video. Using a stationary camera, it is usually assumed that the background is static, while the foreground objects are moving most of the time. However, in practice, the foreground objects may show infrequent motions, such as abandoned objects and sleeping persons. Meanwhile, the background may contain frequent local motions, such as waving trees and/or grass. Such complexities may prevent the existing background subtraction algorithms from correctly identifying the foreground objects. In this paper, we propose a new approach that can detect the foreground objects with frequent and/or infrequent motions. Specifically, we use a visual-attention mechanism to infer a complete background from a subset of frames and then propagate it to the other frames for accurate background subtraction. Furthermore, we develop a feature-matching-based local motion stabilization algorithm to identify frequent local motions in the background for reducing false positives in the detected foreground. The proposed approach is fully unsupervised, without using any supervised learning for object detection and tracking. Extensive experiments on a large number of videos have demonstrated that the proposed approach outperforms the state-of-the-art motion detection and background subtraction methods in comparison.

[1]  Tao Xiang,et al.  Background Subtraction with DirichletProcess Mixture Models , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Laura Balzano,et al.  Incremental gradient on the Grassmannian for online foreground and background separation in subsampled video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Fatih Murat Porikli,et al.  Changedetection.net: A new change detection benchmark dataset , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[4]  Kentaro Toyama,et al.  Wallflower: principles and practice of background maintenance , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[5]  Michael J. Black,et al.  Layered segmentation and optical flow estimation over time , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[7]  Mariano Rivera,et al.  Change detection by probabilistic segmentation from monocular view , 2013, Machine Vision and Applications.

[8]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Yuan Yan Tang,et al.  A Visual-Attention Model Using Earth Mover's Distance-Based Saliency Measurement and Nonlinear Feature Combination , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Zhaoping Li A saliency map in primary visual cortex , 2002, Trends in Cognitive Sciences.

[11]  T. Xiang Background Subtraction with Dirichlet Process Mixture Models , 2013 .

[12]  Hyun Wook Park,et al.  A Disparity-Based Adaptive Multihomography Method for Moving Target Detection Based on Global Motion Compensation , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Jean-Marc Odobez,et al.  Multi-Layer Background Subtraction Based on Color and Texture , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Shaogang Gong,et al.  International Journal of Computer Vision (The original publication is available at www.springerlink.com) Time-Delayed Correlation Analysis for Multi-Camera Activity Understanding , 2009 .

[15]  Hai Jin,et al.  Adaptive Object Tracking by Learning Hybrid Template Online , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Hefeng Wu,et al.  Hierarchical Ensemble of Background Models for PTZ-Based Video Surveillance , 2015, IEEE Transactions on Cybernetics.

[17]  Bin Wang,et al.  A Fast Self-Tuning Background Subtraction Algorithm , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[18]  Junzhou Huang,et al.  Background Subtraction Using Low Rank and Group Sparsity Constraints , 2012, ECCV.

[19]  Antonio Torralba,et al.  Unsupervised Detection of Regions of Interest Using Iterative Link Analysis , 2009, NIPS.

[20]  Allen R. Hanson,et al.  Background modeling using adaptive pixelwise kernel variances in a hybrid feature space , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Matti Pietikäinen,et al.  Modeling pixel process with scale invariant local patterns for background subtraction in complex scenes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Deborah Estrin,et al.  Warping background subtraction , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[24]  ZuWhan Kim Real time object tracking based on dynamic feature grouping with background subtraction , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Gerhard Rigoll,et al.  Background segmentation with feedback: The Pixel-Based Adaptive Segmenter , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[26]  Qi Tian,et al.  Statistical modeling of complex backgrounds for foreground object detection , 2004, IEEE Transactions on Image Processing.

[27]  Atsushi Shimada,et al.  Adaptive Background Modeling for Paused Object Regions , 2010, ACCV Workshops.

[28]  Atsushi Shimada,et al.  Background Modeling Based on Bidirectional Analysis , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  C. Schmid,et al.  Category-Specific Video Summarization , 2014, ECCV.

[30]  Dale Schuurmans,et al.  Real-Time Discriminative Background Subtraction , 2011, IEEE Transactions on Image Processing.

[31]  Shengcai Liao,et al.  Moving Object Detection Revisited: Speed and Robustness , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[32]  Fatih Murat Porikli,et al.  CDnet 2014: An Expanded Change Detection Benchmark Dataset , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[33]  Christof Koch,et al.  Image Signature: Highlighting Sparse Salient Regions , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Yi-Ping Hung,et al.  Left-Luggage Detection from Finite-State-Machine Analysis in Static-Camera Videos , 2014, 2014 22nd International Conference on Pattern Recognition.

[35]  Thierry Bouwmans,et al.  Recent Advanced Statistical Background Modeling for Foreground Detection - A Systematic Survey , 2011 .

[36]  Bin Fang,et al.  A Computational Model for Saliency Maps by Using Local Entropy , 2010, AAAI.

[37]  Lucia Maddalena,et al.  Stopped Object Detection by Learning Foreground Model in Videos , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[38]  Benjamin Höferlin,et al.  Evaluation of background subtraction techniques for video surveillance , 2011, CVPR 2011.

[39]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[40]  Yaser Sheikh,et al.  Bayesian modeling of dynamic scenes for object detection , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Marc Van Droogenbroeck,et al.  ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.

[42]  Rogério Schmidt Feris,et al.  Robust Detection of Abandoned and Removed Objects in Complex Surveillance Videos , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[43]  Liang Lin,et al.  PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures With Edge-Preserving Coherence , 2015, IEEE Transactions on Image Processing.

[44]  Ying Wu,et al.  A unified approach to salient object detection via low rank matrix recovery , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[45]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.

[46]  Antoine Vacavant,et al.  A comprehensive review of background subtraction algorithms evaluated with synthetic and real videos , 2014, Comput. Vis. Image Underst..

[47]  Girish A. Kulkarni,et al.  Abandoned Object Detection via Temporal Consistency Modeling and Back-Tracing Verification for Visual Surveillance , 2018 .

[48]  Esa Rahtu,et al.  A Simple and efficient saliency detector for background subtraction , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[49]  Rui Wang,et al.  Static and Moving Object Detection Using Flux Tensor with Split Gaussian Models , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[50]  Nuno Vasconcelos,et al.  Background subtraction in highly dynamic scenes , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[51]  Lucia Maddalena,et al.  The SOBS algorithm: What are the limits? , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[52]  Yi-Ping Hung,et al.  Abandoned Object Detection via Temporal Consistency Modeling and Back-Tracing Verification for Visual Surveillance , 2015, IEEE Transactions on Information Forensics and Security.

[53]  Yacov Hel-Or,et al.  Foreground detection using spatiotemporal projection kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[54]  Ho Gi Jung,et al.  Mixture of Gaussians-Based Background Subtraction for Bayer-Pattern Image Sequences , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[55]  Jian-Huang Lai,et al.  Complex Background Subtraction by Pursuing Dynamic Spatio-Temporal Models , 2014, IEEE Transactions on Image Processing.