Incrementally Detecting Moving Objects in Video with Sparsity and Connectivity

Moving object detection is crucial for cognitive vision-based robot tasks. However, due to noise, dynamic background, variations in illumination, and high frame rate, it is a challenging task to robustly and efficiently detect moving objects in video using the clue of motion. State-of-the-art batch-based methods view a sequence of images as a whole and then model the background and foreground together with the constraints of foreground sparsity and connectivity (smoothness) in a unified framework. But the efficiency of the batch-based methods is very low. State-of-the-art incremental methods model the background by a subspace whose bases are updated frame by frame. However, such incremental methods do not make full use of the foreground sparsity and connectivity. In this paper, we develop an incremental method for detecting moving objects in video. Compared to existing methods, the proposed method not only incrementally models the subspace for background reconstruction but also takes into account the sparsity and connectivity of the foreground. The optimization of the model is very efficient. Experimental results on nine public videos demonstrate that the proposed method is much efficient than the state-of-the-art batch methods and has higher F1-score than the state-of-the-art incremental methods.

[1]  Yi Ma,et al.  Robust principal component analysis? , 2009, JACM.

[2]  Brendon J. Woodford,et al.  Video background modeling: recent approaches, issues and our proposed techniques , 2013, Machine Vision and Applications.

[3]  Alessandro Neri,et al.  Automatic moving object and background separation , 1998, Signal Process..

[4]  Kongqiao Wang,et al.  Distributed Object Detection With Linear SVMs , 2014, IEEE Transactions on Cybernetics.

[5]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[6]  Olga Veksler,et al.  Fast Approximate Energy Minimization via Graph Cuts , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Xiaowei Zhou,et al.  Moving Object Detection by Detecting Contiguous Outliers in the Low-Rank Representation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Erfu Yang,et al.  Visual Attention Model Based Vehicle Target Detection in Synthetic Aperture Radar Images: A Novel Approach , 2015, Cognitive Computation.

[9]  René Vidal,et al.  A closed form solution to robust subspace estimation and clustering , 2011, CVPR 2011.

[10]  Alan Edelman,et al.  The Geometry of Algorithms with Orthogonality Constraints , 1998, SIAM J. Matrix Anal. Appl..

[11]  Larry S. Davis,et al.  W4: Real-Time Surveillance of People and Their Activities , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Jingjing Zhao,et al.  Biologically Motivated Model for Outdoor Scene Classification , 2013, Cognitive Computation.

[13]  T. Xiang Background Subtraction with Dirichlet Process Mixture Models , 2013 .

[14]  Xuelong Li,et al.  Ranking Graph Embedding for Learning to Rerank , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[15]  S. Shankar Sastry,et al.  Generalized principal component analysis (GPCA) , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Yanwei Pang,et al.  Learning Regularized LDA by Clustering , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Wei Zeng,et al.  Model-Based Human Gait Recognition Via Deterministic Learning , 2013, Cognitive Computation.

[18]  Robert D. Nowak,et al.  Online identification and tracking of subspaces from highly incomplete information , 2010, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[19]  Erfu Yang,et al.  A Biologically Inspired Vision-Based Approach for Detecting Multiple Moving Objects in Complex Outdoor Scenes , 2015, Cognitive Computation.

[20]  Paul W. H. Chung,et al.  Landmark-Based Methods for Temporal Alignment of Human Motions , 2014, IEEE Computational Intelligence Magazine.

[21]  Michael J. Black,et al.  A Framework for Robust Subspace Learning , 2003, International Journal of Computer Vision.

[22]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[23]  Xuelong Li,et al.  Scene segmentation based on IPCA for visual surveillance , 2009, Neurocomputing.

[24]  Pietro Perona,et al.  Fast Feature Pyramids for Object Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Tao Xiang,et al.  Background Subtraction with DirichletProcess Mixture Models , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Laura Balzano,et al.  Incremental gradient on the Grassmannian for online foreground and background separation in subsampled video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[28]  Peter Meer,et al.  Conjugate gradient on Grassmann manifolds for robust subspace estimation , 2012, Image Vis. Comput..

[29]  Hongyu Zhao,et al.  Low-Rank Modeling and Its Applications in Image Analysis , 2014, ACM Comput. Surv..

[30]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[31]  Thierry Bouwmans,et al.  Robust PCA via Principal Component Pursuit: A review for a comparative evaluation in video surveillance , 2014, Comput. Vis. Image Underst..

[32]  Yang Gao,et al.  ExoMars Rover PanCam: Autonomous & Computational Intelligence [Application Notes] , 2013, IEEE Computational Intelligence Magazine.

[33]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[34]  Xiaochun Cao,et al.  Robust Foreground Detection Using Smoothness and Arbitrariness Constraints , 2014, ECCV.

[35]  James M. Rehg,et al.  GOSUS: Grassmannian Online Subspace Updates with Structured-Sparsity , 2013, 2013 IEEE International Conference on Computer Vision.

[36]  Victor Vianu,et al.  Invited articles section foreword , 2010, JACM.

[37]  Xuelong Li,et al.  Classifying Discriminative Features for Blur Detection , 2016, IEEE Transactions on Cybernetics.

[38]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Namrata Vaswani,et al.  Recursive sparse recovery in large but correlated noise , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[40]  Qi Tian,et al.  Statistical modeling of complex backgrounds for foreground object detection , 2004, IEEE Transactions on Image Processing.

[41]  Kongqiao Wang,et al.  One-against-all frame differences based hand detection for human and mobile interaction , 2013, Neurocomputing.