Combining where and what in change detection for unsupervised foreground learning in surveillance

Change detection is the most important task for video surveillance analytics such as foreground and anomaly detection. Current foreground detectors learn models from annotated images since the goal is to generate a robust foreground model able to detect changes in all possible scenarios. Unfortunately, manual labelling is very expensive. Most advanced supervised learning techniques based on generic object detection datasets currently exhibit very poor performance when applied to surveillance datasets because of the unconstrained nature of such environments in terms of types and appearances of objects. In this paper, we take advantage of change detection for training multiple foreground detectors in an unsupervised manner. We use statistical learning techniques which exploit the use of latent parameters for selecting the best foreground model parameters for a given scenario. In essence, the main novelty of our proposed approach is to combine the where (motion segmentation) and what (learning procedure) in change detection in an unsupervised way for improving the specificity and generalization power of foreground detectors at the same time. We propose a framework based on latent support vector machines that, given a noisy initialization based on motion cues, learns the correct position, aspect ratio, and appearance of all moving objects in a particular scene. Specificity is achieved by learning the particular change detections of a given scenario, and generalization is guaranteed since our method can be applied to any possible scene and foreground object, as demonstrated in the experimental results outperforming the state-of-the-art. HighlightsBuild multi-appearance detectors for unknown and uncontrolled sequences in unsupervised manner.Global discriminative optimization based on latent SVM is able to build accurate multi-class detectors without pretrained detectors.From a noisy initialization (motion cues) learn position, scale and appearance of multiple foreground objects.Combine in an unsupervised way where (motion segmentation) and what (learning procedure) in change detection.Handle an unknown number of objects in an unconstrained scenario.

[1]  Vinod Nair,et al.  An unsupervised, online learning framework for moving object detection , 2004, CVPR 2004.

[2]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2007, ICML '07.

[3]  Max Mignotte,et al.  Statistical background subtraction using spatial cues , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Alan Hanjalic,et al.  A framework for unsupervised training of object detectors from unlabeled surveillance video , 2011, J. Ambient Intell. Smart Environ..

[5]  Luc Van Gool,et al.  The 2005 PASCAL Visual Object Classes Challenge , 2005, MLCW.

[6]  Alan L. Yuille,et al.  The Concave-Convex Procedure (CCCP) , 2001, NIPS.

[7]  Tieniu Tan,et al.  Recent developments in human motion analysis , 2003, Pattern Recognit..

[8]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[9]  Mubarak Shah,et al.  Semi-supervised Learning of Feature Hierarchies for Object Detection in a Video , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Kentaro Toyama,et al.  Wallflower: principles and practice of background maintenance , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[11]  Afshin Dehghan,et al.  Improving an Object Detector and Extracting Regions Using Superpixels , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Alan Hanjalic,et al.  Unsupervised and simultaneous training of multiple object detectors from unlabeled surveillance video , 2009, Comput. Vis. Image Underst..

[13]  Bernt Schiele,et al.  Learning people detection models from few training samples , 2011, CVPR 2011.

[14]  Fatih Murat Porikli,et al.  Changedetection.net: A new change detection benchmark dataset , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[15]  Pingkun Yan,et al.  Robust visual tracking with discriminative sparse learning , 2013, Pattern Recognit..

[16]  Yi-Ping Hung,et al.  Efficient hierarchical method for background subtraction , 2007, Pattern Recognit..

[17]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[18]  Simone Calderara,et al.  Visual Tracking: An Experimental Survey , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  François Fleuret,et al.  FlowBoost — Appearance learning from sparsely annotated video , 2011, CVPR 2011.

[20]  Andrew Zisserman,et al.  Structured output regression for detection with partial truncation , 2009, NIPS.

[21]  Kaihua Zhang,et al.  Real-time visual tracking via online weighted multiple instance learning , 2013, Pattern Recognit..

[22]  Alan L. Yuille,et al.  The Concave-Convex Procedure , 2003, Neural Computation.

[23]  Ramakant Nevatia,et al.  Improving Part based Object Detection by Unsupervised, Online Boosting , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Francesc Moreno-Noguer,et al.  Bootstrapping Boosted Random Ferns for discriminative and efficient object classification , 2012, Pattern Recognit..

[25]  Carsten Rother,et al.  Learning discriminative localization from weakly labeled data , 2014, Pattern Recognit..

[26]  David A. Forsyth,et al.  30Hz Object Detection with DPM V5 , 2014, ECCV.

[27]  F. Xavier Roca,et al.  Exploiting multiple cues in motion segmentation based on background subtraction , 2013, Neurocomputing.

[28]  Ferdinand van der Heijden,et al.  Efficient adaptive density estimation per image pixel for the task of background subtraction , 2006, Pattern Recognit. Lett..

[29]  Alessandro Leone,et al.  Shadow detection for moving objects based on texture analysis , 2007, Pattern Recognit..

[30]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[31]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Gang Hua,et al.  Detection by detections: Non-parametric detector adaptation for a video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  F. Xavier Roca,et al.  Toward Real-Time Pedestrian Detection Based on a Deformable Template Model , 2014, IEEE Transactions on Intelligent Transportation Systems.

[34]  Thomas B. Moeslund,et al.  Detection and removal of chromatic moving shadows in surveillance scenarios , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[35]  Ming-Hsuan Yang,et al.  Visual tracking with online Multiple Instance Learning , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  ZissermanAndrew,et al.  The Pascal Visual Object Classes Challenge , 2015 .

[37]  Hélène Laurent,et al.  Comparative study of background subtraction algorithms , 2010, J. Electronic Imaging.

[38]  Zdenek Kalal,et al.  Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Paul A. Viola,et al.  Unsupervised improvement of visual detectors using cotraining , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[40]  Mohan M. Trivedi,et al.  Detecting Moving Shadows: Algorithms and Evaluation , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  David Vázquez,et al.  Learning appearance in virtual scenarios for pedestrian detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[42]  Jordi Gonzàlez,et al.  A coarse-to-fine approach for fast deformable object detection , 2011, CVPR 2011.

[43]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[44]  Cordelia Schmid,et al.  Learning object class detectors from weakly annotated video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[45]  Junjie Yan,et al.  The Fastest Deformable Part Model for Object Detection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  Jiri Matas,et al.  P-N learning: Bootstrapping binary classifiers by structural constraints , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[47]  Mubarak Shah,et al.  Online detection and classification of moving objects using progressively improving detectors , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).