论文信息 - Pixel-to-Model background modeling in crowded scenes

Pixel-to-Model background modeling in crowded scenes

Background modeling is an important step for many video surveillance applications such as object detection and scene understanding. In this paper, we present a novel Pixel-to-Model (P2M) paradigm for background modeling in crowded scenes. In particular, the proposed method models the background with a set of context features for each pixel, which are compressively sensed from local patches. We determine whether a pixel belongs to the background according to the minimum P2M distance, which measures the similarity between the pixel and its background model in the space of compressive local descriptors. Moreover, the background updating utilizes minimum and maximum P2M distances to update the pixel feature descriptors in local and neighboring background models, respectively. We evaluate the proposed approach with foreground detection tasks on real crowded surveillance videos. Experiments results show that the proposed P2M approach outperforms the state-of-the-art methods both in indoor and outdoor crowded scenes.

[1] Fatih Murat Porikli,et al. A Bayesian Approach to Background Modeling , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[2] Lei Zhang,et al. Real-Time Compressive Tracking , 2012, ECCV.

[3] Thomas S. Huang,et al. Robust estimation of foreground in surveillance videos by sparse error estimation , 2008, 2008 19th International Conference on Pattern Recognition.

[4] Qi Tian,et al. Statistical modeling of complex backgrounds for foreground object detection , 2004, IEEE Transactions on Image Processing.

[5] Afshin Dehghan,et al. Improving an Object Detector and Extracting Regions Using Superpixels , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[6] Fuchun Sun,et al. Visual Tracking Using Sparsity Induced Similarity , 2010, 2010 20th International Conference on Pattern Recognition.

[7] P. KaewTrakulPong,et al. An Improved Adaptive Background Mixture Model for Real-time Tracking with Shadow Detection , 2002 .

[8] Eli Shechtman,et al. In defense of Nearest-Neighbor based image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Gerhard Rigoll,et al. Background segmentation with feedback: The Pixel-Based Adaptive Segmenter , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[10] W. Eric L. Grimson,et al. Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[11] Marc Van Droogenbroeck,et al. ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.

[12] Massimo Piccardi,et al. Background subtraction techniques: a review , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[13] Yaser Sheikh,et al. Bayesian modeling of dynamic scenes for object detection , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Ferdinand van der Heijden,et al. Efficient adaptive density estimation per image pixel for the task of background subtraction , 2006, Pattern Recognit. Lett..

[15] Tiejun Huang,et al. Selective Eigenbackground for Background Modeling and Subtraction in Crowded Scenes , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[16] Hong Cheng,et al. Image-to-Class Dynamic Time Warping for 3D hand gesture recognition , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[17] Lu Yang,et al. Sparse representation and learning in visual recognition: Theory and applications , 2013, Signal Process..

[18] Mubarak Shah,et al. Floor Fields for Tracking in High Density Crowd Scenes , 2008, ECCV.

[19] Benjamin Höferlin,et al. Evaluation of background subtraction techniques for video surveillance , 2011, CVPR 2011.

[20] Larry S. Davis,et al. Real-time foreground-background segmentation using codebook model , 2005, Real Time Imaging.

[21] Volkan Cevher,et al. Compressive Sensing for Background Subtraction , 2008, ECCV.