Foreground/background segmentation with learned dictionary

Video sequences are viewed as a temporal collection of inverse problems. This parallel with the classical inverse problem of denoising brings us to investigate a sparse representation based approach for background subtraction. A global trained dictionary is obtained using a k-means classifier and using the matching pursuit method a set of coefficients is estimated. By linear combination of dictionary vectors (atoms) and the set of coefficients a background estimate is computed for each frame to obtain the foreground-background segmentation. The global dictionary and the coefficients are propagated and updated along the sequence. The approach yields surprisingly preliminary results, encouraging for further investigations on the possible extensions of the algorithm.

[1]  Quming Zhou,et al.  Tracking and Classifying Moving Objects from Video , 2001 .

[2]  Berna Erol,et al.  A Bayesian framework for Gaussian mixture background modeling , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[3]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Emmanuel J. Candès,et al.  Decoding by linear programming , 2005, IEEE Transactions on Information Theory.

[5]  Michael Elad,et al.  Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[6]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.

[7]  Jitendra Malik,et al.  Towards robust automatic traffic scene analysis in real-time , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[8]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Haifeng Chen,et al.  Robust Computer Vision through Kernel Density Estimation , 2002, ECCV.

[12]  Peter H. N. de With,et al.  Background Estimation and Adaptation Model with Light-Change Removal for Heavily Down-Sampled Video Surveillance Signals , 2006, 2006 International Conference on Image Processing.

[13]  Nikos Paragios,et al.  Motion-based background subtraction using adaptive kernel density estimation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[14]  Thrasyvoulos N. Pappas,et al.  Spatiotemporal Algorithm for Background Subtraction , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[15]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[16]  Volkan Cevher,et al.  Compressive Sensing for Background Subtraction , 2008, ECCV.