Video Objects Segmentation by Robust Background Modeling

This paper deals with the problem of segmenting a video shot into a background (still) mosaic and one or more foreground moving objects. The method is based on ego-motion compensation and background estimation. In order to be able to cope with sequences where occluding objects persist in the same position for a considerable portion of time, the papers concentrates on robust background estimation method. First the sequence is subdivided in patches that are clustered along the time-line in order to narrow down the number of background candidates. Then the background is grown incrementally by selecting at each step the best continuation of the current background, according to the principles of visual grouping. The method rests on sound principles in all its stages, and only few, intelligible parameters are needed. Experiments with real sequences illustrate the approach.

[1]  P. Rousseeuw,et al.  Wiley Series in Probability and Mathematical Statistics , 2005 .

[2]  Mubarak Shah,et al.  A hierarchical approach to robust background subtraction using color and gradient information , 2002, Workshop on Motion and Video Computing, 2002. Proceedings..

[3]  Guojun Lu,et al.  Segmentation of moving objects in image sequence: A review , 2001 .

[4]  Werner A. Stahel,et al.  Robust Statistics: The Approach Based on Influence Functions , 1987 .

[5]  Emanuele Trucco,et al.  Layered Representation of a Video Shot with Mosaicing , 2002, Pattern Analysis & Applications.

[6]  Larry S. Davis,et al.  W/sup 4/: Who? When? Where? What? A real time system for detecting and tracking people , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[7]  Cormac Herley,et al.  Automatic occlusion removal from minimum number of images , 2005, IEEE International Conference on Image Processing 2005.

[8]  Nikos Paragios,et al.  A MRF-based approach for real-time subway monitoring , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Andrea Fusiello,et al.  Segmentation and tracking of multiple video objects , 2007, Pattern Recognit..

[10]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Graeme A. Jones,et al.  Segmentation of Global Motion using Temporal Probabilistic Classification , 1998, BMVC.

[12]  Alex Pentland,et al.  Pfinder: real-time tracking of the human body , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[13]  Tai-Pang Wu,et al.  Video repairing: inference of foreground and background under severe occlusion , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[14]  Christopher Rasmussen,et al.  Spatiotemporal inpainting for recovering texture maps of partially occluded building facades , 2005, IEEE International Conference on Image Processing 2005.

[15]  Touradj Ebrahimi,et al.  Cast shadow segmentation using invariant color features , 2004, Comput. Vis. Image Underst..

[16]  Liyuan Li,et al.  Integrating intensity and texture differences for robust change detection , 2002, IEEE Trans. Image Process..

[17]  Naoya Ohta,et al.  Accuracy bounds and optimal computation of homography for image mosaicing applications , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[18]  Eli Shechtman,et al.  Space-time video completion , 2004, CVPR 2004.

[19]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[20]  John Law,et al.  Robust Statistics—The Approach Based on Influence Functions , 1986 .

[21]  M. Wertheimer Laws of organization in perceptual forms. , 1938 .

[22]  Andrea Fusiello,et al.  Exemplar-based background model initialization , 2005, VSSN@MM.

[23]  Fernando Pereira,et al.  MPEG-4: Context and objectives , 1997, Signal Process. Image Commun..

[24]  Yee-Hong Yang,et al.  Stationary background generation: An alternative to the difference of two images , 1990, Pattern Recognit..

[25]  R. Brunelli,et al.  A Survey on the Automatic Indexing of Video Data, , 1999, J. Vis. Commun. Image Represent..

[26]  P. Anandan,et al.  Efficient representations of video sequences and their applications , 1996, Signal Process. Image Commun..

[27]  Andrea Fusiello,et al.  Background Initialization in Cluttered Sequences , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[28]  Paul L. Rosin Thresholding for change detection , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[29]  Guillermo Sapiro,et al.  Video inpainting of occluding and occluded objects , 2005, IEEE International Conference on Image Processing 2005.

[30]  Fernando Pereira,et al.  Coding Video Objects with the Emerging MPEG-4 Standard , 1997 .

[31]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.