Information integration for accurate foreground segmentation in complex scenes

The authors propose a hybrid framework that combines frame difference and background subtraction to integrate complementary sources of information for monocular video segmentation. This framework is modelled as an optimisation process of an energy function, which is established on a Markov random field (MRF) and optimised by Gibbs sampling. It provides a way to exploit different kinds of information obtained from frame difference and background subtraction. Central to the proposed method are two facts - that shape prior can be flexibly obtained from frame difference, and shadow removal can be integrated into the framework with a background texture model. The experiments show that this approach reliably and accurately performs on sequences that include different scenarios (indoors, outdoors) and also addresses several canonical segmentation problems, such as camouflage, foreground aperture and so forth.

[1]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Cheng Lu,et al.  Entropy Minimization for Shadow Removal , 2009, International Journal of Computer Vision.

[3]  Jenq-Neng Hwang,et al.  Fast and automatic video object segmentation and tracking for content-based applications , 2002, IEEE Trans. Circuits Syst. Video Technol..

[4]  Larry S. Davis,et al.  W/sup 4/: Who? When? Where? What? A real time system for detecting and tracking people , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[5]  Matteo Matteucci,et al.  A revaluation of frame difference in fast and robust motion detection , 2006, VSSN '06.

[6]  Laurent Bonnaud,et al.  A Human Body Analysis System , 2006, EURASIP J. Adv. Signal Process..

[7]  Venkatesh Saligrama,et al.  Foreground-Adaptive Background Subtraction , 2009, IEEE Signal Processing Letters.

[8]  Edward Jones,et al.  Removal of non-uniform complex and compound shadows from textured surfaces using adaptive directional smoothing and the thin plate model , 2011 .

[9]  Mohan M. Trivedi,et al.  Detecting Moving Shadows: Algorithms and Evaluation , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Yee-Hong Yang,et al.  First Sight: A Human Body Outline Labeling System , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Yang Wang,et al.  A dynamic conditional random field model for foreground and shadow segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Tamás Szirányi,et al.  Bayesian Foreground and Shadow Detection in Uncertain Frame Rate Surveillance Videos , 2008, IEEE Transactions on Image Processing.

[13]  Wen-Kai Tai,et al.  Moving cast shadow detection by exploiting multiple cues , 2008 .

[14]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Yaser Sheikh,et al.  Bayesian modeling of dynamic scenes for object detection , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Geoffrey H. Ball,et al.  ISODATA, A NOVEL METHOD OF DATA ANALYSIS AND PATTERN CLASSIFICATION , 1965 .

[18]  Marko Heikkilä,et al.  A texture-based method for modeling the background and detecting moving objects , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  M. Drahansky,et al.  Real-time Terrain Deformations , 2007, Fourth International Conference on Image and Graphics (ICIG 2007).