A robust framework for joint background/foreground segmentation of complex video scenes filmed with freely moving camera

This paper explores a robust region-based general framework for discriminating between background and foreground objects within a complex video sequence. The proposed framework works under difficult conditions such as dynamic background and nominally moving camera. The originality of this work lies essentially in our use of the semantic information provided by the regions while simultaneously identifying novel objects (foreground) and non-novel ones (background). The information of background regions is exploited to make moving objects detection more efficient, and vice-versa. In fact, an initial panoramic background is modeled using region-based mosaicing in order to be sufficiently robust to noise from lighting effects and shadowing by foreground objects. After the elimination of the camera movement using motion compensation, the resulting panoramic image should essentially contain the background and the ghost-like traces of the moving objects. Then, while comparing the panoramic image of the background with the individual frames, a simple median-based background subtraction permits a rough identification of foreground objects. Joint background-foreground validation, based on region segmentation, is then used for a further examination of individual foreground pixels intended to eliminate false positives and to localize shadow effects. Thus, we first obtain a foreground mask from a slow-adapting algorithm, and then validate foreground pixels (moving visual objects + shadows) by a simple moving object model built by using both background and foreground regions. The tests realized on various well-known challenging real videos (across a variety of domains) show clearly the robustness of the suggested solution. This solution, which is relatively computationally inexpensive, can be used under difficult conditions such as dynamic background, nominally moving camera and shadows. In addition to the visual evaluation, spatial-based evaluation statistics, given hand-labeled ground truth, has been used as a performance measure of moving visual objects detection.

[1]  Michel Barlaud,et al.  Region-based active contours for video object segmentation with camera compensation , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[2]  Kentaro Toyama,et al.  Wallflower: principles and practice of background maintenance , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  Harpreet S. Sawhney,et al.  Compact Representations of Videos Through Dominant and Multiple Motion Estimation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Gregory D. Hager,et al.  Dynamic Foreground/Background Extraction from Images and Videos using Random Patches , 2006, NIPS.

[5]  Chin-Seng Chua,et al.  Statistical background modeling for non-stationary camera , 2003, Pattern Recognit. Lett..

[6]  Rozenn Dahyot Unsupervised Camera Motion Estimation and Moving Object Detection in Videos , 2006 .

[7]  Andrew Blake,et al.  Bi-layer segmentation of binocular stereo video , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  Michel Barlaud,et al.  Watershed-driven active contours for moving object segmentation , 2005, IEEE International Conference on Image Processing 2005.

[9]  Michael F. Cohen,et al.  Monocular Video Foreground/Background Segmentation by Tracking Spatial-Color Gaussian Mixture Models , 2007, 2007 IEEE Workshop on Motion and Video Computing (WMVC'07).

[10]  Wenyi Zhao,et al.  Flexible Image Blending for Image Mosaicing with Reduced Artifacts , 2006, Int. J. Pattern Recognit. Artif. Intell..

[11]  Chingchun Huang,et al.  Motion-based Background Modeling for Moving Object Detection on Moving Platforms , 2007, 2007 16th International Conference on Computer Communications and Networks.

[12]  Vassilios Morellas,et al.  Robust Foreground Detection In Video Using Pixel Layers , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Andrea Fusiello,et al.  Segmentation and tracking of multiple video objects , 2007, Pattern Recognit..

[15]  Chandrika Kamath,et al.  Robust techniques for background subtraction in urban traffic video , 2004, IS&T/SPIE Electronic Imaging.

[16]  Mohan S. Kankanhalli,et al.  Automatic video logo detection and removal , 2005, Multimedia Systems.

[17]  Michael Harville,et al.  Foreground segmentation using adaptive mixture models in color and depth , 2001, Proceedings IEEE Workshop on Detection and Recognition of Events in Video.

[18]  Feng Wu,et al.  Automatic video segmentation using a novel background model , 2002, 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353).

[19]  Sudeep Sarkar,et al.  Perceptual Organization Based Computational Model for Robust Segmentation of Moving Objects , 2002, Comput. Vis. Image Underst..

[20]  Monica N. Nicolescu,et al.  A Support Vector Data Description Approach for Background Modeling in Videos with Quasi-Stationary Backgrounds , 2008, Int. J. Artif. Intell. Tools.

[21]  Tamás Szirányi,et al.  Markovian Framework for Foreground-Background-Shadow Separation of Real World Video Scenes , 2006, ACCV.

[22]  Steve Mann VideoOrbits: The Projective Geometry Renaissance , 2002 .

[23]  Huijun Di,et al.  Panoramic Background Model under Free Moving Camera , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[24]  Wolfgang Effelsberg,et al.  Video-object segmentation using multi-sprite background subtraction , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[25]  Andrea Fusiello,et al.  Video Objects Segmentation by Robust Background Modeling , 2007, 14th International Conference on Image Analysis and Processing (ICIAP 2007).

[26]  Ig-Jae Kim,et al.  Video Surveillance using Dynamic Configuration of Mutiple Active Cameras , 2006, 2006 International Conference on Image Processing.

[27]  Yo-Sung Ho,et al.  Automatic Video Object Tracking Using a Mosaic-Based Background , 2004, PCM.

[28]  Stan Sclaroff,et al.  Segmenting foreground objects from a dynamic textured background via a robust Kalman filter , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[29]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[30]  Georgios Tziritas,et al.  Adaptive detection and localization of moving objects in image sequences , 1999, Signal Process. Image Commun..

[31]  Luc Vincent,et al.  Watersheds in Digital Spaces: An Efficient Algorithm Based on Immersion Simulations , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Mohammad H. Mahoor,et al.  Fast image blending using watersheds and graph cuts , 2009, Image Vis. Comput..

[33]  Nikos Paragios,et al.  Scene modeling and change detection in dynamic scenes: A subspace approach , 2009, Comput. Vis. Image Underst..

[34]  Jacques Verly,et al.  The State of the Art in Multiple Object Tracking Under Occlusion in Video Sequences , 2003 .

[35]  Kenichi Kanatani,et al.  Extracting Moving Objects from a Moving Camera VideoSequence , 2005 .

[36]  Gérard G. Medioni,et al.  Detecting and tracking moving objects for video surveillance , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[37]  S. Ogilvie,et al.  A NEW HIGH RESOLUTION OPTICAL METHOD FOR OBTAINING THE TOPOGRAPHY OF FRACTURE SURFACES IN ROCKS , 2011 .

[38]  S. R. Subramanya,et al.  Shadow Detection by Combined Photometric Invariants for Improved Foreground Segmentation , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[39]  Ninad Thakoor,et al.  Automatic Extraction and Localization of Multiple Moving Objects with Stereo Camera in Motion , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.

[40]  Patrick Pérez,et al.  Detection and segmentation of moving objects in highly dynamic scenes , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  James Orwell,et al.  Adaptive eigen-backgrounds for object detection , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[42]  Chiou-Ting Hsu,et al.  Mosaics of video sequences with moving objects , 2004, Signal Process. Image Commun..

[43]  Rama Chellappa,et al.  Simultaneous background and foreground modeling for tracking in surveillance video , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[44]  Tiziana D'Orazio,et al.  Moving object segmentation by background subtraction and temporal analysis , 2006, Image Vis. Comput..

[45]  Larry H. Matthies,et al.  Real-time detection of moving objects from moving vehicles using dense stereo and optical flow , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[46]  George K. I. Mann,et al.  Task-driven moving object detection for robots using visual attention , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[47]  Yan Zhang,et al.  Robust Moving Object Detection at Distance in the Visible Spectrum and Beyond Using A Moving Camera , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[48]  B. Frey,et al.  Transformation-Invariant Clustering Using the EM Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[49]  A. Criminisi,et al.  Bilayer Segmentation of Live Video , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[50]  Qian Yu,et al.  Map-Enhanced Detection and Tracking from a Moving Platform with Local and Global Data Association , 2007, 2007 IEEE Workshop on Motion and Video Computing (WMVC'07).

[51]  W. David Pan,et al.  Fast and accurate global motion estimation algorithm using pixel subsampling , 2008, Inf. Sci..

[52]  Xin Li,et al.  Contour-based object tracking with occlusion handling in video acquired using mobile cameras , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Wolfgang Effelsberg,et al.  Robust background estimation for complex video sequences , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[54]  Shireen Elhabian,et al.  Moving Object Detection in Spatial Domain using Background Removal Techniques - State-of-Art , 2008 .

[55]  Faouzi Ghorbel,et al.  FUSION DE L’INFORMATION PAR LA THÉORIE DE L’ÉVIDENCE : Application en Diagnostic du Mélanome , 2004 .

[56]  Stuart C. Schwartz,et al.  A transform domain approach to real-time foreground segmentation in video sequences , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[57]  Paul L. Rosin,et al.  Evaluation of global image thresholding for change detection , 2003, Pattern Recognit. Lett..

[58]  Ezzeddine Zagrouba,et al.  An efficient image-mosaicing method based on multifeature matching , 2009, Machine Vision and Applications.

[59]  Bart De Moor,et al.  Region-Based Statistical Background Modeling for Foreground Object Segmentation , 2006, 2006 International Conference on Image Processing.