Automatic Real-Time Video Background Segmentation System

This paper proposes an automatic background seg- mentation system based on mixture background models and lbp contrast. An adaptive per-pixel background model is developed to set the data cost of an image graph. lbp constrast is adopted to capture the contrast information. We verify our algorithm in different video sequences including public video sequences. Experimental results of composite video demonstrate that our proposed algorithm is efficient. I. INTRODUCTION Foreground segmentation is important and has a lot of ap- plications including background substitution and composition, human-computer interaction, compression, object tracking, etc. In this paper, we focus on how to segment foreground objects from monocular video streams. Recently, some automatic video matting methods for natural scenes are proposed using binocular and monocular video seg- mentation methods. Kolmogorov et al. (1) presented binocular stereo video segmentation algorithm to fuse color, contrast and stereo matching information so as to infer layers accurately and efficiently. However, this method requires synchronized stereo input so as to obtain valuable depth information. In contrast, monocular video segmentation is more popular for its convenience and low cost. The authors of (2)-(4) fused dif- ferent cues to achieve the monocular foreground segmentation based on graph cut algorithm. Although satisfactory results are obtained in their experiments, they pay less attention to the problem of shadows that foreground objects cast on the background. In fact, it is often necessary to separate the foreground object from their shadows in video segmentation (6). In this paper, we present an automatic background segmen- tation system under the circumstance of a static background and fixed web-camera. In the dynamic graph cut framework, an adaptive background model which combines the advantages of both Eigenbackground and pixel-based gaussian models is proposed to set the data cost. And the link cost based on lbp contrast is introduced to improve the segmentation spatial smooth. The automatic segmentation system we proposed can automatically generate high-quality and real-time composite video with shadow removal.

[1]  Harry Shum,et al.  Background Cut , 2006, ECCV.

[2]  A. Criminisi,et al.  Bilayer Segmentation of Live Video , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Larry S. Davis,et al.  W4S : A real-time system for detecting and tracking people in 2 D , 1998, eccv 1998.

[5]  Irfan Essa,et al.  Tree - based Clas - sifiers for Bilayer Video Segmentation In CVPR , 2007, CVPR 2007.

[6]  Bohyung Han,et al.  SEQUENTIAL KERNEL DENSITY APPROXIMATION THROUGH MODE PROPAGATION: APPLICATIONS TO BACKGROUND MODELING , 2004 .

[7]  Konrad Schindler,et al.  Smooth Foreground-Background Segmentation for Video Processing , 2006, ACCV.

[8]  Andrew Blake,et al.  Bi-layer segmentation of binocular stereo video , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[9]  P. Kohli,et al.  Efficiently solving dynamic Markov random fields using graph cuts , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[10]  David Salesin,et al.  A Bayesian approach to digital matting , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[11]  Marko Heikkilä,et al.  A texture-based method for modeling the background and detecting moving objects , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Yangsheng Wang,et al.  Monocular video foreground segmentation system , 2008, 2008 19th International Conference on Pattern Recognition.

[13]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[14]  Alex Pentland,et al.  Pfinder: real-time tracking of the human body , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[15]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interaction , 1999, ICVS.

[16]  Irfan A. Essa,et al.  Tree-based Classifiers for Bilayer Video Segmentation , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[18]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.