Automatic moving object extraction toward compact video representation

An automatic object-oriented video segmentation and representation algorithm is proposed, where the local variance contrast and the frame difference contrast are jointly exploited for meaningful moving object extraction because these two visual features can indicate the spatial homogeneity of the gray levels and the temporal coherence of the motion fields efficiently. The 2-D entropic thresholding technique and the watershed transformation method are further developed to determine the global feature thresholds adaptively according to the variation of the video components. The obtained video components are first represented by a group of 4x4 blocks coarsely, and then the meaningful moving objects are generated by an iterative region-merging procedure according to the spatiotemporal similarity measure. The temporal tracking procedure is further proposed to obtain more semantic moving objects and to establish the correspondence of the moving objects among frames. Therefore, the proposed automatic moving object extraction algorithm can detect the appearance of new objects as well as the disappearance of existing objects efficiently because the correspondence of the video objects among frames is also established. Moreover, an object-oriented video representation and indexing approach is suggested, where both the operation of the camera (i.e., change of the viewpoint) and the birth or death of the individual objects are exploited to detect the breakpoints of the video data and to select the key frames adaptively.

[1]  Demin Wang Unsupervised video segmentation based on watersheds and temporal tracking , 1998, IEEE Trans. Circuits Syst. Video Technol..

[2]  Andrew K. C. Wong,et al.  A new method for gray-level picture thresholding using the entropy of the histogram , 1985, Comput. Vis. Graph. Image Process..

[3]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[4]  Jun Shen,et al.  Fast computation of moment invariants , 1991, Pattern Recognit..

[5]  A. D. Brink Thresholding of digital images using two-dimensional entropies , 1992, Pattern Recognit..

[6]  R. Kirby,et al.  A Note on the Use of (Gray Level, Local Average Gray Level) Space as an Aid in Threshold Selection. , 1979 .

[7]  Michael Hötter,et al.  Object-oriented analysis-synthesis coding based on moving two-dimensional objects , 1990, Signal Process. Image Commun..

[8]  Ahmed S. Abutableb Automatic thresholding of gray-level pictures using two-dimensional entropy , 1989 .

[9]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  King Ngi Ngan,et al.  Automatic segmentation of moving objects for video object plane generation , 1998, IEEE Trans. Circuits Syst. Video Technol..

[11]  Rong Wang,et al.  Image sequence segmentation based on 2D temporal entropic thresholding , 1996, Pattern Recognit. Lett..

[12]  Til Aach,et al.  Statistical model-based change detection in moving video , 1993, Signal Process..

[13]  Jonathan D. Courtney Automatic video indexing via object motion analysis , 1997, Pattern Recognit..

[14]  Roland Mech,et al.  A noise robust method for 2D shape estimation of moving objects in video sequences considering a moving camera , 1998, Signal Process..

[15]  Jie Chen,et al.  Affine curve moment invariants for shape recognition , 1997, Pattern Recognit..

[16]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[17]  Chaur-Chin Chen Improved moment invariants for shape discrimination , 1993, Pattern Recognit..

[18]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Rong Wang,et al.  Adaptive image sequence coding based on global and local compensability analysis , 1996 .

[20]  Sanjit K. Mitra,et al.  Image Representation Using Block Pattern Models and Its Image Processing Applications , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Fuxi Gan,et al.  Motion estimation based on global and local uncompensability analysis , 1997, IEEE Trans. Image Process..

[22]  Chaur-Chin Chen,et al.  Improved moment invariants for shape discrimination , 1993, Optics & Photonics.

[23]  Stephen W. Smoliar,et al.  An integrated system for content-based video retrieval and browsing , 1997, Pattern Recognit..

[24]  Luis Torres,et al.  Region-based video coding using mathematical morphology , 1995 .

[25]  Thierry Pun,et al.  A new method for grey-level picture thresholding using the entropy of the histogram , 1980 .

[26]  Nariman Farvardin,et al.  A perceptually motivated three-component image model-Part I: description of the model , 1995, IEEE Trans. Image Process..

[27]  Jörn Ostermann,et al.  Object-oriented analysis-synthesis coding of moving images , 1989, Signal Process. Image Commun..

[28]  M. Hötter,et al.  Image segmentation based on object oriented mapping parameter estimation , 1988 .

[29]  Azriel Rosenfeld,et al.  A Threshold Selection Technique , 1974, IEEE Transactions on Computers.

[30]  Ciro Cafforio,et al.  Methods for measuring small displacements of television images , 1976, IEEE Trans. Inf. Theory.

[31]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[32]  Norbert Diehl,et al.  Object-oriented motion estimation and segmentation in image sequences , 1991, Signal Process. Image Commun..

[33]  Jan Flusser,et al.  Pattern recognition by affine moment invariants , 1993, Pattern Recognit..

[34]  Luc Vincent,et al.  Watersheds in Digital Spaces: An Efficient Algorithm Based on Immersion Simulations , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Steven D. Blostein,et al.  Motion-based object segmentation and estimation using the MDL principle , 1995, IEEE Trans. Image Process..

[36]  David W. Murray,et al.  Scene Segmentation from Visual Motion Using Global Optimization , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  W. B. Thompson,et al.  Combining motion and contrast for segmentation , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Murat Kunt,et al.  Contour simplification and motion compensation for very low bit-rate video coding , 1994, Proceedings of 1st International Conference on Image Processing.

[39]  B. Julesz Textons, the elements of texture perception, and their interactions , 1981, Nature.

[40]  J. Limb,et al.  Measuring the Speed of Moving Objects from Television Signals , 1975, IEEE Trans. Commun..

[41]  Cheng-Hong Yang,et al.  Motion-based video segmentation using continuation method and robust cost functions , 1998, Electronic Imaging.

[42]  Hiroshi Watanabe,et al.  Two-stage motion compensation using adaptive global MC and local affine MC , 1997, IEEE Trans. Circuits Syst. Video Technol..

[43]  Gilad Adiv,et al.  Determining Three-Dimensional Motion and Structure from Optical Flow Generated by Several Moving Objects , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Minoru Asada,et al.  MDL-Based Segmentation and Motion Modeling in a Long Image Sequence of Scene with Multiple Independently Moving Objects , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Chin-Wen Yang,et al.  A fast two-dimensional entropic thresholding algorithm , 1994, Pattern Recognit..

[46]  Joern Ostermann,et al.  Modeling of 3-D moving objects for an analysis-synthesis coder , 1990, Other Conferences.

[47]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[48]  Fuxi Gan,et al.  Spatiotemporal segmentation based on two-dimensional spatiotemporal entropic thresholding , 1997 .

[49]  Horst Bunke,et al.  Simple and fast computation of moments , 1991, Pattern Recognit..

[50]  Yushan Tan,et al.  Automated three-dimensional surface profilometry using dual-frequency optic fiber phase-shifting method , 1997 .

[51]  A. Murat Tekalp,et al.  Object-based indexing of MPEG-4 compressed video , 1997, Electronic Imaging.

[52]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[53]  Jörn Ostermann,et al.  Feedback loop for coder control in a block-based hybrid coder with mesh-based motion compensation , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.