Automatic Video Object Segmentation Using Volume Growing and Hierarchical Clustering

We introduce an automatic segmentation framework that blends the advantages of color-, texture-, shape-, and motion-based segmentation methods in a computationally feasible way. A spatiotemporal data structure is first constructed for each group of video frames, in which each pixel is assigned a feature vector based on low-level visual information. Then, the smallest homogeneous components, so-called as volumes, are expanded from selected marker points using an adaptive, three-dimensional, centroid-linkage method. Self descriptors that characterize each volume and relational descriptors that capture the mutual properties between pairs of volumes are determined by evaluating the boundary, trajectory, and motion of the volumes. These descriptors are used to measure the similarity between volumes based on which volumes are further grouped into objects. A fine-to-coarse clustering algorithm yields a multiresolution object tree representation as an output of the segmentation.

[1]  大田 友一,et al.  A region-oriented image-analysis system by computer , 1980 .

[2]  F. Porikli IMAGE SIMPLIFICATION BY ROBUST ESTIMATOR BASED RECONSTRUCTION FILTER , 2001 .

[3]  Levent Onural,et al.  Image sequence analysis for emerging interactive multimedia services-the European COST 211 framework , 1998, IEEE Trans. Circuits Syst. Video Technol..

[4]  Gilad Adiv,et al.  Determining Three-Dimensional Motion and Structure from Optical Flow Generated by Several Moving Objects , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Montse Pardàs,et al.  Hierarchical morphological segmentation for image sequence coding , 1994, IEEE Trans. Image Process..

[6]  Azriel Rosenfeld,et al.  Segmentation and Estimation of Image Region Properties through Cooperative Hierarchial Computation , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[7]  Patrick Bouthemy,et al.  Motion segmentation and qualitative dynamic scene analysis from an image sequence , 1993, International Journal of Computer Vision.

[8]  J.K. Aggarwal,et al.  Correspondence processes in dynamic scene analysis , 1981, Proceedings of the IEEE.

[9]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[10]  A. Murat Tekalp,et al.  Video object tracking with feedback of performance measures , 2003, IEEE Trans. Circuits Syst. Video Technol..

[11]  T. Vlachos,et al.  Motion measurement using shape adaptive phase correlation , 2001 .

[12]  Andreas Koschan,et al.  Colour Image Segmentation: A Survey , 1994 .

[13]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[14]  José Crespo,et al.  The flat zone approach: A general low-level region merging segmentation method , 1997, Signal Process..

[15]  David W. Murray,et al.  Scene Segmentation from Visual Motion Using Global Optimization , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Ishwar K. Sethi,et al.  Finding Trajectories of Feature Points in a Monocular Image Sequence , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Takashi Ida,et al.  Self-affine mapping system and its application to object contour extraction , 2000, IEEE Trans. Image Process..

[18]  B. N. Chatterji,et al.  An FFT-based technique for translation, rotation, and scale-invariant image registration , 1996, IEEE Trans. Image Process..

[19]  Josef Bigün,et al.  Spatio-Temporal Robust Motion Estimation and Segmentation , 1995, CAIP.

[20]  Shih-Fu Chang,et al.  Long-term moving object segmentation and tracking using spatio-temporal consistency , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[21]  Patrick Bouthemy,et al.  Region-Based Tracking Using Affine Motion Models in Long Image Sequences , 1994 .

[22]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[23]  Rachid Deriche,et al.  Tracking line segments , 1990, Image Vis. Comput..

[24]  Azriel Rosenfeld,et al.  Some experiments in image segmentation by clustering of local feature values , 1979, Pattern Recognit..

[25]  William B. Thompson,et al.  Detecting moving objects , 1989, International Journal of Computer Vision.

[26]  Emanuele Trucco,et al.  Computer and Robot Vision , 1995 .

[27]  Yao Wang,et al.  Video object segmentation , 2002 .

[28]  Shih-Fu Chang,et al.  A fully automated content-based video search engine supporting spatiotemporal queries , 1998, IEEE Trans. Circuits Syst. Video Technol..

[29]  M. Kunt,et al.  Second-generation image-coding techniques , 1985, Proceedings of the IEEE.

[30]  Michael J. Black Combining Intensity and Motion for Incremental Segmentation and Tracking Over Long Image Sequences , 1992, ECCV.