An integrated approach for content-based video object segmentation and retrieval

Object-based video data representations enable new functionalities for content access and manipulation. We present an integrated approach that uses region-based analysis for semantic video object segmentation and retrieval. We first present an active system that combines low-level region segmentation with user input to define and track semantic video objects. The technique is novel in using an integrated feature fusion framework for tracking and segmentation at both the region and object levels. Experimental results and an extensive performance evaluation show that the system compares favorably with existing systems. Building on the segmentation framework, we then present a region-based query system for semantic video objects. The model supports powerful object search, such as spatio-temporal similarity search at multiple levels.
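To make the idea of similarity search at both the region and object levels concrete, the following is a minimal illustrative sketch, not the system described in the paper. It assumes each region carries a mean color and a per-frame centroid track, and that an object is simply a list of regions. The names `Region`, `region_distance`, and `object_distance`, the choice of features, and the weights are all hypothetical.

```python
from dataclasses import dataclass
from math import dist


@dataclass
class Region:
    # Hypothetical region features: a mean color and a per-frame centroid track.
    color: tuple
    centroid_track: list


def region_distance(a, b, w_color=1.0, w_motion=1.0):
    """Region-level distance: weighted sum of color and trajectory differences."""
    color_d = dist(a.color, b.color)
    # Compare trajectories over the frames that both regions share.
    n = min(len(a.centroid_track), len(b.centroid_track))
    pairs = zip(a.centroid_track, b.centroid_track)
    motion_d = sum(dist(p, q) for p, q in pairs) / n if n else 0.0
    return w_color * color_d + w_motion * motion_d


def object_distance(query, candidate):
    """Object-level distance: match each query region to its nearest
    candidate region and average the resulting region distances."""
    best = [min(region_distance(q, c) for c in candidate) for q in query]
    return sum(best) / len(best)


if __name__ == "__main__":
    a = Region((0.0, 0.0, 0.0), [(0.0, 0.0), (1.0, 0.0)])
    b = Region((3.0, 4.0, 0.0), [(0.0, 0.0), (1.0, 0.0)])
    print(region_distance(a, b))         # region-level comparison
    print(object_distance([a, b], [b]))  # object-level comparison
```

In this sketch a query can be evaluated at either level: `region_distance` compares two individual regions, while `object_distance` aggregates region matches into a single score for ranking whole objects. The nearest-region matching here is an assumption chosen for brevity; it ignores the spatial arrangement of regions within an object.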
