NeTra-V: toward an object-based video representation

There is a growing need for new representations of video that allow not only compact storage of data but also content-based functionalities such as search and manipulation of objects. We present here a prototype system, called NeTra-V, that is currently being developed to address some of these content related issues. The system has a two-stage video processing structure: a global feature extraction and clustering stage, and a local feature extraction and object-based representation stage. Key aspects of the system include a new spatio-temporal segmentation and object-tracking scheme, and a hierarchical object-based video representation model. The spatio-temporal segmentation scheme combines the color/texture image segmentation and affine motion estimation techniques. Experimental results show that the proposed approach can handle large motion. The output of the segmentation, the alpha plane as it is referred to in the MPEG-4 terminology, can be used to compute local image properties. This local information forms the low-level content description module in our video representation. Experimental results illustrating spatio- temporal segmentation and tracking are provided.

[1]  Edoardo Ardizzone,et al.  Multifeature image and video content-based storage and retrieval , 1996, Other Conferences.

[2]  Amarnath Gupta,et al.  Virage video engine , 1997, Electronic Imaging.

[3]  Giridharan Iyengar,et al.  VideoBook: an experiment in characterization of video , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[4]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  B. S. Manjunath,et al.  Content-based search of video using color, texture, and motion , 1997, Proceedings of International Conference on Image Processing.

[6]  Henri Sanson Toward a robust parametric identification of motion on regions of arbitrary shape by nonlinear optimization , 1995, Proceedings., International Conference on Image Processing.

[7]  Shih-Fu Chang,et al.  Video object model and segmentation for content-based video indexing , 1997, Proceedings of 1997 IEEE International Symposium on Circuits and Systems. Circuits and Systems in the Information Age ISCAS '97.

[8]  Arun N. Netravali,et al.  Digital Video: An introduction to MPEG-2 , 1996 .

[9]  J. Kittler,et al.  Robust motion analysis , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Josef Bigün,et al.  Spatio-Temporal Robust Motion Estimation and Segmentation , 1995, CAIP.

[11]  Shih-Fu Chang,et al.  Clustering methods for video browsing and annotation , 1996, Electronic Imaging.

[12]  B. S. Manjunath,et al.  Edge flow: A framework of boundary detection and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  James Lee Hafner,et al.  Efficient Color Histogram Indexing for Quadratic Form Distance Functions , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Michael J. Black,et al.  Skin and bones: multi-layer, locally affine, optical flow and regularization with transparency , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  B. S. Manjunath,et al.  Dimensionality reduction using multi-dimensional scaling for content-based retrieval , 1997, Proceedings of International Conference on Image Processing.

[16]  Stephen W. Smoliar,et al.  An integrated system for content-based video retrieval and browsing , 1997, Pattern Recognit..

[17]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[18]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Multimedia Systems.

[19]  Edward H. Adelson,et al.  Spatio-temporal segmentation of video data , 1994, Electronic Imaging.

[20]  Sanjit K. Mitra,et al.  Region-based video coder using edge flow segmentation and hierarchical affine region matching , 1998, Electronic Imaging.

[21]  David Doermann,et al.  Archiving, indexing, and retrieval of video in the compressed domain , 1996, Other Conferences.

[22]  D. Barba,et al.  Spatio-temporal segmentation of image sequences for object-oriented low bit-rate image coding , 1995, Proceedings., International Conference on Image Processing.

[23]  Shmuel Peleg,et al.  A Three-Frame Algorithm for Estimating Two-Component Image Motion , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Jonathan D. Courtney Automatic video indexing via object motion analysis , 1997, Pattern Recognit..

[25]  Andrew Lippman,et al.  Spatio-temporal segmentation based on motion and static segmentation , 1995, Proceedings., International Conference on Image Processing.