Video compression and retrieval have traditionally been treated as separate problems. We present an object-based video representation that facilitates both compression and retrieval. In typical retrieval applications, a video sequence is subdivided in time into a set of shorter segments, each of which contains similar content. These segments are represented by 2-D representative images called "key-frames", which greatly reduce the amount of data that must be searched. However, key-frames do not describe the motions and actions of objects within a segment. We propose a representation that extends the idea of the key-frame to include what we define as "key-objects": regions within a key-frame that move with similar motion. Key-objects thus allow a retrieval system to present information to users more efficiently and to assist them in browsing and retrieving relevant video content.
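The notion of a key-object as a region of a key-frame that moves with similar motion can be illustrated with a small sketch. The abstract does not specify a segmentation algorithm, so the following is not the authors' method. It is a simple flood fill, assuming the key-frame has already been divided into a grid of blocks and that one motion vector per block is available. The function name `group_by_motion`, the block grid, and the tolerance `tol` are all hypothetical choices made for illustration.

```python
from collections import deque


def group_by_motion(motion, tol=1.0):
    """Label 4-connected blocks whose motion vectors are similar.

    motion: 2-D list of (dx, dy) tuples, one per block of the key-frame
            (an assumed input; the source does not define it).
    tol:    maximum L1 difference between neighbouring motion vectors
            for two blocks to join the same region (assumed parameter).
    Returns a 2-D list of integer labels; each label marks one
    candidate key-object.
    """
    rows, cols = len(motion), len(motion[0])
    labels = [[-1] * cols for _ in range(rows)]
    next_label = 0
    for r in range(rows):
        for c in range(cols):
            if labels[r][c] != -1:
                continue  # block already belongs to a region
            labels[r][c] = next_label
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                dx0, dy0 = motion[y][x]
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < rows and 0 <= nx < cols and labels[ny][nx] == -1:
                        dx1, dy1 = motion[ny][nx]
                        # Compare each block with its neighbour, not with the seed.
                        if abs(dx1 - dx0) + abs(dy1 - dy0) <= tol:
                            labels[ny][nx] = next_label
                            queue.append((ny, nx))
            next_label += 1
    return labels


# Toy key-frame: a static background and a region moving 3 blocks to the right.
toy_motion = [
    [(0, 0), (0, 0), (3, 0)],
    [(0, 0), (3, 0), (3, 0)],
]
toy_labels = group_by_motion(toy_motion)
```

On this toy input the static blocks and the moving blocks receive different labels, giving two candidate key-objects. Because each block is compared with its neighbour rather than with the region seed, slowly varying motion can chain into one region; a practical system would likely fit a parametric motion model per region instead.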