Video parsing and browsing using compressed data

Parsing video content is an important first step in the video indexing process. This paper presents algorithms to automate the video parsing task, including partitioning a source video into clips and classifying those clips according to camera operations, using compressed video data. We have developed two algorithms and a hybrid approach to partitioning video data compressed according to the JPEG and MPEG standards. The algorithms utilize both the video content encoded in DCT (Discrete Cosine Transform) coefficients and the motion vectors between frames. The hybrid approach integrates the two algorithms and incorporates multi-pass strategies and motion analyses to improve both accuracy and processing speed. Also, we present content-based video browsing tools which utilize the information, particularly about the shot boundaries and key frames, obtained from parsing.

[1]  Arding Hsu,et al.  Image processing on compressed data for large video databases , 1993, MULTIMEDIA '93.

[2]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[3]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[4]  Yihong Gong,et al.  Video parsing using compressed data , 1994, Electronic Imaging.

[5]  Stephen W. Smoliar,et al.  Developing power tools for video indexing and retrieval , 1994, Electronic Imaging.

[6]  RaIf Steinmetz,et al.  Data compression in multimedia computing — standards and systems , 1994 .

[7]  Michael Mills,et al.  A magnifier tool for video data , 1992, CHI.

[8]  Richard L. Baker,et al.  Camera zoom/pan estimation and compensation for video compression , 1991, Electronic Imaging.

[9]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[10]  Ramesh C. Jain,et al.  Knowledge-guided parsing in video databases , 1993, Electronic Imaging.

[11]  Brian C. O'Connor,et al.  Selecting Key Frames of Moving Image Documents: A Digital Environment for Analysis and Navigation. , 1991 .

[12]  Walter Bender,et al.  Salient video stills: content and context preserved , 1993, MULTIMEDIA '93.

[13]  Takafumi Miyatake,et al.  IMPACT: an interactive natural-motion-picture dedicated multimedia authoring system , 1991, CHI.

[14]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1992 .

[15]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.