Video segmentation of MPEG compressed data

An approach for parsing MPEG compressed video into shots and sub-shots based only on the macroblock (MB) and motion vector (MV) information is presented. The system follows a two-pass scheme and has a hybrid rule-based/neural structure. A rough scan over the P frames locates the potential shot boundaries and the solution is then refined by a precise scan over the B frames of the respective neighborhoods. The "simpler" boundaries are recognized by the rule-based module, while the decisions for the "complex" ones are refined by the neural part. The latter is also used to distinguish dissolves from object and camera motions and to break the shots into sub-shots. The experiments demonstrate high speed and accuracy in shot detection and their characterization.

[1]  Edward J. Delp,et al.  A fast algorithm for video parsing using MPEG compressed sequences , 1995, Proceedings., International Conference on Image Processing.

[2]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[3]  Shih-Fu Chang,et al.  Scene change detection in an MPEG-compressed video sequence , 1995, Electronic Imaging.

[4]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[5]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, J. Electronic Imaging.

[6]  Junji Maeda Method for extracting camera operations in order to describe subscenes in video sequences , 1994, Electronic Imaging.

[7]  Nilesh V. Patel,et al.  Video shot detection and characterization for video databases , 1997, Pattern Recognit..