Video shot detection and characterization for video databases

Abstract The organization of video information for video databases requires segmentation of a video into its constituent shots and their subsequent characterization in terms of content and camera work. In this paper, we look at these two steps using compressed video data directly. For shot detection, we suggest a scheme consisting of comparing intensity, row, and column histograms of successive I frames of MPEG video using the chi-square test. For characterization of segmented shots, we address the problem of classifying shot motion into different categories using a set of features derived from motion vectors of P and B frames of MPEG video. The central component of the proposed shot motion characterization scheme is a decision tree classifier built through a process of supervised learning. Experimental results using a variety of videos are presented to demonstrate the effectiveness of performing shot detection and characterization directly on compressed video.

[1]  Lin-Shan Lee,et al.  A Polarization Control System for Satellite Communications with Multiple Uplinks , 1978, IEEE Trans. Commun..

[2]  D. Legall,et al.  MPEG : A video compression standard for multimedia applications , 1991 .

[3]  Ramesh C. Jain,et al.  Indexing in video databases , 1995, Electronic Imaging.

[4]  Nilesh V. Patel,et al.  Statistical approach to scene change detection , 1995, Electronic Imaging.

[5]  Wei-I Hsu,et al.  An algorithm for the general solution of hidden line removal for intersecting solids , 1991, Comput. Graph..

[6]  Hans-Hellmut Nagel,et al.  Formation of an object concept by analysis of systematic time variations in the optically perceptible environment , 1978 .

[7]  Shinji Abe,et al.  Scene retrieval method for video database applications using temporal condition changes , 1989, International Workshop on Industrial Applications of Machine Intelligence and Vision,.

[8]  Yoshinobu Tonomura,et al.  Video browsing using brightness data , 1991, Other Conferences.

[9]  Yukinobu Taniguchi,et al.  Structured Video Computing , 1994, IEEE MultiMedia.

[10]  Ishwar K. Sethi,et al.  Design of multicategory multifeature split decision trees using perceptron learning , 1994, Pattern Recognit..

[11]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[12]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[13]  Yihong Gong,et al.  Video parsing using compressed data , 1994, Electronic Imaging.

[14]  Gregory L. Zick,et al.  Scene decomposition of MPEG-compressed video , 1995, Electronic Imaging.

[15]  I. K. Sethi,et al.  Hierarchical Classifier Design Using Mutual Information , 1982, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Shinji Abe,et al.  Content oriented visual interface using video icons for visual database systems , 1990, J. Vis. Lang. Comput..

[17]  D. C. Coll,et al.  Image Activity Characteristics in Broadcast Television , 1976, IEEE Trans. Commun..

[18]  Philippe Aigrain,et al.  The automatic real-time analysis of film editing and transition effects and its applications , 1994, Comput. Graph..

[19]  Takafumi Miyatake,et al.  IMPACT: an interactive natural-motion-picture dedicated multimedia authoring system , 1991, CHI.

[20]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[21]  Arding Hsu,et al.  Image processing on compressed data for large video databases , 1993, MULTIMEDIA '93.

[22]  Yihong Gong,et al.  Automatic parsing of news video , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[23]  Glorianna Davenport,et al.  Cinematic primitives for multimedia , 1991, IEEE Computer Graphics and Applications.

[24]  D. Arijon,et al.  Grammar of Film Language , 1976 .

[25]  Mourad Cherfaoui,et al.  Temporal segmentation of videos: a new approach , 1995, Electronic Imaging.

[26]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1991, CACM.

[27]  Shih-Fu Chang,et al.  Scene change detection in an MPEG-compressed video sequence , 1995, Electronic Imaging.

[28]  Ramesh C. Jain,et al.  Knowledge-guided parsing in video databases , 1993, Electronic Imaging.