Automatic video summarization by using color and utterance information

In recent years, digital video is rapidly becoming important for education, entertainment, and a host of multimedia applications. With the increasing size of video collections, technology is needed to effectively browse a video in a short time without losing any important contents. We propose a system which automatically summarizes video by analyzing the video color information and utterance information. To do so, we use the color histogram of a shot as color information. We discover important intervals having several color change patterns by using the probability model. Furthermore, we extract utterance information by using closed caption. We find the dialogue structure by analyzing the connectivity between utterances. Finally, we integrate them so as to skim meaningful portions in a video.

[1]  John D. Lafferty,et al.  Statistical Models for Text Segmentation , 1999, Machine Learning.

[2]  Michael A. Smith,et al.  Video skimming and characterization through the combination of image and language understanding techniques , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Takeo Kanade,et al.  Video skimming and characterization through the combination of image and language understanding , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[4]  Koji Zettsu,et al.  Semantic Structures for Video Data Indexing , 1998, AMCP.

[5]  HongJiang Zhang,et al.  Automatic video scene extraction by shot grouping , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.