论文信息 - Automatic Generation of a Multimedia Encyclopedia from TV Programs by Using Closed Captions and Detecting Principal Video Objects

Automatic Generation of a Multimedia Encyclopedia from TV Programs by Using Closed Captions and Detecting Principal Video Objects

This paper proposes a method for automatically generating a multimedia encyclopedia from video clips using closed-caption text information. The goal is to automatically index each video segment of the television program by the principal video object. We focus on several features of the closed-caption text style in order to identify the principal video objects. Using Quinlan's C4.5 decision-tree learning algorithm and the predicted accuracies of production rule indicators, one object noun is extracted for each video shot. To show the effectiveness of the method, we conducted experiments on the extraction of video segments in which animals appear in twenty television programs on animals and nature. We obtained a precision rate of 74.6 percent and a recall rate of 51.4 percent on the extraction of video segments in which animals appear, and generated a multimedia encyclopedia comprising 322 video clips showing 82 kinds of animals

Nobuyuki Yagi | Hideki Sumiyoshi | Ichiro Yamada | Kikuka Miura

[1] Takeo Kanade,et al. Name-It: Naming and Detecting Faces in Video by the Integration of Image and Natural Language Processing , 1997, IJCAI.

[2] Boon-Lock Yeo,et al. Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[3] Lalitha Agnihotri,et al. Summarization of video programs based on closed captions , 2000, IS&T/SPIE Electronic Imaging.

[4] Yuji Matsumoto,et al. Japanese Dependency Analysis using Cascaded Chunking , 2002, CoNLL.