Spotting by Association in News Video
暂无分享,去创建一个
This paper introduces the Spotting by Association method for video analysis, which is a novel method to detect video segments with typical semantics. Video data contains various kinds of information by means of continuous images, natural language, and sound. For use in a Digital Library, it is essential to segment the video data into meaningful pieces. To detect meaningful segments, we should associate data from each modality, including video, language, and sound. For this purpose, we propose a new method for segment spotting by making correspondences between image clues detected by image analysis and language clues created by natural language analysis. As a result, relevant video segments with sufficient information in every modMity are obtained. We applied our method to closed-captioned CNN Headline News. Video segments with important situations, that is a speech, meeting, or visit, are detected fairly well.
[1] Takeo Kanade,et al. Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..
[2] Takeo Kanade,et al. elligent Access Video: formedia Project , 1996 .
[3] George A. Miller,et al. Introduction to WordNet: An On-line Lexical Database , 1990 .
[4] Daniel Dominic Sleator,et al. Parsing English with a Link Grammar , 1995, IWPT.
[5] Alexander G. Hauptmann,et al. Text, Speech, and Vision for Video Segmentation: The InformediaTM Project , 1995 .