Paper information - Video skimming and characterization through the combination of image and language understanding techniques


Digital video is rapidly becoming important for education, entertainment, and a host of multimedia applications. With the size of video collections growing to thousands of hours, technology is needed to effectively browse segments in a short time without losing the content of the video. We propose a method to extract the significant audio and video information and create a "skim" video which represents a very short synopsis of the original. The goal of this work is to show the utility of integrating language and image understanding techniques for video skimming by extraction of significant information, such as specific objects, audio keywords and relevant video structure. The resulting skim video is much shorter, with compaction as high as 20:1, and yet retains the essential content of the original segment.
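To make the 20:1 compaction figure concrete, the following is a minimal sketch of skim assembly under a duration budget. It is not the authors' method: the `Segment` type, the `score` field (standing in for any significance measure such as keyword or object hits), and the greedy selection rule are all assumptions introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float      # seconds from the beginning of the original video
    duration: float   # seconds
    score: float      # hypothetical significance score (assumption)


def build_skim(segments, compaction=20.0):
    """Greedily keep the highest-scoring segments that fit the budget."""
    total = sum(s.duration for s in segments)
    budget = total / compaction  # e.g. 20:1 means 1/20 of the original length
    chosen, used = [], 0.0
    for seg in sorted(segments, key=lambda s: s.score, reverse=True):
        if used + seg.duration <= budget:
            chosen.append(seg)
            used += seg.duration
    # Play the kept segments in their original temporal order.
    return sorted(chosen, key=lambda s: s.start)


def compaction_ratio(segments, skim):
    """Original duration divided by skim duration."""
    skim_len = sum(s.duration for s in skim)
    if skim_len == 0:
        return float("inf")
    return sum(s.duration for s in segments) / skim_len
```

For example, a 120-second original split into forty 3-second segments yields a 6-second skim at `compaction=20.0`, that is, the two highest-scoring segments played in chronological order.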

Michael A. Smith

[1] Kirk Smallman. Creative film-making, 1969.

[2] Michael Loren Mauldin, et al. Information retrieval by text skimming, 1989.

[3] Richard Mander, et al. Working with audio: integrating personal tape recorders and desktop computers, 1992, CHI '92.

[4] Howard D. Wactlar, et al. Informedia: improving access to digital video, 1994, INTR.

[5] Remi Depommier, et al. Content-based browsing of video sequences, 1994, MULTIMEDIA '94.

[6] Yoshinobu Tonomura, et al. Video tomography: an efficient method for camerawork extraction and motion analysis, 1994, MULTIMEDIA '94.

[7] Boon-Lock Yeo, et al. Video browsing using clustering and scene transitions on compressed sequences, 1995, Electronic Imaging.

[8] David C. Gibbon, et al. Automated authoring of hypermedia documents of video programs, 1995, MULTIMEDIA '95.

[9] Wolfgang Effelsberg, et al. Abstracting Digital Movies Automatically, 1996, J. Vis. Commun. Image Represent.

[10] Takeo Kanade, et al. Intelligent Access to Digital Video: Informedia Project, 1996, Computer.

[11] Howard D. Wactlar, et al. Automated video indexing of very large video libraries, 1997.

[12] Takeo Kanade, et al. Neural Network-Based Face Detection, 1998, IEEE Trans. Pattern Anal. Mach. Intell.