Using content models to build audio-video summaries

The amount of digitized video in archives has grown so large that tools for easier access and content browsing are urgently needed. Moreover, video is no longer a single monolithic piece of data but a collection of smaller building blocks that can be accessed and used independently of their original presentation context. In this paper, we present a content model for audio-video sequences whose purpose is to enable the automatic generation of video summaries. The model is based on descriptors that indicate various properties of, and relations between, audio and video segments. In practice, these descriptors could be generated automatically by analysis methods or produced manually (possibly with computer assistance) by the content provider. We analyze the requirements and characteristics of the different data segments with respect to the summarization problem, and we define our model as a set of constraints that make it possible to produce good-quality summaries.
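To make the idea of descriptors and constraints concrete, the following is a minimal illustrative sketch, not the model defined in the paper. All names here (`Segment`, `importance`, `requires`, `summarize`) are hypothetical, as are the two constraints used: a total duration budget and a "must accompany" relation between segments. The greedy selection strategy is likewise an assumption chosen for brevity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """An audio or video segment with illustrative (hypothetical) descriptors."""
    seg_id: str
    media: str                          # "audio" or "video"
    start: float                        # start time in seconds
    end: float                          # end time in seconds
    importance: float                   # assumed relevance descriptor in [0, 1]
    requires: frozenset = frozenset()   # ids of segments that must accompany this one

    @property
    def duration(self) -> float:
        return self.end - self.start


def summarize(segments, max_duration):
    """Greedily pick segments by importance under two assumed constraints.

    1. The summed duration of the selected segments must not exceed max_duration
       (a simplification: overlapping audio and video are counted separately).
    2. A segment is selected only together with the segments it directly requires
       (the relation is not followed transitively in this sketch).
    """
    by_id = {s.seg_id: s for s in segments}
    chosen = {}
    used = 0.0
    for seg in sorted(segments, key=lambda s: s.importance, reverse=True):
        if seg.seg_id in chosen:
            continue
        # The segment plus any required companions that are not yet selected.
        group = [seg] + [by_id[r] for r in sorted(seg.requires) if r not in chosen]
        cost = sum(g.duration for g in group)
        if used + cost <= max_duration:
            for g in group:
                chosen[g.seg_id] = g
            used += cost
    # Return the summary in presentation order.
    return sorted(chosen.values(), key=lambda s: (s.start, s.seg_id))


# Hypothetical example data: a video shot that needs its audio track,
# plus two further video shots.
example = [
    Segment("v1", "video", 0.0, 10.0, 0.9, frozenset({"a1"})),
    Segment("a1", "audio", 0.0, 10.0, 0.2),
    Segment("v2", "video", 10.0, 30.0, 0.8),
    Segment("v3", "video", 30.0, 35.0, 0.5),
]
print([s.seg_id for s in summarize(example, max_duration=25.0)])
```

In this sketch the descriptors are plain fields on each segment, so they could equally be filled in by an automatic analysis step or by manual annotation. A fuller treatment would likely express the constraints declaratively rather than hard-coding them in the selection loop.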
