Using MPEG-7 for Generic Audiovisual Content Automatic Summarization

This paper proposes and evaluates a fully automatic summarization application for generic audiovisual based on MPEG-7 compliant hierarchical summary descriptions, which allows providing flexibility, low complexity, and interoperability. The novelty of this paper regards the exploitation of a three features, low-level arousal model to generate the summary metadata needed to instantiate MPEG-7 compliant summary descriptions with the advantages this brings in terms of interoperability. Moreover, a novel, solid performance evaluation methodology has been proposed and its application has been performed.

[1]  Alan Hanjalic,et al.  Extracting Moods from Pictures and Sounds , 2006 .

[2]  Fumiko Satoh,et al.  Learning personalized video highlights from detailed MPEG-7 metadata , 2002, Proceedings. International Conference on Image Processing.

[3]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[4]  Alan Hanjalic,et al.  Affective video content representation and modeling , 2005, IEEE Transactions on Multimedia.

[5]  A. Murat Tekalp,et al.  Automatic Soccer Video Analysis and Summarization , 2003, IS&T/SPIE Electronic Imaging.

[6]  A. Hanjalic,et al.  Extracting moods from pictures and sounds: towards truly personalized TV , 2006, IEEE Signal Processing Magazine.