Shot genre classification using compressed audio-visual features

This paper proposes a method for classifying shots by genre in MPEG-compressed movies, as a high-level indexing method for audio-visual content. Through statistical analysis of low-level and mid-level audio-visual features in the compressed domain, the proposed method classifies the shots of a movie into a predefined genre set in a way that agrees with subjective judgment. The result can be applied to various content-handling applications such as summarization, navigation, editing, and filtering. A feature set selected by subjective evaluation for each shot genre is fed into a linear machine decision tree classifier, so each shot is classified at very low cost. The experimental results show that most shots in the movies are classified into subjectively accurate genres, and that the dominant shot genre correctly identifies the genre of each movie.
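
The final step, resolving a movie's genre from its dominant shot genre, can be illustrated with a minimal sketch. It assumes that "dominant" means the most frequent per-shot label, counted by number of shots rather than by shot duration. The abstract does not specify this, and the function name and genre labels below are hypothetical.

```python
from collections import Counter


def dominant_genre(shot_genres):
    """Return the most frequent shot genre label, or None if there are no shots.

    Assumption: every shot counts equally; the paper may weight shots
    differently (for example by duration).
    """
    if not shot_genres:
        return None
    counts = Counter(shot_genres)
    # most_common(1) returns a list holding one (label, count) pair.
    return counts.most_common(1)[0][0]


# Hypothetical per-shot labels, as a shot classifier might produce them.
shots = ["action", "dialogue", "action", "action", "romance"]
print(dominant_genre(shots))
```

Ties are resolved in favor of the label seen first, which is the ordering `Counter.most_common` gives for equal counts. A real system would likely need an explicit tie-breaking rule.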
