Affective content analysis in comedy and horror videos by audio emotional event detection

We study the problem of affective content analysis. In this paper, we define affective content as those video/audio segments that may provoke strong reactions or particular emotional experiences in an audience, such as laughter or fear. These emotional factors are related to users' attention, evaluation, and memory of the content. How affective effects should be modeled depends on the video genre. In this work, we focus on comedy and horror films and extract affective content by detecting a set of audio emotional events (AEEs), such as laughter and horror sounds. These AEEs can be modeled with various audio processing techniques, and they directly reflect an audience's emotion. We use the AEEs as cues to locate the corresponding video segments, employing some domain knowledge at this stage. Our experimental dataset consists of 40 minutes of comedy video and 40 minutes of horror film. Average recall and precision above 90% are achieved. The results show that, in addition to rich visual information, appropriate use of specific audio cues is an effective way to assist affective content analysis.
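As an illustration only, the step from detected audio events to located segments, and the segment-level recall and precision used for evaluation, might be sketched as follows. This is a minimal sketch under stated assumptions, not the method of the paper: the per-window boolean detections, the function names `merge_events` and `precision_recall`, the `hop` and `max_gap` parameters, and the overlap-based matching rule are all hypothetical choices made for the example.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start_sec, end_sec)


def merge_events(labels: List[bool], hop: float = 1.0,
                 max_gap: float = 2.0) -> List[Segment]:
    """Merge per-window event detections into time segments.

    labels[i] is True when an audio emotional event (e.g. laughter) was
    detected in the i-th analysis window; windows are `hop` seconds long.
    Detections separated by at most `max_gap` seconds are joined.
    (Hypothetical post-processing, assumed for illustration.)
    """
    segments: List[Segment] = []
    for i, hit in enumerate(labels):
        if not hit:
            continue
        start, end = i * hop, (i + 1) * hop
        if segments and start - segments[-1][1] <= max_gap:
            # Close enough to the previous segment: extend it.
            segments[-1] = (segments[-1][0], end)
        else:
            segments.append((start, end))
    return segments


def overlaps(a: Segment, b: Segment) -> bool:
    """True when the two segments share a non-empty time interval."""
    return a[0] < b[1] and b[0] < a[1]


def precision_recall(detected: List[Segment],
                     truth: List[Segment]) -> Tuple[float, float]:
    """Segment-level precision and recall with overlap-based matching."""
    hit_detected = sum(any(overlaps(d, t) for t in truth) for d in detected)
    hit_truth = sum(any(overlaps(t, d) for d in detected) for t in truth)
    precision = hit_detected / len(detected) if detected else 0.0
    recall = hit_truth / len(truth) if truth else 0.0
    return precision, recall


# Toy usage with made-up detections and annotations.
window_labels = [False, True, True, False, False, False, False, True]
found = merge_events(window_labels)
annotated = [(0.5, 2.5), (10.0, 12.0)]
p, r = precision_recall(found, annotated)
```

In this toy run, the two adjacent detections are joined into one segment and the isolated one forms a second segment; the matching rule counts a detected segment as correct if it overlaps any annotated segment, which is one of several reasonable conventions.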
