Paper information - Voiced and Unvoiced Content of fear-type emotions in the SAFE Corpus

The present research focuses on the development of a fear detection system for surveillance applications based on acoustic cues. The emotional speech material used for this study comes from the previously collected SAFE Database (Situation Analysis in a Fictional and Emotional Database), which consists of audiovisual sequences extracted from fiction films. We address here the question of a specific detection model based on unvoiced speech. To this end, a set of features is considered for voiced and unvoiced speech. The salience of each feature is evaluated by computing the Fisher Discriminant Ratio for fear versus neutral discrimination. This study confirms that the voiced content, and the prosodic features in particular, are the most relevant. Finally, the detection system merges the information conveyed by both voiced and unvoiced acoustic content to enhance its performance. Fear is recognized with a success rate of 69.5%.
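
The feature-salience step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which form of the Fisher Discriminant Ratio the authors used, so the code below assumes the common two-class, single-feature form, (mean_fear - mean_neutral)^2 / (var_fear + var_neutral). The function names, feature names, and numeric values are hypothetical and are not taken from the SAFE corpus.

```python
from statistics import fmean, pvariance


def fisher_discriminant_ratio(fear_values, neutral_values):
    """Two-class FDR for a single feature (assumed form, not confirmed by the source).

    FDR = (mean_fear - mean_neutral)^2 / (var_fear + var_neutral)
    """
    mean_gap = fmean(fear_values) - fmean(neutral_values)
    spread = pvariance(fear_values) + pvariance(neutral_values)
    if spread == 0:
        raise ValueError("both classes have zero variance; FDR is undefined")
    return mean_gap ** 2 / spread


def rank_features(samples_by_feature):
    """Rank features by decreasing FDR.

    samples_by_feature maps a feature name to a pair
    (fear_values, neutral_values) of per-segment measurements.
    """
    scored = [
        (name, fisher_discriminant_ratio(fear, neutral))
        for name, (fear, neutral) in samples_by_feature.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical toy measurements, for illustration only.
toy_samples = {
    "mean_pitch_hz": ([220.0, 240.0, 260.0], [180.0, 200.0, 220.0]),
    "frame_energy": ([0.5, 0.6, 0.7], [0.5, 0.6, 0.7]),
}

ranking = rank_features(toy_samples)
for feature_name, score in ranking:
    print(f"{feature_name}: FDR = {score:.3f}")
```

In this toy data the pitch feature separates the two classes while the energy feature does not, so pitch ranks first. A higher ratio means the class means are far apart relative to the within-class spread, which is the sense in which a feature is considered salient for fear versus neutral discrimination.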

L. Devillers | I. Vasilescu
