Prosodic cues for perceptual task-oriented Human-H

This paper addresses the question of perceptual detection and prosodic cues analysis of emotional behavior in a spontaneous speech corpus of real Human-Human dialogs. Detecting real emotions should be a clear focus for research on modeling human dialog, as it could help with analyzing the evolution of the dialog. Our aims are to define appropriate emotions for call center services, to validate the presence of emotions via perceptual tests and to find robust cues for emotion detection. Most research has focused mainly on artificial data in which predefined-emotions were simulated by actors. For real-life corpora a set of appropriate emotion labels must be determined. To this purpose, we conducted a perceptual test exploring 2 experimental conditions: with and without the capacity of listening the audio-signal. Perceived emotions reflect the presence of shaded and mixed emotions/attitudes. We report correlations between objective values and both perceived prosodic parameters and emotion labels.

[1]  Klaus R. Scherer,et al.  Acoustic correlates of task load and stress , 2002, INTERSPEECH.

[2]  P. Boersma ACCURATE SHORT-TERM ANALYSIS OF THE FUNDAMENTAL FREQUENCY AND THE HARMONICS-TO-NOISE RATIO OF A SAMPLED SOUND , 1993 .

[3]  Rosalind W. Picard,et al.  Modeling drivers' speech under stress , 2003, Speech Commun..

[4]  Frank Dellaert,et al.  Recognizing emotion in speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[5]  Shrikanth Narayanan,et al.  Recognition of negative emotions from the speech signal , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[6]  K. Fischer,et al.  DESPERATELY SEEKING EMOTIONS OR: ACTORS, WIZARDS, AND HUMAN BEINGS. , 2000 .

[7]  Shrikanth S. Narayanan,et al.  Combining acoustic and language information for emotion recognition , 2002, INTERSPEECH.

[8]  R. Plutchik The psychology and biology of emotion , 1994 .