Detecting Practical Speech Emotion in a Cognitive Task

In this paper we analysis the speech emotions related to cognitive process. An automatic system is established for detecting speech emotions including anxiety, hesitation, confidence and joy. In order to obtain a naturalistic database we use noise to induce negative emotions, sleep deprivation is also used for this purpose. The lack of sleep is an important cause for anxiety. Annotation of emotional speech is then done manually with a self evaluation for the felt emotions in each utterance. Acoustic features are extracted both for valence dimension and arousal dimension including voice quality features. For the recognition algorithm Gaussian Mixture Model is adopted for detecting each type of emotions from neutral speech. Based on the detection of each emotion in the continuous recognition of emotion states an error correcting method is proposed. With the previous emotion states and cognitive performance the detection errors in current emotion state is corrected with empirical probability. Experimental results show that our system can detect "practical" speech emotions related to cognitive process. With the proposed error correcting method the recognition performance is improved compared to the baseline system based on Gaussian Mixture Model. We believe the detection of these "practical" emotions is important for real world applications, especially for helping people cope with negative emotions in cognitive activities.

[1]  K. Truong How does real affect affect affect recognition in speech , 2009 .

[2]  Rafaela R. Campagnoli,et al.  Sustained and transient modulation of performance induced by emotional picture viewing. , 2006, Emotion.

[3]  Chloé Clavel,et al.  Fear-type emotion recognition for future audio-based surveillance systems , 2008, Speech Commun..

[4]  Shogo Muramatsu,et al.  Interval calculation of EM algorithm for GMM parameter estimation , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[5]  Mohan M. Trivedi,et al.  Speech Emotion Analysis: Exploring the Role of Context , 2010, IEEE Transactions on Multimedia.

[6]  Gerhard Rigoll,et al.  Bimodal fusion of emotional data in an automotive environment , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[7]  Fakhri Karray,et al.  Survey on speech emotion recognition: Features, classification schemes, and databases , 2011, Pattern Recognit..

[8]  Chengwei Huang,et al.  A Study on Feature Analysis and Recognition of Practical Speech Emotion: A Study on Feature Analysis and Recognition of Practical Speech Emotion , 2011 .

[9]  Ing-Marie Jonsson,et al.  Automatic recognition of affective cues in the speech of car drivers to allow appropriate responses , 2005, OZCHI.

[10]  L. Pessoa,et al.  Emotion affects action: Midcingulate cortex as a pivotal node of interaction between negative emotion and motor signals , 2010, Cognitive, affective & behavioral neuroscience.