Towards automatic emotional state categorization from speech signals

This paper compares the performance of automatic emotional state categorization from speech signals on the Serbian Emotional Speech Corpus (GEES) with the corresponding human performance. We employ a multistage categorization strategy together with sophisticated features. Our study is the first attempt to apply a machine learning technique to GEES, for which only human performance results were available prior to our study. Our investigation indicates that a multistage categorization strategy yields behavior similar to human perception and performance close to that of human listeners.
