Automatic Recognition of Emotional State in Polish Speech

The paper presents the comparison of scores for emotional state automatic recognition tests. The database of Polish emotional speech used during tests includes recordings of six acted emotional states (anger, sadness, happiness, fear, disgust, surprise) and the neutral state of 13 amateur speakers (2118 utterances). The features based on F0, intensity, formants and LPC coefficients were applied in seven chosen classifiers. The highest scores were reached for SVM, ANN and DTC classifiers.

[1]  Roddy Cowie,et al.  Describing the emotional states that are expressed in speech , 2003, Speech Commun..

[2]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[3]  Anna Esposito,et al.  Biometric ID Management and Multimodal Communication, Joint COST 2101 and 2102 International Conference, BioID_MultiComm 2009, Madrid, Spain, September 16-18, 2009. Proceedings , 2009, COST 2101/2102 Conference.

[4]  Wojciech Majewski,et al.  Polish Emotional Speech Database - Recording and Preliminary Validation , 2009, COST 2102 Conference.

[5]  Klaus R. Scherer,et al.  Vocal communication of emotion: A review of research paradigms , 2003, Speech Commun..

[6]  Bin Yang,et al.  The Relevance of Voice Quality Features in Speaker Independent Emotion Recognition , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[7]  Guoyin Wang,et al.  Speech Emotion Recognition Based on Rough Set and SVM , 2006, 2006 5th IEEE International Conference on Cognitive Informatics.

[8]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[9]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[10]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[11]  Astrid Paeschke,et al.  A database of German emotional speech , 2005, INTERSPEECH.

[12]  Oh-Wook Kwon,et al.  EMOTION RECOGNITION BY SPEECH SIGNAL , 2003 .

[13]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[14]  Ian Witten,et al.  Data Mining , 2000 .

[15]  Roddy Cowie,et al.  Emotional speech: Towards a new generation of databases , 2003, Speech Commun..

[16]  Anna Esposito,et al.  Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions , 2009 .

[17]  Piotr Staroniewicz Recognition of Emotional State in Polish Speech - Comparison between Human and Automatic Efficiency , 2009, COST 2101/2102 Conference.