Speaker Independent Emotion Recognition Using HMMs Fusion System with Relative Features

Speaker independent emotion recognition is particularly difficult for the individual differences of acoustic character and culture background. So, relative features obtained by calculating the features change of emotion speech relative to natural speech are adopted to weaken the influence from the individual differences in the paper. Moreover, an improved ranked voting fusion system is proposed to combine the decisions from four hidden Markov model (HMM) classifiers which are based on different feature vectors respectively. The recognition results of the provided algorithm have been compared with the isolated HMMs with absolute features, by Berlin database of emotional speech, and the average recognition rate has reached 78.4% in speaker independent case.

[1]  Jianhua Tao,et al.  Features Importance Analysis for Emotional Speech Classification , 2005, ACII.

[2]  Chen Guanghua,et al.  An improved method of endpoints detection based on energy-frequency-value , 2006, Conference on High Density Microsystem Design and Packaging and Component Failure Analysis, 2006. HDP'06..

[3]  Björn W. Schuller,et al.  Hidden Markov model-based speech emotion recognition , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[4]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Jiucang Hao,et al.  Emotion recognition by speech signals , 2003, INTERSPEECH.

[6]  Rosalind W. Picard Affective Computing , 1997 .

[7]  Björn W. Schuller,et al.  Speaker Independent Speech Emotion Recognition by Ensemble Classification , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[8]  Ioannis Pitas,et al.  Automatic emotional speech classification , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Bernd Kleinjohann,et al.  Fuzzy emotion recognition in natural speech dialogue , 2005, ROMAN 2005. IEEE International Workshop on Robot and Human Interactive Communication, 2005..

[10]  Zhang Xiongwei,et al.  New methods of speech segmentation and enhancement based on fractal dimension , 2000, WCC 2000 - ICSP 2000. 2000 5th International Conference on Signal Processing Proceedings. 16th World Computer Congress 2000.

[11]  Yi Luo,et al.  Speech emotion recognition based on a hybrid of HMM/ANN , 2007 .

[12]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[13]  Albino Nogueiras,et al.  Speech emotion recognition using hidden Markov models , 2001, INTERSPEECH.