Speech Emotion Recognition Based on a Fusion of All-Class and Pairwise-Class Feature Selection

Traditionally in speech emotion recognition, feature selection(FS) is implemented by considering the features from all classes jointly. In this paper, a hybrid system based on all-class FS and pairwise-class FS is proposed to improve speech emotion classification performance. Besides a subset of features obtained from an all-class structure, FS is performed on the available data from each pair of classes. All these subsets are used in their corresponding K-nearest-neighbors(KNN) or Support Vector Machine(SVM) classifiers and the posterior probabilities of the multi-classifiers are fused hierarchically. The experiment results demonstrate that compared with the classical method based on all-class FS and the pairwise method based on pairwise-class FS, the proposed approach achieves 3.2%-8.4% relative improvement on the average F1-measure in speaker-independent emotion recognition.

[1]  A. Mehrabian Communication without words , 1968 .

[2]  S. Demleitner [Communication without words]. , 1997, Pflege aktuell.

[3]  Chung-Hsien Wu,et al.  Emotion recognition using acoustic features and textual content , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[4]  Ioannis Pitas,et al.  Automatic emotional speech classification , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Gaël Richard,et al.  Musical instrument recognition based on class pairwise feature selection , 2004, ISMIR.

[6]  Larry D. Hostetler,et al.  k-nearest-neighbor Bayes-risk estimation , 1975, IEEE Trans. Inf. Theory.

[7]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[8]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[9]  Luís A. Alexandre,et al.  On combining classifiers using sum and product rules , 2001, Pattern Recognit. Lett..

[10]  Björn W. Schuller,et al.  Hidden Markov model-based speech emotion recognition , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[11]  E. M. Wright,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[12]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[13]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[14]  Dustin Boswell,et al.  Introduction to Support Vector Machines , 2002 .

[15]  Ching Y. Suen,et al.  A method of combining multiple classifiers-a neural network approach , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[16]  Guodong Guo,et al.  Pairwise face recognition , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.