Variance-Based Gaussian Kernel Fuzzy Vector Quantization for Emotion Recognition with Short Speech

Automatic recognition of emotion is becoming an important part in the design of process for affect-sensitive human-machine interaction (HMI) systems. This work proposes variance-based Gaussian kernel fuzzy vector quantization (VGKFVQ) method for speech emotion recognition. By non-linear kernel mapping, it mapped the data into the high-dimensional feature space, and made the dissimilarity among different emotions enlarged. VGKFVQ used the clustering centers to form the codebooks, and employed the minimum overall average fuzzy weighted vector quantization error (FWVQE) rule to classify emotions: happiness, anger, neutral and sadness. VGKFVQ used membership to present the ambiguous of an unknown emotion instead of a single hard label compared with non-fuzzy method such as Support Vector Machine (SVM) algorithm and sample variance replaced the dispersion parameter in the Gaussian kernel to realise adaptively adjustment of the parameter. Experimental results show that the recognition rate of this method is higher than SVM method with short speech as well as Fuzzy C-means Clustering Vector Quantization (FVQ) method.

[1]  Jianing Tong,et al.  Speech Emotion Recognition Based on Principal Component Analysis and Back Propagation Neural Network , 2010, 2010 International Conference on Measuring Technology and Mechatronics Automation.

[2]  Maja J. Mataric,et al.  A Framework for Automatic Human Emotion Classification Using Emotion Profiles , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Mark A. Girolami,et al.  Mercer kernel-based clustering in feature space , 2002, IEEE Trans. Neural Networks.

[4]  Shrikanth S. Narayanan,et al.  Emotion classification from speech using evaluator reliability-weighted combination of ranked lists , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Shuxun Wang,et al.  A KERNEL METHOD FOR SPEAKER RECOGNITION WITH LITTLE DATA , 2006, 2006 8th international Conference on Signal Processing.

[6]  WU Zhong-dong A study of a new fuzzy clustering algorithm based on the kernel method , 2004 .

[7]  Gang Wei,et al.  Speech emotion recognition based on HMM and SVM , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[8]  Dat Tran,et al.  A proposed decision rule for speaker recognition based on fuzzy c-means clustering , 1998, ICSLP.

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.