Emotion detection using perceptual based speech features

Speech is one of the most popular modalities for emotion recognition. This work uses Mel and Bark scale dependent perceptual auditory features for recognizing seven emotions from Berlin speech corpus. A combination of Mel Frequency Cepstral Coefficients (MFCC's), Perceptual Linear Predictive Cepstrum (PLPC), Mel Frequency Perceptual Linear Predictive Cepstrum (MFPLPC) and Linear predictive coefficients (LPC) are chosen for the task. The role of perceptual based features is analyzed for effective Speech Emotion Recognition (SER). A search for a compact feature vector of perceptual features is carried out in the discrete and continuous emotion category. Neural network classifier is employed for classification. Comparative analysis with literature shows that the proposed algorithm outperforms in terms of recognition accuracy. Promising results are obtained with the mentioned combination of features in discrete emotional category as well as in arousal dimension of the continuous emotion circumflex.

[1]  S. Lalitha,et al.  Speech emotion recognition , 2014, 2014 International Conference on Advances in Electronics Computers and Communications.

[2]  K. V. Krishna Kishore,et al.  Emotion recognition in speech using MFCC and wavelet features , 2013 .

[3]  Marius Dan Zbancioc,et al.  Emotion recognition in Romanian language using LPC features , 2013, 2013 E-Health and Bioengineering Conference (EHB).

[4]  K. Sreenivasa Rao,et al.  Emotion recognition using LP residual at sub-segmental, segmental and supra-segmental levels , 2015, 2015 International Conference on Communication, Information & Computing Technology (ICCICT).

[5]  Eugen Lupu,et al.  Emotions recognition by speechand facial expressions analysis , 2009, 2009 17th European Signal Processing Conference.

[6]  Naomi Harte,et al.  Late integration of features for acoustic emotion recognition , 2013, 21st European Signal Processing Conference (EUSIPCO 2013).

[7]  A. Revathy,et al.  Emotion recognition using continuous density HMM , 2015, 2015 International Conference on Communications and Signal Processing (ICCSP).

[8]  Ning An,et al.  Speech Emotion Recognition Using Fourier Parameters , 2015, IEEE Transactions on Affective Computing.

[9]  K. V. Krishna Kishore,et al.  Emotion recognition in speech using MFCC and wavelet features , 2013, 2013 3rd IEEE International Advance Computing Conference (IACC).

[10]  Adam Pelikant,et al.  Comparison of perceptual features efficiency for automatic identification of emotional states from speech , 2013, 2013 6th International Conference on Human System Interactions (HSI).

[11]  Lijiang Chen,et al.  Multi-level Speech Emotion Recognition Based on HMM and ANN , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[12]  Petr Horak,et al.  The influence of individual prosodic parameters on the perception of emotions in Czech , 2011, Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2011.