KNN based emotion recognition system for isolated Marathi speech

This paper gives a comparison of two extracted features namely pitch and formants for emotion recognition from speech. The research shows that various features namely prosodic and spectral have been used for emotion recognition from speech. The database used for recognition purpose was developed on Marathi language using 100 speakers. We have extracted features pitch and formants. Angry, stress, admiration, teasing and shocking have been recognized on the basis of features energy and formants. The classification technique used here is KNearest Neighbor (KNN). The result for formants was about 100% which is comparatively better than that of energy which was 80% of accuracy. Keywords-Database, Emotion recognition, Feature Extraction, Formants, KNN classification , Pitch, Speech signals.

[1]  Julia Sidorova,et al.  Optimization techniques for speech emotion recognition , 2010 .

[2]  Carlos Busso,et al.  Analysis of Emotionally Salient Aspects of Fundamental Frequency for Emotion Detection , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Tirupati Goskula,et al.  Comparison between k-nn and svm method for speech emotion recognition , 2011 .

[4]  A. Bombatkar,et al.  Emotion recognition using Speech Processing Using k-nearest neighbor algorithm , 2014 .

[5]  Hiok Chai Quek,et al.  Speech emotion recognition using auditory cortex , 2007, 2007 IEEE Congress on Evolutionary Computation.

[6]  Li Zhao,et al.  A Study on Emotional Feature Analysis and Recognition in Speech Signal , 2009, 2009 International Conference on Measuring Technology and Mechatronics Automation.