Comparison between k-nn and svm method for speech emotion recognition

Human Computer intelligent interaction (HCII) is an emerging field of science aimed at providing natural ways for humans to use computer as aids. Machine intelligence needs to include emotional intelligence it is argued that for the computer to be able to interact with humans, it needs to have the communication skills of human. One of these skills is the ability to understand the emotional state of the person. Two recognition methods namely K-Nearest Neighbor (K-NN) and Support vector machine (SVM) classifier have been experimented and compared. The paper explores the simplicity and effectiveness of SVM classifier for designing the real-time emotion recognition system.

[1]  Li Zhao,et al.  A method combining the global and time series structure features for emotion recognition in speech , 2003, International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003.

[2]  K. Scherer,et al.  Acoustic profiles in vocal emotion expression. , 1996, Journal of personality and social psychology.

[3]  Ling Guan,et al.  An investigation of speech-based human emotion recognition , 2004, IEEE 6th Workshop on Multimedia Signal Processing, 2004..

[4]  J. Lyons Coding facial expression with Gabor wavelets , 1998 .

[5]  Shrikanth S. Narayanan,et al.  Toward detecting emotions in spoken dialogs , 2005, IEEE Transactions on Speech and Audio Processing.

[6]  Björn W. Schuller,et al.  Towards More Reality in the Recognition of Emotional Speech , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[7]  K.M. Iftekharuddin,et al.  Detection of Emotional Expressions in Speech , 2006, Proceedings of the IEEE SoutheastCon 2006.

[8]  Thomas S. Huang,et al.  Emotional expressions in audiovisual human computer interaction , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[9]  Frank Dellaert,et al.  Recognizing emotion in speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[10]  Tsang-Long Pao,et al.  Mandarin Emotional Speech Recognition Based on SVM and NN , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[11]  Gang Wei,et al.  Speech emotion recognition based on HMM and SVM , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[12]  S. Ramamohan,et al.  Sinusoidal model-based analysis and classification of stressed speech , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Ishwar K. Sethi,et al.  Investigation of combining SVM and decision tree for emotion classification , 2005, Seventh IEEE International Symposium on Multimedia (ISM'05).

[14]  Fakhri Karray,et al.  Speech Emotion Recognition using Gaussian Mixture Vector Autoregressive Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[15]  Zhongzhe Xiao,et al.  Features extraction and selection for emotional speech classification , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..