Extraction and analysis of the speech emotion features based on multi-fractal spectrum

A calculating method for the multi-fractal spectrum of speech envelope, frequency wave and pitch contour of speech signal is presented to reflect the change of pitch, energy and frequency more accurately. According to these spectra, 18 novel speech emotion features are proposed, and the clustering performances of these features are analysed. The results of the recognition experiment for seven emotion states demonstrate that the speech emotion features proposed in this paper are effective for the speech emotion recognition, and that they can be used as the complementary features to the acoustics emotion features for the speech emotion recognition.

[1]  Yongzhao Zhan,et al.  Speech Emotion Feature Selection Method Based on Contribution Analysis Algorithm of Neural Network , 2008 .

[2]  Zhihong Zeng,et al.  Audio–Visual Affective Expression Recognition Through Multistream Fused HMM , 2008, IEEE Transactions on Multimedia.

[3]  Vidhyasaharan Sethu,et al.  Empirical mode decomposition based weighted frequency feature for speech-based emotion classification , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Panayiotis G. Georgiou,et al.  Real-time Emotion Detection System using Speech: Multi-modal Fusion of Different Timescale Features , 2007, 2007 IEEE 9th Workshop on Multimedia Signal Processing.

[5]  Yoon Keun Kwak,et al.  Emotional Feature Extraction Based On Phoneme Information for Speech Emotion Recognition , 2007, RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication.

[6]  Gao Hui,et al.  Emotion Classification of Mandarin Speech Based on TEO Nonlinear Features , 2007, Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007).

[7]  N. Ellouze,et al.  Pitch and Energy Contribution in Emotion and Speaking styles Recognition Enhancement , 2006, The Proceedings of the Multiconference on "Computational Engineering in Systems Applications".

[8]  Constantine Kotropoulos,et al.  Emotional speech recognition: Resources, features, and methods , 2006, Speech Commun..

[9]  Zhiping Wang,et al.  Speech Emotional Recognition Using Global and Time Sequence Structure Features with MMD , 2005, ACII.

[10]  Chung-Hsien Wu,et al.  IG-Based Feature Extraction and Compensation for Emotion Recognition from Speech , 2005, ACII.

[11]  Pierre-Yves Oudeyer,et al.  Erratum to: "The production and recognition of emotions in speech: features and algorithms": [Int. J. Hum.-Comput. Stud 59 (2003) 157] , 2005, Int. J. Hum. Comput. Stud..

[12]  Björn W. Schuller,et al.  Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Oudeyer Pierre-Yves,et al.  The production and recognition of emotions in speech: features and algorithms , 2003 .

[14]  Iasonas Kokkinos,et al.  Some advances in nonlinear speech modeling using modulations, fractals, and chaos , 2002, 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628).

[15]  Tiejun Lu,et al.  Study on fractal features of modulation signals , 2001, Science in China Series : Information Sciences.

[16]  Charles Thompson,et al.  Transition to chaos in acoustically driven flows , 1991 .

[17]  Petros Maragos,et al.  Fractal aspects of speech signals: dimension and interpolation , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[18]  Gao Hui,et al.  Emotion classification of mandarin speech based on TEO nonlinear features , 2007, SNPD.

[19]  Cai Lianhong,et al.  Speech emotion recognition using acoustic features , 2006 .

[20]  Cao Peng,et al.  Research and implementation of emotional feature extraction and recognition in speech signal , 2005 .

[21]  Ma Hong-mi Method for Speech Recognition Based on Mixed Parameter of MFCC and Fractal Dimension , 2005 .

[22]  Pierre-Yves Oudeyer,et al.  The production and recognition of emotions in speech: features and algorithms , 2003, Int. J. Hum. Comput. Stud..

[23]  Chengming Quo Multifractal theory and its application to objective measures of speech quality , 2002 .