Speech Emotion Recognition Using Novel HHT-TEO Based Features

Speech emotion recognition is an important problem in the development of human-computer interaction. This paper proposes a series of novel, robust features for speech emotion recognition. These features, derived from the Hilbert-Huang transform (HHT) and the Teager energy operator (TEO), are multi-resolution, self-adaptive and highly discriminative. In the experiments, seven emotional states were selected for recognition; the highest overall recognition rate achieved was 85%, and the classification accuracy for boredom reached 100%. The numerical results indicate that the proposed features are robust and that they substantially improve the performance of speech emotion recognition.
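
As a rough illustration of one building block named above, the sketch below computes the standard discrete-time Teager energy operator, psi[n] = x[n]^2 - x[n-1]*x[n+1]. It assumes the textbook discrete form of the operator; the abstract does not say how the TEO is combined with the HHT, so this is not the paper's feature pipeline. The function name `teager_energy` and the test tone are hypothetical and chosen only for the example.

```python
import math


def teager_energy(x):
    """Discrete Teager energy: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    Illustrative helper (hypothetical name). The output has two fewer
    samples than the input because the first and last samples lack
    one of the neighbours the operator needs.
    """
    if len(x) < 3:
        raise ValueError("need at least 3 samples")
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


# Sanity check on a pure tone A*cos(w*n + phi): the operator output is
# the constant A^2 * sin(w)^2, so it tracks both amplitude and frequency.
A, w, phi = 2.0, 0.3, 0.5
tone = [A * math.cos(w * n + phi) for n in range(200)]
psi = teager_energy(tone)
expected = A ** 2 * math.sin(w) ** 2
```

For a single sinusoid the output is constant, which is why the operator is often applied to narrow-band components rather than to the raw speech signal.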
