Paper information - Speech emotion recognition based on MFCC

Speech emotion recognition based on MFCC

Emotional speech carries rich information and is widely used in human-computer interaction (HCI). The Mel-frequency scale is based on the characteristics of human hearing and corresponds nonlinearly to frequency in Hz. Mel-frequency cepstral coefficients (MFCC) are a spectral feature derived from the Hz spectrum; they are computed from the nonlinear relationship between Mel frequency and Hz frequency and are widely applied in speech recognition. Because of this nonlinear relationship, however, the accuracy of MFCC decreases as frequency increases. Hence, low MFCCs are usually used and high MFCCs are discarded in applications. This paper analyses this problem and proposes an improved algorithm that amends the nonlinear relationship to improve the accuracy of high MFCCs, which are complementary features to low MFCCs for emotional speech recognition. The experimental results show that the recognition rate of the improved algorithm is higher than that of the classical algorithm, and that the proposed Mid MFCC is effective.
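The loss of resolution at high frequencies can be illustrated with a minimal sketch. It assumes the commonly used conversion mel = 2595 * log10(1 + f / 700), which the abstract does not state and which may differ from the formula used in the paper; the amended mapping proposed in the paper is not given here and is not reproduced. The function names, the 0 to 8000 Hz range, and the 10 bands are hypothetical choices for illustration only.

```python
import math


def hz_to_mel(f_hz):
    # Assumed convention: mel = 2595 * log10(1 + f / 700).
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    # Inverse of the assumed conversion above.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_edges(f_min_hz, f_max_hz, n_bands):
    # Band edges spaced uniformly on the Mel axis, returned in Hz.
    mel_lo = hz_to_mel(f_min_hz)
    mel_hi = hz_to_mel(f_max_hz)
    step = (mel_hi - mel_lo) / n_bands
    return [mel_to_hz(mel_lo + i * step) for i in range(n_bands + 1)]


# Illustrative values only: 10 bands over 0 to 8000 Hz.
edges_hz = mel_band_edges(0.0, 8000.0, 10)
widths_hz = [hi - lo for lo, hi in zip(edges_hz, edges_hz[1:])]

for lo, width in zip(edges_hz, widths_hz):
    print(f"band starting at {lo:8.1f} Hz has width {width:7.1f} Hz")
```

Under this assumed mapping, bands of equal width in Mel grow steadily wider in Hz, so each high-frequency band averages over a larger part of the spectrum. This is consistent with the reduced accuracy of high MFCCs described in the abstract.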

Yang Yong