论文信息 - An auditory feature extraction method based on forward-masking and its application in robust speaker identification and speech recognition

An auditory feature extraction method based on forward-masking and its application in robust speaker identification and speech recognition

1 This work is supported by National Nature Science Funds of China, the project number i Abstract: This article presents a new auditory feature extraction method, which considers the forwardmasking mechanism of auditory nerves and feasible in practice. Two features based on this method are extracted: FMFRC (forward masking firing-rate cepstrum) and FMSRC (forward masking synchronized rate cepstrum). Isolate-word speech recognition and text-dependent speaker identification experiments based on TI46 are conducted. The experiment results show that the new auditory features has comparable performance with MFCC under clean environment but far better noise-resistant property than MFCC in both tasks.

Xihong Wu | Bin Zhen | Zhimin Liu | Huisheng Chi

[1] R. Meddis. Simulation of mechanical to neural transduction in the auditory receptor. , 1986, The Journal of the Acoustical Society of America.

[2] Jr. J.P. Campbell,et al. Speaker recognition: a tutorial , 1997, Proc. IEEE.

[3] B. Delgutte,et al. Speech coding in the auditory nerve: I. Vowel-like sounds. , 1984, The Journal of the Acoustical Society of America.

[4] S. Furui. On the role of spectral transition for speech perception. , 1986, The Journal of the Acoustical Society of America.

[5] R. Patterson,et al. Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform. , 1995, The Journal of the Acoustical Society of America.

[6] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.