Paper information - ROBUST MFCC FEATURE EXTRACTION ALGORITHM USING EFFICIENT ADDITIVE AND CONVOLUTIONAL NOISE REDUCTION PROCEDURES

This paper presents a robust mel-frequency cepstral coefficient (MFCC) feature extraction procedure that combines noise reduction, frame attenuation, and RASTA processing. In the preprocessing stage, a hybrid Hamming–Cosine window is applied. To minimize the effect of additive environmental noise on the speech signal, spectral subtraction based on spectral smoothing is used. A general mel filtering approach is then applied to the noise-reduced signal. To detect speech frames, voice activity detection based on log filter-bank energies is performed, and the log filter-bank magnitudes of noise-only frames are attenuated. To reduce the level of convolutional distortion, RASTA filtering is applied to the log filter-bank energy trajectories. In the final stage, a noise-robust feature vector consisting of 12 mel cepstrum coefficients and the log energy is created. To evaluate the improvement in speech recognition obtained with the proposed front-end, the Aurora 2 and Aurora 3 databases were used together with the HTK speech recognition toolkit. The total improvement relative to the baseline MFCC front-end is 41.14% on Aurora 2 and 45.06% on Aurora 3.
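The RASTA step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the filter coefficients, so the code below assumes the commonly cited RASTA band-pass transfer function, with numerator taps 0.1 * (2, 1, 0, -1, -2) and a single pole at 0.98; the pole value, the function names rasta_filter and rasta_filter_bank, and the causal form (the usual four-frame advance is omitted) are all assumptions made for illustration, not the authors' implementation. Because the numerator taps sum to zero, a constant offset in a log filter-bank energy trajectory, which is how a fixed convolutional channel appears in the log domain, decays toward zero at the output.

```python
def rasta_filter(trajectory, pole=0.98):
    """Causal RASTA-style band-pass filter for one log filter-bank energy trajectory.

    Assumed transfer function (not taken from the paper's abstract):
        H(z) = 0.1 * (2 + z^-1 - z^-3 - 2 z^-4) / (1 - pole * z^-1)
    """
    numer = (0.2, 0.1, 0.0, -0.1, -0.2)  # taps sum to zero, so the DC gain is zero
    out = []
    prev = 0.0  # previous output sample, used by the recursive (pole) part
    for n in range(len(trajectory)):
        fir = 0.0
        for k, b in enumerate(numer):
            if n - k >= 0:  # samples before the start are treated as zero
                fir += b * trajectory[n - k]
        prev = fir + pole * prev
        out.append(prev)
    return out


def rasta_filter_bank(log_energies, pole=0.98):
    """Filter every channel of a frames-by-channels list of log energies."""
    if not log_energies:
        return []
    channels = list(zip(*log_energies))  # transpose to channels-by-frames
    filtered = [rasta_filter(list(ch), pole) for ch in channels]
    return [list(frame) for frame in zip(*filtered)]  # back to frames-by-channels


if __name__ == "__main__":
    # A constant log-energy offset stands in for a fixed channel distortion.
    constant_offset = [5.0] * 1000
    filtered = rasta_filter(constant_offset)
    print(filtered[0], filtered[-1])
```

In this sketch the output starts with a transient and then decays at a rate set by the pole, so a pole closer to 1 suppresses slowly varying distortion more gently but takes longer to settle.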

Damjan Vlaj | Bogomir Horvat | Bojan Kotnik
