论文信息 - Blind channel estimation based on speech correlation structure

Blind channel estimation based on speech correlation structure

Cepstral mean normalization is the standard technique for channel robustness. Despite its good performance, the effectiveness of cepstral mean normalization (CMN) for short sentences is argued. CMN underlying hypothesis that the speech cepstral mean is constant is not valid for short processing windows. This implies the removal of some phonetic information. In this paper we show that the speech correlation structure may be used to estimate the communication channel and we propose an efficient algorithm to compute this estimate. We argue that the resulting channel estimate is more accurate because the underlying hypothesis is better verified than the original CMN hypothesis. Results for the Kai-Fu Lee phone recognition task on NTIMIT, with acoustic models trained on TIMIT (mismatch conditions), show that our method provides an 8% relative error rate reduction as compared to CMN.

[1] S. Furui,et al. Cepstral analysis technique for automatic speaker verification , 1981 .

[2] Jonathan G. Fiscus,et al. Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[3] Hynek Hermansky,et al. RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[4] Richard M. Stern,et al. Robust speech recognition , 1997 .

[5] Ki Yong Lee,et al. Adaptive filtering for speech enhancement in colored noise , 1997, IEEE Signal Processing Letters.

[6] Lang Tong,et al. Blind channel estimation using the second-order statistics: asymptotic performance and limitations , 1997, IEEE Trans. Signal Process..

[7] Byung-Gook Lee,et al. Adaptive filtering for speech enhancement in colored noise , 1997 .

[8] L. Tong,et al. Multichannel blind identification: from subspace to maximum likelihood methods , 1998, Proc. IEEE.

[9] Hermann Ney,et al. The RWTH Large Vocabulary Speech Recognition System for Spontaneous Speech , 2000, KONVENS.

[10] William H. Press,et al. Numerical recipes in C , 2002 .