Efficient blind dereverberation framework for automatic speech recognition

A speech signal captured by a distant microphone is generally smeared by reverberation, which severely degrades Automatic Speech Recognition (ASR) performance. In this paper, we propose a novel and practical single channel dereverberation scheme, which utilizes the relationship between speech and reverberation. A dereverberation filter derived by the proposed method is capable of efficiently suppressing late reflections, which constitute a major cause of ASR performance degradation. The proposed algorithm can achieve effective dereverberation with a more reasonable computational complexity than conventional methods. Experimental results reveal a substantial improvement in ASR performance even in severely reverberant environments.

[1]  S.C. Douglas,et al.  Multichannel blind deconvolution and equalization using the natural gradient , 1997, First IEEE Signal Processing Workshop on Signal Processing Advances in Wireless Communications.

[2]  Steven F. Boll A spectral subtraction algorithm for suppression of acoustic noise in speech , 1979, ICASSP.

[3]  Philip C. Woodland,et al.  Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[4]  Rainer Martin,et al.  Spectral Subtraction Based on Minimum Statistics , 2001 .

[5]  Les E. Atlas,et al.  Acoustic diversity for improved speech recognition in reverberant environments , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Tomohiro Nakatani,et al.  Fast estimation of a precise dereverberation filter based on speech harmonicity , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[7]  Brian Kingsbury,et al.  Recognizing reverberant speech with RASTA-PLP , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.