Multi-step linear prediction based speech dereverberation in noisy reverberant environment

A speech signal captured by a distant microphone is generally contaminated by reverberation and background noise, which severely degrade the automatic speech recognition (ASR) performance. In this paper, we first extend a previously proposed single channel dereverberation algorithm to a multi-channel scenario. The method estimates late reflections using multichannel multi-step linear prediction, and then suppresses them in the power spectral domain. Second, we analyze the effect of additive noise on the proposed method and provide one solution to the noisy reverberant environment. Experimental results show that the proposed method achieves good dereverberation in noisy reverberant environments, and can significantly improve the ASR performance to that obtained for a nonreverberant environment.

[1]  Georgios B. Giannakis,et al.  Signal Processing Advances in Wireless and Mobile Communications, Volume 2: Trends in Single- and Multi-User Systems , 2000 .

[2]  Ivan Tashev,et al.  Reverberation Reduction for Better Speech Recognition , 2005 .

[3]  David Gesbert,et al.  Robust blind channel identification and equalization based on multi-step predictors , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Tomohiro Nakatani,et al.  Efficient blind dereverberation framework for automatic speech recognition , 2005, INTERSPEECH.

[5]  Mark J. F. Gales,et al.  Robust continuous speech recognition using parallel model combination , 1996, IEEE Trans. Speech Audio Process..

[6]  Les E. Atlas,et al.  Acoustic diversity for improved speech recognition in reverberant environments , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Tomohiro Nakatani,et al.  Spectral Subtraction Steered by Multi-Step Forward Linear Prediction For Single Channel Speech Dereverberation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[8]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[9]  Juan Carlos,et al.  Review of "Discrete-Time Speech Signal Processing - Principles and Practice", by Thomas Quatieri, Prentice-Hall, 2001 , 2003 .

[10]  Thomas Quatieri,et al.  Discrete-Time Speech Signal Processing: Principles and Practice , 2001 .

[11]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[12]  Masato Miyoshi,et al.  Blind dereverberation algorithm for speech signals based on multi-channel linear prediction , 2005 .

[13]  DeLiang Wang,et al.  A one-microphone algorithm for reverberant speech enhancement , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[14]  Sabine Van Huffel,et al.  SLICOT—A Subroutine Library in Systems and Control Theory , 1999 .