On Blocking Matrix-Based Dereverberation for Automatic Speech Recognition

We investigate a two-channel reverberation suppression scheme comprising a blocking matrix for estimating late reverberation components by canceling the direct path and early reflections, and spectral enhancement filters for suppressing the late reverberation components. For idealized blocking matrices, we analyze the influence of the length of the blocking matrix filters and the impact of the coherence between the microphones on the resulting estimate. We show that blocking matrices that cancel more than the direct path of the desired signal can be of advantage for robust speech recognition in highly reverberant environments.

[1]  Marc Delcroix,et al.  Precise Dereverberation Using Multichannel Linear Prediction , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Walter Kellermann,et al.  BSS for improved interference estimation for Blind speech signal Extraction with two microphones , 2009, 2009 3rd IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[3]  R. Maas,et al.  Towards a Better Understanding of the Effect of Reverberation on Speech Recognition Performance , 2010 .

[4]  Jon Barker,et al.  An audio-visual corpus for speech perception and automatic speech recognition. , 2006, The Journal of the Acoustical Society of America.

[5]  Eap Emanuël Habets Single- and multi-microphone speech dereverberation using spectral enhancement , 2007 .

[6]  Walter Kellermann,et al.  A two-channel reverberation suppression scheme based on blind signal separation and wiener filtering , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Ning Ma,et al.  The CHiME corpus: a resource and a challenge for computational hearing in multisource environments , 2010, INTERSPEECH.

[8]  Masato Miyoshi,et al.  Inverse filtering of room acoustics , 1988, IEEE Trans. Acoust. Speech Signal Process..

[9]  Walter Kellermann,et al.  Speech enhancement for binaural hearing aids based on blind source separation , 2010, 2010 4th International Symposium on Communications, Control and Signal Processing (ISCCSP).

[10]  Roland Maas,et al.  AT wo-Channel Acoustic Front-End for Robust Automatic Speech Recognition in Noisy and Reverberant Environments , 2011 .

[11]  Walter Kellermann,et al.  TRINICON: a versatile framework for multichannel blind signal processing , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Emanuel A. P. Habets,et al.  Dual-Microphone Speech Dereverberation using a Reference Signal , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[13]  Alexander I. Rudnicky,et al.  Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[14]  Heinrich Kuttruff,et al.  Room acoustics , 1973 .

[15]  Emanuel A. P. Habets,et al.  On the application of reverberation suppression to robust speech recognition , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).