An acoustic front-end for interactive TV incorporating multichannel acoustic echo cancellation and blind signal extraction

In this contribution, an acoustic front-end for distant-talking interfaces as developed within the European Union-funded project DICIT (Distant-talking interfaces for Control of Interactive TV) is presented. It comprises state-of-the-art multichannel acoustic echo cancellation and blind source separation-based signal extraction and only requires two microphone signals. The proposed scheme is analyzed and evaluated for different realistic scenarios when a speech recognizer is used as back-end. The results show that the system significantly outperforms simple alternatives, i.e., a two-channel Delay & Sum beamformer for speech signal extraction.

[1]  Jacob Benesty,et al.  Robust extended multidelay filter and double-talk detector for acoustic echo cancellation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Walter Kellermann,et al.  Acoustic Echo Cancellation for Surround Sound using Perceptually Motivated Convergence Enhancement , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[3]  Walter Kellermann,et al.  Strategies for combining acoustic echo cancellation and adaptive beamforming microphone arrays , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Kiyohiro Shikano,et al.  Blind Spatial Subtraction Array for Speech Enhancement in Noisy Environment , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Akihiko Sugiyama,et al.  A real time robust adaptive microphone array controlled by an SNR estimate , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[6]  Walter Kellermann,et al.  Combination of Adaptive Feedback Cancellation and Binaural Adaptive Filtering in Hearing Aids , 2009, EURASIP J. Adv. Signal Process..

[7]  Jacob Benesty,et al.  Generalized multichannel frequency-domain adaptive filtering: efficient realization and application to hands-free speech communication , 2005, Signal Process..

[8]  Walter Kellermann,et al.  BSS for improved interference estimation for Blind speech signal Extraction with two microphones , 2009, 2009 3rd IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[9]  Walter Kellermann,et al.  A GENERALIZATION OF A CLASS OF BLIND SOURCE SEPARATION ALGORITHMS FOR CONVOLUTIVE MIXTURES , 2003 .

[10]  Wolfgang Herbordt,et al.  Application of a double-talk resilient DFT domain adaptive filter for bin-wise stepsize controls to adaptive beamforming , 2005 .

[11]  Walter Kellermann,et al.  Blind Source Separation for Convolutive Mixtures: A Unified Treatment , 2004 .

[12]  Simon Doclo,et al.  Multi-microphone noise reduction and dereverberation techniques for speech applications , 2003 .

[13]  W. Kellermann,et al.  A natural acoustic front-end for Interactive TV in the EU-Project DICIT , 2009, 2009 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing.