Low-Complexity Multi-Microphone Acoustic Echo Control in the Short-Time Fourier Transform Domain

Many modern communication and smart devices are equipped with several microphones, in addition to one or more loudspeakers. Each microphone not only acquires sounds produced in the near-end room, i.e., desired near-end speech, background noise, and other interferences, but also a far-end signal that is reproduced by the loudspeaker(s). This particular type of acoustic coupling, commonly denoted as acoustic echo, can be reduced in a distortionless manner by employing multi-microphone acoustic echo cancellation (MM-AEC) techniques. However, under noisy conditions, the performance of AEC is limited by the echo-to-noise ratio, and additional echo reduction is needed. Further, to ensure high-quality end-to-end communication in noisy environments, background noise has to be reduced as well. To achieve the latter, multi-microphone speech enhancement techniques, such as beamforming (BF), are often used as they are capable of reducing undesired signal components while causing little distortion to the desired near-end speech. In spite of its high computational cost, the most effective solution to reduce acoustic echoes and background noise is to cascade MM-AEC and BF. In this work, a low-complexity multi-microphone echo controller is introduced, which not only combines low-complexity MM-AEC with BF, but also integrates residual echo reduction into the beamformer design.

[1]  Israel Cohen,et al.  Convolutive Transfer Function Generalized Sidelobe Canceler , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Pascal Scalart,et al.  Comparison of three post-filtering algorithms for residual acoustic echo reduction , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  J. Capon High-resolution frequency-wavenumber spectrum analysis , 1969 .

[4]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[5]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[6]  Ehud Weinstein,et al.  System identification using nonstationary signals , 1996, IEEE Trans. Signal Process..

[7]  Jacob Benesty,et al.  Gaussian Model-Based Multichannel Speech Presence Probability , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Jacob Benesty,et al.  A Nonparametric VSS NLMS Algorithm , 2006, IEEE Signal Processing Letters.

[9]  Emanuel A. P. Habets,et al.  On the Spatial Coherence of Residual Echoes After Stft-Domain Multi-Microphone Acoustic Echo Cancellation , 2018, 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC).

[10]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[11]  Andreas Antoniou,et al.  Analysis of LMS-Newton adaptive filtering algorithms with variable convergence factor , 1995, IEEE Trans. Signal Process..

[12]  E. Hänsler,et al.  Acoustic Echo and Noise Control: A Practical Approach , 2004 .

[13]  Israel Cohen,et al.  Joint noise reduction and acoustic echo cancellation using the transfer-function generalized sidelobe canceller , 2007, Speech Commun..

[14]  Jacob Benesty,et al.  Robust general Kalman filter for echo cancellation , 2013, 21st European Signal Processing Conference (EUSIPCO 2013).

[15]  C. H. Sherman,et al.  Spatial‐Correlation Functions for Various Noise Models , 1962 .

[16]  Jacob Benesty,et al.  An Integrated Solution for Online Multichannel Noise Tracking and Reduction , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  Ying-Ren Chien,et al.  Convex Combined Adaptive Filtering Algorithm for Acoustic Echo Cancellation in Hostile Environments , 2018, IEEE Access.

[18]  Ali H. Sayed,et al.  Combinations of Adaptive Filters: Performance and convergence properties , 2021, IEEE Signal Processing Magazine.

[19]  Israel Cohen,et al.  Relative transfer function identification using speech signals , 2004, IEEE Transactions on Speech and Audio Processing.

[20]  Satoshi Nakamura,et al.  Joint optimization of LCMV beamforming and acoustic echo cancellation , 2004, 2004 12th European Signal Processing Conference.

[21]  Stefan Goetze,et al.  RESIDUAL ECHO POWER SPECTRAL DENSITY ESTIMATION BASED ON AN OPTIMAL SMOOTHED MISALIGNMENT FOR ACOUSTIC ECHO CANCELATION , 2005 .

[22]  Walter Kellermann,et al.  Source-specific system identification , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[23]  Emanuel A. P. Habets,et al.  A Low-Complexity State-Space Architecture for Multi-Microphone Acoustic Echo Control , 2018, 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC).

[24]  Walter Kellermann JOINT DESIGN OF ACOUSTIC ECHO CANCELLATION AND ADAPTIVE BEAMFORMING FOR MICROPHONE ARRAYS , 2022 .

[25]  Walter Kellermann,et al.  Limits for generalized sidelobe cancellers with embedded acoustic echo cancellation , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[26]  Joerg Bitzer,et al.  Post-Filtering Techniques , 2001, Microphone Arrays.

[27]  Simon Haykin,et al.  Adaptive Filter Theory 4th Edition , 2002 .

[28]  Israel Cohen,et al.  On Multiplicative Transfer Function Approximation in the Short-Time Fourier Transform Domain , 2007, IEEE Signal Processing Letters.

[29]  S. Gannot,et al.  Joint acoustic echo cancellation and transfer function GSC in the frequency domain , 2004, 2004 23rd IEEE Convention of Electrical and Electronics Engineers in Israel.

[30]  Christophe Beaugeant,et al.  Combined noise and echo reduction in hands-free systems: a survey , 2001, IEEE Trans. Speech Audio Process..

[31]  Emanuel A. P. Habets,et al.  Multi-Microphone acoustic echo cancellation using relative echo transfer functions , 2017, 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[32]  J. Shynk Frequency-domain and multirate adaptive filtering , 1992, IEEE Signal Processing Magazine.

[33]  Israel Cohen,et al.  System Identification in the Short-Time Fourier Transform Domain With Crossband Filtering , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[34]  Richard C. Hendriks,et al.  Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[35]  Eberhard Hänsler,et al.  Hands-free telephones - joint control of echo cancellation and postfiltering , 2000, Signal Process..

[36]  Sascha Spors,et al.  Source-domain adaptive filtering for MIMO systems with application to acoustic echo cancellation , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[37]  Piet C. W. Sommen,et al.  New constraining method for partitioned block frequency-domain adaptive filters , 2002, IEEE Trans. Signal Process..

[38]  Rainer Martin,et al.  Spectral Subtraction Based on Minimum Statistics , 2001 .

[39]  Rainer Martin,et al.  Combined acoustic echo cancellation, dereverberation and noise reduction: a two microphone approach , 1994 .

[40]  Satoru Emura,et al.  Residual Echo Reduction for Multichannel Acoustic Echo Cancelers With a Complex-Valued Residual Echo Estimate , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[41]  Giovanni Del Galdo,et al.  On the spatial coherence in mixed sound fields and its application to signal-to-diffuse ratio estimation. , 2012, The Journal of the Acoustical Society of America.

[42]  Walter Kellermann,et al.  GSAEC — Acoustic echo cancellation embedded into the generalized sidelobe canceller , 2000, 2000 10th European Signal Processing Conference.

[43]  Jesper Jensen,et al.  Analysis of Acoustic Feedback/Echo Cancellation in Multiple-Microphone and Single-Loudspeaker Systems Using a Power Transfer Function Method , 2011, IEEE Transactions on Signal Processing.

[44]  Jacob Benesty,et al.  A Variable Step-Size Affine Projection Algorithm Designed for Acoustic Echo Cancellation , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[45]  Emanuel A. P. Habets,et al.  Nonstationary Noise PSD Matrix Estimation for Multichannel Blind Speech Extraction , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[46]  Manfred R. Schroeder,et al.  Statistical parameters of the frequency response curves of large rooms , 1987 .

[47]  Woon-Seng Gan,et al.  Subband Adaptive Filtering: Theory and Implementation , 2009 .

[48]  Peter Vary,et al.  Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones , 2006, Signal Process..

[49]  Israel Cohen,et al.  Relative Transfer Function Identification Using Convolutive Transfer Function Approximation , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[50]  Walter Kellermann,et al.  Strategies for combining acoustic echo cancellation and adaptive beamforming microphone arrays , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[51]  Rainer Martin,et al.  Unbiased residual echo power estimation for hands-free telephony , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[52]  C. Avendano Acoustic echo suppression in the STFT domain , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[53]  Alfred Mertins,et al.  New aspects of combining echo cancellers with beamformers , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[54]  Gerald Enzner,et al.  Recursive Bayesian Control of Multichannel Acoustic Echo Cancellation , 2011, IEEE Signal Processing Letters.