Acoustic echo and noise canceller for personal hands-free video IP phone

This paper presents the implementation and evaluation of a proposed acoustic echo and noise canceller (AENC) for videotelephony-enabled personal hands-free internet protocol (IP) phones. The canceller offers noise-robust performance, low processing delay, and low computational complexity. The AENC employs an adaptive digital filter (ADF) method and a noise reduction (NR) method that together effectively eliminate the undesired acoustic echo and background noise contained in a microphone signal, even in a noisy environment. The ADF method controls its step size according to the level of disturbance, such as background noise, which minimizes the effect of that disturbance in a noisy environment. The NR method estimates the noise level under the assumption that the noise amplitude spectrum is constant over a short period, an assumption that does not hold for the amplitude spectrum of speech. In addition, this paper presents a method for decreasing the computational complexity of the ADF process without increasing the processing delay, which makes the processing suitable for real-time implementation. The experimental results demonstrate that the proposed AENC suppresses echo and noise sufficiently in a noisy environment, resulting in natural-sounding speech.
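The idea of disturbance-dependent step-size control can be illustrated with a minimal sketch. The abstract does not give the control rule used in the paper, so the code below assumes a generic normalized LMS (NLMS) filter and a hypothetical rule that shrinks the step size as the assumed noise power grows relative to the far-end signal power. The names `controlled_step_size` and `nlms_echo_canceller`, and the parameters `mu_max`, `noise_power`, and `taps`, are illustrative and are not taken from the paper.

```python
import random


def controlled_step_size(ref_power, noise_power, taps, mu_max=0.5, eps=1e-8):
    """Hypothetical control rule (an assumption, not the paper's formula).

    ref_power is the far-end signal energy over the filter window and
    noise_power is the assumed per-sample disturbance power. The step size
    approaches mu_max when the disturbance is negligible and shrinks as the
    disturbance grows.
    """
    return mu_max * ref_power / (ref_power + taps * noise_power + eps)


def nlms_echo_canceller(far_end, mic, taps=8, mu_max=0.5,
                        noise_power=0.0, eps=1e-8):
    """Time-domain NLMS echo canceller with a disturbance-dependent step size.

    Returns the error signal (microphone minus estimated echo) and the
    final filter coefficients.
    """
    w = [0.0] * taps    # estimated echo-path coefficients
    buf = [0.0] * taps  # most recent far-end samples, newest first
    out = []
    for x, d in zip(far_end, mic):
        buf = [x] + buf[:-1]
        echo_hat = sum(wi * bi for wi, bi in zip(w, buf))
        e = d - echo_hat  # residual after echo removal
        ref_power = sum(b * b for b in buf)
        mu = controlled_step_size(ref_power, noise_power, taps, mu_max, eps)
        g = mu * e / (ref_power + eps)  # normalized update gain
        w = [wi + g * bi for wi, bi in zip(w, buf)]
        out.append(e)
    return out, w


if __name__ == "__main__":
    random.seed(0)
    echo_path = [0.5, -0.3, 0.1]  # toy echo path, purely illustrative
    far = [random.gauss(0.0, 1.0) for _ in range(2000)]
    mic = [sum(echo_path[k] * far[n - k]
               for k in range(len(echo_path)) if n - k >= 0)
           for n in range(len(far))]
    residual, w = nlms_echo_canceller(far, mic)
    print([round(c, 3) for c in w[:3]])
```

In this sketch the disturbance level is passed in as a constant; a practical system would estimate it at run time, for example from the output of a noise-level estimator such as the one the NR method provides.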
