Multi-Channel Noise Reduction with Interference Suppression on Mobile Phones

Speech enhancement techniques have been widely incorporated in voice communications for improving the speech quality and intelligibility. However, existing methods are not sufficiently effective in addressing complex noisy conditions in real-world hands-free scenarios. This is especially true for interferences from other competing talkers or point sources. Additionally, the prior work needs more improvements to fully exploit the deployment of multi-channel microphone array on the burgeoning high-end mobile phones. In this paper, we propose a multi-channel noise reduction (NR) method based on a hybrid adaptive noise coherence models without the need for the prior knowledge of the interference direction. The method also adapts to the fast-varying noisy and reverberant conditions. Evaluation results on real-world recordings collected via a mobile phone confirm its superior effectiveness in terms of the background noise/interference suppression and the overall speech quality with lower distortions compared to the state-of-the-art NR techniques.

[1]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[2]  Joerg Bitzer,et al.  Post-Filtering Techniques , 2001, Microphone Arrays.

[3]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[4]  Volkan Cevher,et al.  Structured Sparsity Models for Reverberant Speech Separation , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[5]  Dennis H. Klatt,et al.  Prediction of perceived phonetic distance from critical-band spectra: A first step , 1982, ICASSP.

[6]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[7]  Philipos C. Loizou,et al.  Reasons why Current Speech-Enhancement Algorithms do not Improve Speech Intelligibility and Suggested Solutions , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Jan Skoglund,et al.  Globally optimized least-squares post-filtering for microphone array speech enhancement , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Philipos C. Loizou,et al.  A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[11]  Richard C. Hendriks,et al.  Noise power estimation based on the probability of speech presence , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[12]  Christophe Beaugeant,et al.  Blind estimation of the coherent-to-diffuse energy ratio from noisy speech signals , 2011, 2011 19th European Signal Processing Conference.

[13]  Hervé Bourlard,et al.  Microphone array post-filter based on noise field coherence , 2003, IEEE Trans. Speech Audio Process..

[14]  Heinrich Kuttruff,et al.  Room acoustics , 1973 .

[15]  R. Zelinski,et al.  A microphone array with adaptive post-filtering for noise reduction in reverberant rooms , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[16]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[17]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Philipos C. Loizou,et al.  A Dual-Microphone Algorithm That Can Cope With Competing-Talker Scenarios , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  John H. L. Hansen,et al.  A Hybrid Coherence Model for Noise Reduction in Reverberant Environments , 2015, IEEE Signal Processing Letters.

[20]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[21]  W. Marsden I and J , 2012 .

[22]  Emmanuel Vincent,et al.  Subjective and Objective Quality Assessment of Audio Source Separation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.