Performance Prediction of the BinauralMVDR Beamformer with Partial Noise Estimation using a Binaural Speech IntelligibilityModel

An objective evaluation of binaural noise reduction algorithms allows for directly comparing the performance of different algorithm realizations. Here, a binaural speech intelligibility model (BSIM), which mimics the effective binaural processing of a human listeners, is used to predict the performance of the binaural minimum-variance distortionless response beamformer with partial noise estimation (BMVDR-N), which aims at preserving the speech component in a reference microphone and a scaled version of the noise component. The BMVDR-N beamformer is evaluated with respect to a predicted change in SRT depending on the parameter η, which controls a trade-off between noise reduction and binaural cue preservation of the noise component. The results show that BSIM benefits from the preserved binaural cues suggesting that the BMVDR-N beamformer can improve the spatial quality of a scene without affecting speech intelligibility.

[1]  Daniel Marquardt,et al.  Interaural Coherence Preservation for Binaural Noise Reduction Using Partial Noise Estimation and Spectral Postfiltering , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[2]  E. C. Cmm,et al.  on the Recognition of Speech, with , 2008 .

[3]  Volker Hohmann,et al.  Database of Multichannel In-Ear and Behind-the-Ear Head-Related and Binaural Room Impulse Responses , 2009, EURASIP J. Adv. Signal Process..

[4]  D. Gilson Revision , 2020 .

[5]  Sharon Gannot,et al.  Binaural Speech Processing with Application to Hearing Devices , 2018, Audio Source Separation and Speech Enhancement.

[6]  N. Durlach Equalization and Cancellation Theory of Binaural Masking‐Level Differences , 1963 .

[7]  J. C. Middlebrooks,et al.  Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited. , 2002, The Journal of the Acoustical Society of America.

[8]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[9]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[10]  R Plomp,et al.  The effect of head-induced interaural time and level differences on speech intelligibility in noise. , 1987, The Journal of the Acoustical Society of America.

[11]  Birger Kollmeier,et al.  Prediction of the influence of reverberation on binaural speech intelligibility in noise and in quiet. , 2011, The Journal of the Acoustical Society of America.

[12]  Anna Warzybok,et al.  Modeling the effects of a single reflection on binaural speech intelligibility. , 2014, The Journal of the Acoustical Society of America.

[13]  Volker Hohmann,et al.  Interaural Coherence Preservation in Multi-Channel Wiener Filtering-Based Noise Reduction for Binaural Hearing Aids , 2013, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[14]  K. S. Rhebergen,et al.  A Speech Intelligibility Index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners. , 2005, The Journal of the Acoustical Society of America.