Modelling the combined effect of binaural hearing and reverberation

To study the interaction between the intelligibility advantage in rooms due to the presence of early reflections and due to the binaural blocking of interferers from undesired directions, a series of speech reception threshold (SRT) experiments was performed in a simulated room and with a single early reflection of the frontal target speech source as a function of its delay ranging from 0 to 200 ms. From the data and the model considerations given here, one can conclude that binaural unmasking and temporal integration of reflections seem to be comparatively independent from each other, thus providing evidence for a model with a binaural processing stage as a frontend and a reverberation compensation stage (like the MTF model) as the subsequent, independent stage. However, a blocking effect was found for reflections ipsilateral to the noise direction and a release from the deterioration effect at 200 ms delay was found for all non-blocked reflections from azimuths deviating from the midline. These findings are at odds with three versions of a model of binaural speech intelligibility in rooms described here.

[1]  T Houtgast,et al.  A physical method for measuring speech-transmission quality. , 1980, The Journal of the Acoustical Society of America.

[2]  N. I. Durlach,et al.  Binaural signal detection - Equalization and cancellation theory. , 1972 .

[3]  Anna Warzybok,et al.  Effects of spatial and temporal integration of a single early reflection on speech intelligibility. , 2013, The Journal of the Acoustical Society of America.

[4]  R. Beutelmann,et al.  Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners. , 2006, The Journal of the Acoustical Society of America.

[5]  Birger Kollmeier,et al.  Prediction of the influence of reverberation on binaural speech intelligibility in noise and in quiet. , 2011, The Journal of the Acoustical Society of America.

[6]  M. Sachs,et al.  Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. , 1979, The Journal of the Acoustical Society of America.

[7]  J Tchorz,et al.  A model of auditory perception as front end for automatic speech recognition. , 1999, The Journal of the Acoustical Society of America.

[8]  J.P.A. Lochner,et al.  The influence of reflections on auditorium acoustics , 1964 .

[9]  Mathieu Lavandier,et al.  Prediction of binaural speech intelligibility against noise in rooms. , 2010, The Journal of the Acoustical Society of America.

[10]  Mathieu Lavandier,et al.  Prediction of reverberant speech intelligibility against multiple noise interferers in rooms: Binaural useful‐to‐detrimental ratios. , 2010 .

[11]  Birger Kollmeier,et al.  Revision, extension, and evaluation of a binaural speech intelligibility model. , 2010, The Journal of the Acoustical Society of America.

[12]  Kuansan Wang,et al.  Self-normalization and noise-robustness in early auditory representations , 1994, IEEE Trans. Speech Audio Process..

[13]  Heinrich Kuttruff,et al.  Room acoustics , 1973 .

[14]  Iris Arweiler,et al.  The influence of spectral characteristics of early reflections on speech intelligibility. , 2011, The Journal of the Acoustical Society of America.

[15]  Michael Kleinschmidt,et al.  Localized spectro-temporal features for automatic speech recognition , 2003, INTERSPEECH.

[16]  R. Drullman,et al.  Binaural intelligibility prediction based on the speech transmission index. , 2008, The Journal of the Acoustical Society of America.

[17]  B Kollmeier,et al.  Directivity of binaural noise reduction in spatial multiple noise-source arrangements for normal and impaired listeners. , 1997, The Journal of the Acoustical Society of America.

[18]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.