Binaural masking release in symmetric listening conditions with spectro-temporally modulated maskers.

Speech reception thresholds (SRTs) decrease as target and maskers are spatially separated (spatial release from masking, SRM). The current study systematically assessed how SRTs and SRM for a frontal target in a spatially symmetric masker configuration depend on spectro-temporal masker properties, the availability of short-time interaural level difference (ILD) and interaural time difference (ITD) cues, and informational masking. Maskers ranged from stationary noise to single interfering talkers and were modified by head-related transfer functions to provide (i) different binaural cues (ILD, ITD, or both) and (ii) independent maskers in each ear ("infinite ILD"). Additionally, a condition was tested in which only information from short-time spectro-temporal segments of the ear with the more favorable signal-to-noise ratio (better-ear glimpses) was presented. For noise-based maskers, ILD, ITD, and spectral changes related to masker location contributed similarly to SRM, while ILD cues played a larger role when temporal modulation was introduced. For speech maskers, glimpsing and perceived location contributed roughly equally, and ITD contributed less. Results in the "infinite ILD" condition suggest that limitations of better-ear glimpsing may result in a maximal SRM of 12 dB for maskers with low or absent informational masking. Comparison to binaural model predictions highlighted the importance of short-time processing and helped to clarify the contribution of the different binaural cues and mechanisms.
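To make the two central quantities concrete, the following is a minimal Python sketch, not the processing used in the study. It assumes that per-ear signal-to-noise ratios are already available as a grid of spectro-temporal segments (rows are frequency bands, columns are time frames), and it takes SRM as the co-located SRT minus the spatially separated SRT. The function names, the grid layout, and all numeric values are hypothetical and chosen only for illustration.

```python
from typing import List

# Hypothetical layout: rows = frequency bands, columns = time frames, values in dB.
Grid = List[List[float]]


def better_ear_snr(snr_left_db: Grid, snr_right_db: Grid) -> Grid:
    """For each spectro-temporal segment, keep the SNR of the more favorable ear."""
    return [
        [max(left, right) for left, right in zip(row_left, row_right)]
        for row_left, row_right in zip(snr_left_db, snr_right_db)
    ]


def better_ear_choice(snr_left_db: Grid, snr_right_db: Grid) -> List[List[str]]:
    """Label each segment with the ear that would be selected ('L' wins ties)."""
    return [
        ["L" if left >= right else "R" for left, right in zip(row_left, row_right)]
        for row_left, row_right in zip(snr_left_db, snr_right_db)
    ]


def spatial_release_from_masking(srt_colocated_db: float, srt_separated_db: float) -> float:
    """SRM in dB: positive when spatial separation lowers (improves) the SRT."""
    return srt_colocated_db - srt_separated_db


# Illustrative values only; these are not data from the study.
left = [[-6.0, 2.0], [0.0, -10.0]]
right = [[1.0, -4.0], [-3.0, -3.0]]

glimpsed = better_ear_snr(left, right)    # [[1.0, 2.0], [0.0, -3.0]]
chosen = better_ear_choice(left, right)   # [['R', 'L'], ['L', 'R']]
srm = spatial_release_from_masking(-8.0, -14.0)  # 6.0 dB
```

In this sketch the selection is made independently in every segment, which corresponds to an idealized better-ear glimpsing strategy; any real listener or model would be limited by the temporal and spectral resolution with which such segments can be resolved.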
