The role of head-induced interaural time and level differences in the speech reception threshold for multiple interfering sound sources.

Three experiments investigated the roles of interaural time differences (ITDs) and level differences (ILDs) in spatial unmasking in multi-source environments. In experiment 1, speech reception thresholds (SRTs) were measured in virtual-acoustic simulations of an anechoic environment with three interfering sound sources of either speech or noise. The target source lay directly ahead, while three interfering sources were (1) all at the target's location (0 degrees,0 degrees,0 degrees), (2) at locations distributed across both hemifields (-30 degrees,60 degrees,90 degrees), (3) at locations in the same hemifield (30 degrees,60 degrees,90 degrees), or (4) co-located in one hemifield (90 degrees,90 degrees,90 degrees). Sounds were convolved with head-related impulse responses (HRIRs) that were manipulated to remove individual binaural cues. Three conditions used HRIRs with (1) both ILDs and ITDs, (2) only ILDs, and (3) only ITDs. The ITD-only condition produced the same pattern of results across spatial configurations as the combined cues, but with smaller differences between spatial configurations. The ILD-only condition yielded similar SRTs for the (-30 degrees,60 degrees,90 degrees) and (0 degrees,0 degrees,0 degrees) configurations, as expected for best-ear listening. In experiment 2, pure-tone BMLDs were measured at third-octave frequencies against the ITD-only, speech-shaped noise interferers of experiment 1. These BMLDs were 4-8 dB at low frequencies for all spatial configurations. In experiment 3, SRTs were measured for speech in diotic, speech-shaped noise. Noises were filtered to reduce the spectrum level at each frequency according to the BMLDs measured in experiment 2. SRTs were as low or lower than those of the corresponding ITD-only conditions from experiment 1. Thus, an explanation of speech understanding in complex listening environments based on the combination of best-ear listening and binaural unmasking (without involving sound-localization) cannot be excluded.

[1]  E. C. Cherry Some Experiments on the Recognition of Speech, with One and with Two Ears , 1953 .

[2]  Ruth Y Litovsky,et al.  The benefit of binaural hearing in a cocktail party: effect of location and type of interferer. , 2004, The Journal of the Acoustical Society of America.

[3]  Jens Blauert,et al.  The AUDIS catalog of human HRTFs , 1998 .

[4]  N. R. Zitron Multiple Scattering of Elastic Waves by Two Arbitrary Cylinders , 1967 .

[5]  H S Colburn,et al.  Binaural sluggishness in the perception of tone sequences and speech in noise. , 2000, The Journal of the Acoustical Society of America.

[6]  K. D. Kryter Errata: Method for the Calculation and Use of the Articulation Index [J. Acoust. Soc. Am. 34, 1689–1697 (1962)] , 1964 .

[7]  J. Culling,et al.  Perceptual separation of concurrent speech sounds: absence of across-frequency grouping by common interaural delay. , 1995, The Journal of the Acoustical Society of America.

[8]  J. Culling,et al.  Measurements of the binaural temporal window using a detection task , 1998 .

[9]  N. Durlach Equalization and Cancellation Theory of Binaural Masking‐Level Differences , 1963 .

[10]  R. Plomp A signal-to-noise ratio model for the speech-reception threshold of the hearing impaired. , 1986, Journal of speech and hearing research.

[11]  B Kollmeier,et al.  Directivity of binaural noise reduction in spatial multiple noise-source arrangements for normal and impaired listeners. , 1997, The Journal of the Acoustical Society of America.

[12]  William Noble,et al.  Hearing speech against spatially separate competing speech versus competing noise , 2002, Perception & psychophysics.

[13]  N. I. Durlach,et al.  Binaural signal detection - Equalization and cancellation theory. , 1972 .

[14]  L. Rabiner,et al.  Predicting binaural gain in intelligibility and release from masking for speech. , 1967, Journal of the Acoustical Society of America.

[15]  IEEE Recommended Practice for Speech Quality Measurements , 1969, IEEE Transactions on Audio and Electroacoustics.

[16]  Earl D. Schubert,et al.  Some Preliminary Experiments on Binaural Time Delay and Intelligibility , 1956 .

[17]  K. D. Kryter Methods for the Calculation and Use of the Articulation Index , 1962 .

[18]  A. M. Mimpen,et al.  Effect of the orientation of the speaker's head and azimuth of a noise source on the speech reception threshold for sentences , 1980 .

[19]  R Plomp,et al.  Effect of multiple speechlike maskers on binaural speech recognition in normal and impaired hearing. , 1992, The Journal of the Acoustical Society of America.

[20]  H. Fletcher,et al.  The Perception of Speech and Its Relation to Telephony , 1950 .