Spatial unmasking of nearby speech sources in a simulated anechoic environment

Spatial unmasking of speech has traditionally been studied with target and masker at the same, relatively large distance. The present study investigated spatial unmasking for configurations in which the simulated sources varied in azimuth and could be either near or far from the head. Target sentences and speech-shaped noise maskers were simulated over headphones using head-related transfer functions derived from a spherical-head model. Speech reception thresholds were measured adaptively, varying target level while keeping the masker level constant at the ‘‘better’’ ear. Results demonstrate that small positional changes can result in very large changes in speech intelligibility when sources are near the listener as a result of large changes in the overall level of the stimuli reaching the ears. In addition, the difference in the target-to-masker ratios at the two ears can be substantially larger for nearby sources than for relatively distant sources. Predictions from an existing model of binaural speech intelligibility are in good agreement with results from all conditions comparable to those that have been tested previously. However, small but important deviations between the measured and predicted results are observed for other spatial configurations, suggesting that current theories do not accurately account for speech intelligibility for some of the novel spatial configurations tested. © 2001 Acoustical Society of America. @DOI: 10.1121/1.1386633#

[1]  Y. Shao,et al.  Sound Localization Cues for a Magnified Head: Implications from Sound Diffraction about a Rigid Sphere , 1993, Presence: Teleoperators & Virtual Environments.

[2]  R R Coles,et al.  Binaural advantages in hearing of speech , 1971, The Journal of Laryngology & Otology.

[3]  R Plomp,et al.  The effect of head-induced interaural time and level differences on speech intelligibility in noise. , 1987, The Journal of the Acoustical Society of America.

[4]  B Kollmeier,et al.  Directivity of binaural noise reduction in spatial multiple noise-source arrangements for normal and impaired listeners. , 1997, The Journal of the Acoustical Society of America.

[5]  W. M. Rabinowitz,et al.  Auditory localization of nearby sources. Head-related transfer functions. , 1999, The Journal of the Acoustical Society of America.

[6]  Tara J. Brown Characterization of acoustic head-related transfer functions for nearby sources , 2000 .

[7]  K. D. Kryter Errata: Method for the Calculation and Use of the Articulation Index [J. Acoust. Soc. Am. 34, 1689–1697 (1962)] , 1964 .

[8]  Ira J. Hirsh,et al.  The Relation between Localization and Intelligibility , 1950 .

[9]  H S Colburn,et al.  Speech intelligibility and localization in a multi-source environment. , 1999, Journal of the Acoustical Society of America.

[10]  B. Shinn-Cunningham,et al.  Tori of confusion: binaural localization cues for sources within reach of a listener. , 2000, The Journal of the Acoustical Society of America.

[11]  B. Shinn-Cunningham DISTANCE CUES FOR VIRTUAL AUDITORY SPACE , 2000 .

[12]  D D Dirks,et al.  The effect of spatially separated sound sources on speech intelligibility. , 1969, Journal of speech and hearing research.

[13]  K. D. Kryter Methods for the Calculation and Use of the Articulation Index , 1962 .

[14]  H S Colburn,et al.  Theory of binaural interaction based on auditory-nerve data. II. Detection of tones in noise. , 1977, The Journal of the Acoustical Society of America.

[15]  R Plomp,et al.  A clinical test for the assessment of binaural speech perception in noise. , 1990, Audiology : official organ of the International Society of Audiology.

[16]  William L. Martens,et al.  Range-dependence of the HRTF for a spherical head , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[17]  H. Fletcher,et al.  The Perception of Speech and Its Relation to Telephony , 1950 .

[18]  C V Pavlovic,et al.  Derivation of primary parameters and procedures for use in speech intelligibility predictions. , 1987, The Journal of the Acoustical Society of America.