In realistic listening scenarios, humans are extremely skillful in following one particular talker even in the presence of many others (i.e., the cocktail party effect). One aspect of this is the ability of a listener to make use of the spatial separation between different sound sources. In a complex acoustic scene, as interferers are moved away from the spatial position of the target, speech intelligibility (SI) increases, often referred to as spatial release from masking (SRM). This benefit is largely based on the listeners’ ability to make use of interaural level differences (ILD) and interaural time differences (ITD), which vary with the source location. While many studies have explored SRM, few have investigated the effects of overall number and spatial distribution of interferers while controlling for monaural masking effects. In the present study, speech reception thresholds (SRTs) and lateralization thresholds were measured over headphones in babble noise conditions consisting of 2, 4, 8 and 12 talkers. The perceived locations of the signal (female voice) and individual maskers (male, time-reversed voices) were steered separately either to the left or to the right using 680 μsec ITDs. For a fixed number of maskers, the distribution of interfering talkers (i.e., co-located or separated from the target) was varied. Thus, for all conditions with the same number of maskers, the monaural SNR was held constant regardless of the perceived spatial distribution. The performance between the speech and lateralization tasks was highly correlated. No substantial SRM occurred while one or more maskers were co-located with the target. Interestingly, spatially shifting the last 2 co-located interferers resulted in the same SRM, independent of the overall number of maskers. The same finding was found for the last 4 co-located maskers. The results suggest that SRM is independent of the overall number of interfering talkers.
[1]
K. Wagener,et al.
Design, optimization and evaluation of a Danish sentence test in noise: Diseño, optimización y evaluación de la prueba Danesa de frases en ruido
,
2003,
International journal of audiology.
[2]
B Kollmeier,et al.
Directivity of binaural noise reduction in spatial multiple noise-source arrangements for normal and impaired listeners.
,
1997,
The Journal of the Acoustical Society of America.
[3]
Ruth Y Litovsky,et al.
The benefit of binaural hearing in a cocktail party: effect of location and type of interferer.
,
2004,
The Journal of the Acoustical Society of America.
[4]
M. Cooke,et al.
Consonant identification in N-talker babble is a nonmonotonic function of N.
,
2005,
The Journal of the Acoustical Society of America.
[5]
Jon Barker,et al.
An audio-visual corpus for speech perception and automatic speech recognition.
,
2006,
The Journal of the Acoustical Society of America.
[6]
E. C. Cmm,et al.
on the Recognition of Speech, with
,
2008
.
[7]
E. C. Cherry.
Some Experiments on the Recognition of Speech, with One and with Two Ears
,
1953
.
[8]
Stuart Rosen,et al.
Listening to speech in a background of other talkers: effects of talker number and noise vocoding.
,
2013,
The Journal of the Acoustical Society of America.
[9]
R Plomp,et al.
Effect of multiple speechlike maskers on binaural speech recognition in normal and impaired hearing.
,
1992,
The Journal of the Acoustical Society of America.