Spatial release from speech-on-speech masking in the median sagittal plane.

Several studies have described a release from speech-on-speech masking associated with separation of target and masker sources in the median sagittal plane. Some have excluded the possibility that small differences between target and masker interaural time disparities can fully account for this release. This study explored the mechanisms underlying the spatial release from speech-on-speech masking that can be obtained in the absence of such differences. In one condition, interaural time disparities were removed from the nominal median-sagittal-plane, head-related impulse responses used to generate the virtual auditory space within which competing sentences were presented. In other conditions, interaural level and spectral disparities also were manipulated by presenting competing sentences monaurally or diotically after convolution with one ear's head-related impulse responses. It was found that substantial spatial release from masking can be obtained in the absence of any interaural disparities and that such disparities probably make a relatively minor contribution to spatial release from speech-on-speech masking in the median sagittal plane. It is argued that this release from masking is driven primarily by a reduction in informational masking that occurs when monaural information at one, or both, of the listener's ears facilitates differentiation of competing sentences that emanate from spatially separated sources.

[1]  Russell L. Martin,et al.  Memory for the locations of environmental sounds. , 2011, The Journal of the Acoustical Society of America.

[2]  J. Moncur,et al.  Binaural and monaural speech intelligibility in reverberation. , 1967, Journal of speech and hearing research.

[3]  Mark A. Ericson,et al.  Factors That Influence Intelligibility in Multitalker Speech Displays , 2004 .

[4]  L. Rabiner,et al.  Predicting binaural gain in intelligibility and release from masking for speech. , 1967, Journal of the Acoustical Society of America.

[5]  R L Freyman,et al.  The role of perceived spatial separation in the unmasking of speech. , 1999, The Journal of the Acoustical Society of America.

[6]  B. Shinn-Cunningham,et al.  Note on informational masking. , 2003, The Journal of the Acoustical Society of America.

[7]  E. Shaw Transformation of sound pressure level from the free field to the eardrum in the horizontal plane. , 1974, The Journal of the Acoustical Society of America.

[8]  B. Shinn-Cunningham,et al.  Note on informational masking (L) , 2003 .

[9]  Ruth Y Litovsky,et al.  The role of head-induced interaural time and level differences in the speech reception threshold for multiple interfering sound sources. , 2004, The Journal of the Acoustical Society of America.

[10]  W. T. Nelson,et al.  A speech corpus for multitalker communications research. , 2000, The Journal of the Acoustical Society of America.

[11]  J. C. Steinberg,et al.  Factors Governing the Intelligibility of Speech Sounds , 1945 .

[12]  Interaural Time and Intensity Difference and the MLD , 1965 .

[13]  John F. Corso,et al.  Age and Sex Differences in Pure‐Tone Thresholds , 1959 .

[14]  John F Culling,et al.  The spatial unmasking of speech: evidence for better-ear listening. , 2006, The Journal of the Acoustical Society of America.

[15]  Richard L. McKinley,et al.  A cocktail party effect in the median plane , 1999 .

[16]  T W Tillman,et al.  Release of masking for speech through interaural time delay. , 1967, The Journal of the Acoustical Society of America.

[17]  D S Brungart,et al.  Informational and energetic masking effects in the perception of two simultaneous talkers. , 2001, The Journal of the Acoustical Society of America.

[18]  A. Bronkhorst,et al.  Multichannel speech intelligibility and talker recognition using monaural, binaural, and three-dimensional auditory presentation. , 2000, The Journal of the Acoustical Society of America.

[19]  Russell L. Martin,et al.  Segregation of Multiple Talkers in the Vertical Plane: Implications for the Design of a Multiple Talker Display , 2002 .

[20]  Barbara G. Shinn-Cunningham,et al.  Bottom-up and top-down influences on spatial unmasking , 2005 .

[21]  N. Durlach Equalization and Cancellation Theory of Binaural Masking‐Level Differences , 1963 .

[22]  C. Mason,et al.  Release from masking due to spatial separation of sources in the identification of nonspeech auditory patterns. , 1998, The Journal of the Acoustical Society of America.

[23]  L. A. Jeffress,et al.  Localization of High‐Frequency Tones , 1957 .

[24]  R. Butler,et al.  Factors that influence the localization of sound in the vertical plane. , 1968, The Journal of the Acoustical Society of America.

[25]  Robert B. King,et al.  The Impact of Signal Bandwidth on Auditory Localization: Implications for the Design of Three-Dimensional Audio Displays , 1997, Hum. Factors.

[26]  S Zerlin Interaural time and intensity difference and the MLD. , 1966, The Journal of the Acoustical Society of America.

[27]  Frederick J. Gallun,et al.  The ability to listen with independent ears. , 2007, The Journal of the Acoustical Society of America.

[28]  H S Colburn,et al.  Speech intelligibility and localization in a multi-source environment. , 1999, The Journal of the Acoustical Society of America.

[29]  Gerald Kidd,et al.  Tuning in the spatial dimension: evidence from a masked speech identification task. , 2008, The Journal of the Acoustical Society of America.

[30]  Uma Balakrishnan,et al.  Spatial And Spectral Factors In Release From Informational Masking In Speech Recognition , 2005 .

[31]  K. D. Kryter Methods for the Calculation and Use of the Articulation Index , 1962 .

[32]  B Kollmeier,et al.  Directivity of binaural noise reduction in spatial multiple noise-source arrangements for normal and impaired listeners. , 1997, The Journal of the Acoustical Society of America.

[33]  Virginia Best,et al.  The extent to which a position-based explanation accounts for binaural release from informational masking. , 2008, The Journal of the Acoustical Society of America.

[34]  M. Ericson,et al.  The Intelligibility of Multiple Talkers Separated Spatially in Noise , 2001 .

[35]  E. C. Cmm,et al.  on the Recognition of Speech, with , 2008 .

[36]  DeLiang Wang,et al.  Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation. , 2006, The Journal of the Acoustical Society of America.

[37]  J. W. Worley,et al.  Auditory attention based on differences in median vertical plane position. , 2002 .

[38]  J. F. Corso Age and sex differences in pure-tone thresholds. Survey of hearing levels from 18 to 65 years. , 1963, Archives of otolaryngology.

[39]  A. Duquesnoy Effect of a single interfering noise or speech source upon the binaural sentence intelligibility of aged persons. , 1983, The Journal of the Acoustical Society of America.

[40]  Neil L. Aaronson,et al.  Release from speech-on-speech masking in a front-and-back geometry. , 2009, The Journal of the Acoustical Society of America.

[41]  R L Freyman,et al.  Spatial release from informational masking in speech recognition. , 2001, The Journal of the Acoustical Society of America.