Speech intelligibility in various spatial configurations of backgroundnoise

This study is concerned with the influence of spatial separation of disturbing sources of noise on the speech intelligibility. Spatial separation of speech and disturbing sources without changing their acoustic power may contribute to the significant improvement in the speech intelligibility. This problem has been recently analysed in many papers [1-5]. These works have confirmed an important role of the spatial configuration of sources. However, there have been no work investigating this problem for nonsense words (logatoms) that may provide more rigorous tests of this phenomenon. Moreover, this problem has not been analysed for Polish speech. It is important to emphasize that the acoustic and phonetic properties of Polish speech are somewhat different from those of English one. Therefore, the attempt to investigate the influence of the spatial separation of sound sources was made in this study. In the situation with more than one spatially separated disturbances, there may occur a so-called spatial suppression phenomenon, that is ``mutual suppression'' of disturbing sounds in the auditory system that brings about an increase in the speech intelligibility. This phenomenon is also called the spatial unmasking of speech [5, 6]. The research consist in determination of the speech intelligibility in the presence of one or two statistically independent speech-shaped noise sources varying in configuration. Only two pairs of the spatial configurations were investigated. Character of the dependences obtained in the study implies that the spatial suppression occurs in certain configurations of sources only. This effect brings about an increase in the speech intelligibility and can be explained on the basis of the binaural masking level difference (BMLD). It seems then that the BMLD may be a more general phenomenon and includes not only difference in the detection threshold of a pure tone masked by noise but also the improvement in the speech intelligibility, while speech is presented at the background of disturbing signals. Key words: speech intelligibility, speech perception, spatial separation, spatial suppression.

[1]  B. Shinn-Cunningham,et al.  Tori of confusion: binaural localization cues for sources within reach of a listener. , 2000, The Journal of the Acoustical Society of America.

[2]  R Plomp,et al.  The effect of head-induced interaural time and level differences on speech intelligibility in noise. , 1987, The Journal of the Acoustical Society of America.

[3]  Ira J. Hirsh,et al.  The Relation between Localization and Intelligibility , 1950 .

[4]  B C Moore,et al.  Spectral contrast enhancement of speech in noise for listeners with sensorineural hearing impairment: effects on intelligibility, quality, and response times. , 1993, Journal of rehabilitation research and development.

[5]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[6]  H S Colburn,et al.  Binaural sluggishness in the perception of tone sequences and speech in noise. , 2000, The Journal of the Acoustical Society of America.

[7]  B Kollmeier,et al.  Directivity of binaural noise reduction in spatial multiple noise-source arrangements for normal and impaired listeners. , 1997, The Journal of the Acoustical Society of America.

[8]  A. M. Mimpen,et al.  Effect of the orientation of the speaker's head and azimuth of a noise source on the speech reception threshold for sentences , 1980 .

[9]  D D Dirks,et al.  The effect of spatially separated sound sources on speech intelligibility. , 1969, Journal of speech and hearing research.

[10]  N. Durlach,et al.  Interaural correlation discrimination: II. Relation to binaural unmasking. , 1986, The Journal of the Acoustical Society of America.

[11]  Ruth Y Litovsky,et al.  The benefit of binaural hearing in a cocktail party: effect of location and type of interferer. , 2004, The Journal of the Acoustical Society of America.

[12]  Barbara G. Shinn-Cunningham,et al.  Spatial unmasking of nearby speech sources in a simulated anechoic environment , 2000 .

[13]  B. Moore An introduction to the psychology of hearing, 3rd ed. , 1989 .

[14]  R Plomp,et al.  The effect of a hearing aid on the speech-reception threshold of hearing-impaired listeners in quiet and in noise. , 1983, The Journal of the Acoustical Society of America.

[15]  J. C. R. Licklider,et al.  The Influence of Interaural Phase Relations upon the Masking of Speech by White Noise , 1948 .

[16]  H S Colburn,et al.  Speech intelligibility and localization in a multi-source environment. , 1999, The Journal of the Acoustical Society of America.

[17]  Wiktor Jassem,et al.  The Formants of Fricative Consonants , 1965 .

[18]  B. Moore An Introduction to the Psychology of Hearing , 1977 .

[19]  Irwin Pollack,et al.  Stereophonic Listening and Speech Intelligibility against Voice Babble , 1958 .

[20]  R L Freyman,et al.  Spatial release from informational masking in speech recognition. , 2001, The Journal of the Acoustical Society of America.

[21]  Aleksander Sek,et al.  Intelligibility of speech processed by a spectral contrast enhancementprocedure and a binaural procedure , 2004 .

[22]  B G Shinn-Cunningham,et al.  Spatial unmasking of nearby speech sources in a simulated anechoic environment. , 2001, The Journal of the Acoustical Society of America.