The speech intelligibility and applicability of the speech transmission index in large spaces

Abstract This paper aims to explore the influence factors of speech intelligibility and the applicability of the speech transmission index (STI) in large spaces, where the sound energy is unevenly distributed and non-exponentially decays. The subjective speech intelligibility tests were conducted in Mandarin (China mainland) in two large spaces with volumes of 97,000 m3 and 246,000 m3. Objective indicators such as the Reverberation Time (RT), Early Decay Time (EDT), Definition (D50), and Speech Transmission Index (STI) under different signal-to-noise ratio (SNRs) were also measured in these two spaces. The results showed that both the SNR and room acoustics had significant effect on the speech intelligibility in these two spaces, but the effect of room acoustics on speech intelligibility was also affected by SNR. The speech intelligibility scores significantly increased with the increase in SNR when the SNR was less than 14.4 dB. In terms of room acoustics, D50 was more relevant to speech intelligibility than RT and EDT in these two large spaces when SNR ranged from −5dB to 15 dB. The STI value in large spaces should not be used as in ordinary spaces to evaluate the speech intelligibility. Based on the tests in this paper, the corresponding relation between STI and speech intelligibility in large spaces was modified, and a new rating threshold of STI was also proposed according to the revised relation, which indicated a necessity to modify the rating criteria of using STI to predict speech intelligibility in large spaces.

[1]  T. Houtgast,et al.  The Modulation Transfer Function in Room Acoustics as a Predictor of Speech Intelligibility , 1973 .

[2]  Masayuki Morimoto,et al.  Acceptable range of speech level in noisy sound fields for young adults and elderly persons. , 2011, The Journal of the Acoustical Society of America.

[3]  T Houtgast,et al.  A physical method for measuring speech-transmission quality. , 1980, The Journal of the Acoustical Society of America.

[4]  Shiu-keung Tang Speech related acoustical parameters in classrooms and their relationships , 2008 .

[5]  Gino G. Sacerdote,et al.  Measurements of the Acoustical Properties of Two Roman Basilicas , 1953 .

[6]  J S Bradley,et al.  On the combined effects of signal-to-noise ratio and room acoustics on speech intelligibility. , 1999, The Journal of the Acoustical Society of America.

[7]  B. S. Ramakrishna,et al.  Intelligibility of speech under nonexponential decay conditions. , 1975, The Journal of the Acoustical Society of America.

[9]  Jian Kang,et al.  Relationship Between Chinese Speech Intelligibility and Speech Transmission Index Under Reproduced General Room Conditions , 2014 .

[10]  J. S. Anderson,et al.  Some acoustical properties of St Paul's Cathedral, London , 1984 .

[11]  T. Houtgast,et al.  A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria , 1985 .

[12]  Peng Jian-xin,et al.  Chinese speech intelligibility at different speech sound pressure levels and signal-to-noise ratios in simulated classrooms , 2010 .

[13]  Jian Kang,et al.  Characteristics and prediction of sound level in extra-large spaces , 2018 .

[14]  J. S. Bradley,et al.  The intelligibility of speech in elementary school classrooms. , 2008, The Journal of the Acoustical Society of America.

[15]  J. Ahlstrom,et al.  Recognition of filtered words in noise at higher-than-normal levels: decreases in scores with and without increases in masking. , 2005, The Journal of the Acoustical Society of America.

[16]  G. Studebaker,et al.  Monosyllabic word recognition at higher-than-normal speech and noise levels. , 1999, The Journal of the Acoustical Society of America.

[17]  Herman J. M. Steeneken,et al.  Mutual dependence of the octave-band weights in predicting speech intelligibility , 1999, Speech Commun..

[18]  Herman J. M. Steeneken,et al.  Phoneme-group specific octave-band weights in predicting speech intelligibility , 2002, Speech Commun..

[19]  Herman J. M. Steeneken,et al.  Validation of the revised STIr method , 2002, Speech Commun..

[20]  Jian Kang Comparison of speech intelligibility between English and Chinese , 1998 .

[21]  Lidia Álvarez-Morales,et al.  A methodology for the study of the acoustic environment of Catholic cathedrals: Application to the Cathedral of Malaga , 2014 .

[22]  J. S. Bradley,et al.  Reverberation time and maximum background-noise level for classrooms from a comparative study of speech intelligibility metrics. , 2000, Journal of the Acoustical Society of America.

[23]  Peng Jianxin,et al.  Relationship between Chinese speech intelligibility and speech transmission index using diotic listening , 2007, Speech Commun..

[24]  J. S. Bradley,et al.  Speech intelligibility studies in classrooms. , 1986, The Journal of the Acoustical Society of America.