Effect of temporal fine structure on speech intelligibility modeling

Temporal fine structure (TFS) carries important information for the speech perception of hearing-impaired listeners and for the design of novel prosthetic hearing devices. This study assessed the performance of present intelligibility indices for predicting the intelligibility of speech containing different amount of TFS information. Speech intelligibility data was collected from vocoded and wideband Mandarin sentences containing little/partial and intact TFS information, respectively, and was then subjected to the correlation analysis with existing intelligibility indices. It was found that, though performing well in predicting the intelligibility of vocoded or wideband speech separately, present intelligibility indices were not highly correlated with the intelligibility scores when a general function was used to map all intelligibility measures to intelligibility scores. Analysis further showed that the intelligibility prediction power could be significantly improved when multiple condition-dependent functions were used for mapping intelligibility measures to intelligibility scores.

[1]  Zachary M. Smith,et al.  Chimaeric sounds reveal dichotomies in auditory perception , 2002, Nature.

[2]  James M Kates,et al.  Coherence and the speech intelligibility index. , 2004, The Journal of the Acoustical Society of America.

[3]  J. H. Steiger Tests for comparing elements of a correlation matrix. , 1980 .

[4]  G. Stickney,et al.  On the dichotomy in auditory perception between temporal envelope and fine structure cues. , 2004, The Journal of the Acoustical Society of America.

[5]  T Houtgast,et al.  A physical method for measuring speech-transmission quality. , 1980, The Journal of the Acoustical Society of America.

[6]  B. Moore The Role of Temporal Fine Structure Processing in Pitch Perception, Masking, and Speech Perception for Normal-Hearing and Hearing-Impaired People , 2008, Journal of the Association for Research in Otolaryngology.

[7]  S. Rosen Temporal information in speech: acoustic, auditory and linguistic aspects. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[8]  Brian C J Moore,et al.  Speech perception problems of the hearing impaired reflect inability to use temporal fine structure , 2006, Proceedings of the National Academy of Sciences.

[9]  Fei Chen,et al.  Predicting the intelligibility of vocoded and wideband Mandarin Chinese. , 2011, The Journal of the Acoustical Society of America.

[10]  Fei Chen,et al.  Predicting the Intelligibility of Vocoded Speech , 2011, Ear and hearing.

[11]  Bruce J. Gantz,et al.  Combining acoustic and electric hearing: Simulations and real‐patient results , 2000 .

[12]  Raymond L. Goldsworthy,et al.  Analysis of speech-based Speech Transmission Index methods with implications for nonlinear operations. , 2004, The Journal of the Acoustical Society of America.

[13]  Bruce J Gantz,et al.  Combining acoustic and electrical hearing. , 2003, The Laryngoscope.

[14]  Ying-Yee Kong,et al.  Temporal and spectral cues in Mandarin tone recognition , 2004 .

[15]  Yi Hu,et al.  Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions. , 2009, The Journal of the Acoustical Society of America.

[16]  P. Loizou Introduction to cochlear implants. , 1999, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.