A corpus of noise-induced word misperceptions for English.

Words spoken against a noise background often form an ambiguous percept. However, in certain conditions, a listener will mishear a noisy word but report hearing the same incorrect word as reported by other listeners. These consistent hearing errors are valuable as tests of detailed models of speech perception. This paper describes the collection of a corpus of consistent speech misperceptions for English. The mishearings were elicited using a large scale listening study involving 212 participants and over 300 000 token presentations. The study led to the identification of 3207 consistent misperceptions. For each of these, the corpus records the speech and masker waveforms that generated the error, the set of responses made by the listeners, and phonemic transcriptions of the target word and the response. The corpus is freely available online.

[1]  Marc Brysbaert,et al.  Subtlex-UK: A New and Improved Word Frequency Database for British English , 2014, Quarterly journal of experimental psychology.

[2]  María Luisa García Lecumberri,et al.  A corpus of noise-induced word misperceptions for Spanish. , 2015, The Journal of the Acoustical Society of America.

[3]  Masoud Geravanchizadeh,et al.  Microscopic prediction of speech intelligibility in spatially distributed speech-shaped noise for normal-hearing listeners. , 2015, The Journal of the Acoustical Society of America.

[4]  Martin Cooke,et al.  A glimpsing model of speech perception in noise. , 2006, The Journal of the Acoustical Society of America.

[5]  Rainer Martin,et al.  Objective Intelligibility Measures Based on Mutual Information for Speech Subjected to Speech Enhancement Processing , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[6]  Jesper Jensen,et al.  A short-time objective intelligibility measure for time-frequency weighted noisy speech , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  François Christophe Egidio Pellegrino,et al.  Across-Language Perspective on Speech Information Rate , 2011 .

[8]  T. Brand,et al.  Microscopic prediction of speech recognition for listeners with normal hearing in noise using an auditory model. , 2009, The Journal of the Acoustical Society of America.

[9]  K. S. Rhebergen,et al.  A Speech Intelligibility Index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners. , 2005, The Journal of the Acoustical Society of America.