Speech Identification and Comprehension in the Urban Soundscape

Urban environments are characterised by the presence of copious and unstructured noise. This noise continuously challenges speech intelligibility both in normal-hearing and hearing-impaired individuals. In this paper, we investigate the impact of urban noise, such as traffic, on speech identification and, more generally, speech understanding. With this purpose, we perform listening experiments to evaluate the ability of individuals with normal hearing to detect words and interpret conversational speech in the presence of urban noise (e.g., street drilling, traffic jams). Our experiments confirm previous findings in different acoustic environments and demonstrate that speech identification is influenced by the similarity between the target speech and the masking noise also in urban scenarios. More specifically, we propose the use of the structural similarity index to quantify this similarity. Our analysis confirms that speech identification is more successful in presence of noise with tempo-spectral characteristics different from speech. Moreover, our results show that speech comprehension is not as challenging as word identification in urban sound environments that are characterised by the presence of severe noise. Indeed, our experiments demonstrate that speech comprehension can be fairly successful even in acoustic scenes where the ability to identify speech is highly reduced.

[1]  A. Zekveld,et al.  Cognitive Load During Speech Perception in Noise: The Influence of Age, Hearing Loss, and Cognition on the Pupil Response , 2011, Ear and hearing.

[2]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[3]  Rob Drullman,et al.  Speech perception and talker segregation: effects of level, pitch, and tactile support with multiple simultaneous talkers. , 2004, The Journal of the Acoustical Society of America.

[4]  Justin Salamon,et al.  A Dataset and Taxonomy for Urban Sound Research , 2014, ACM Multimedia.

[5]  Gijsbert Stoet,et al.  PsyToolkit: A software package for programming psychological experiments using Linux , 2010, Behavior research methods.

[6]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[7]  Martin Cooke,et al.  A glimpsing model of speech perception in noise. , 2006, The Journal of the Acoustical Society of America.

[8]  P. Warren,et al.  Urban bioacoustics: it's not just noise , 2006, Animal Behaviour.

[9]  Björn Lidestam,et al.  Comparison of informational vs. energetic masking effects on speechreading performance , 2014, Front. Psychol..

[10]  N. Kraus,et al.  Training to improve hearing speech in noise: biological mechanisms. , 2012, Cerebral cortex.

[11]  Alireza Nasiri Avanaki,et al.  Exact global histogram specification optimized for structural similarity , 2008, 0901.0065.

[12]  Jian Kang,et al.  Soundscape and Sound Preferences in Urban Squares: A Case Study in Sheffield , 2005 .

[13]  Lina R. Kubli,et al.  Development of a test battery for evaluating speech perception in complex listening environments. , 2014, The Journal of the Acoustical Society of America.

[14]  William J. Davies,et al.  Measuring and mapping soundscape speech intelligibility , 2009 .

[15]  Gitte Keidser Introduction to Special Issue: Towards Ecologically Valid Protocols for the Assessment of Hearing and Hearing Devices. , 2016, Journal of the American Academy of Audiology.

[16]  Jian Kang,et al.  Acoustic comfort evaluation in urban open public spaces , 2005 .

[17]  I. Toshio An optimal auditory filter , 1995, Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Accoustics.

[18]  Deliang Wang,et al.  Role of mask pattern in intelligibility of ideal binary-masked noisy speech. , 2009, The Journal of the Acoustical Society of America.

[19]  Brian C J Moore,et al.  The importance for speech intelligibility of random fluctuations in "steady" background noise. , 2011, The Journal of the Acoustical Society of America.

[20]  G. A. Miller,et al.  The intelligibility of speech as a function of the context of the test materials. , 1951, Journal of experimental psychology.

[21]  Ingmar Posner,et al.  Leveraging the urban soundscape: Auditory perception for smart vehicles , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[22]  Richard F. Lyon,et al.  History and future of auditory filter models , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[23]  Jesper Jensen,et al.  An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[24]  Lars Kai Hansen,et al.  The Role of Top-Down Attention in the Cocktail Party: Revisiting Cherry's Experiment after Sixty Years , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[25]  Jon Barker,et al.  The foreign language cocktail party problem: Energetic and informational masking effects in non-native speech perception. , 2008, The Journal of the Acoustical Society of America.

[26]  S. Stansfeld,et al.  Noise and Health in the Urban Environment , 2000, Reviews on environmental health.

[27]  H Levitt,et al.  Noise reduction in hearing aids: a review. , 2001, Journal of rehabilitation research and development.

[28]  V. Woisard,et al.  Relationship Between Speech Intelligibility and Speech Comprehension in Babble Noise. , 2015, Journal of speech, language, and hearing research : JSLHR.

[29]  Susan Scollie,et al.  Semantic context improves speech intelligibility and reduces listening effort for listeners with hearing impairment , 2018, International journal of audiology.

[30]  Robert W. Heath,et al.  SSIM-optimal linear image restoration , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[31]  M. Cooke,et al.  Effect of masker type on native and non-native consonant perception in noise. , 2006, The Journal of the Acoustical Society of America.

[32]  A. Tjønneland,et al.  Long-Term Exposure to Road Traffic Noise and Incident Diabetes: A Cohort Study , 2012, Environmental health perspectives.

[33]  Gijsbert Stoet,et al.  PsyToolkit: A Novel Web-Based Method for Running Online Questionnaires and Reaction-Time Experiments , 2017 .

[34]  Richard L. Neitzel,et al.  Environmental Noise Pollution in the United States: Developing an Effective Public Health Response , 2013, Environmental health perspectives.

[35]  S. Kochkin MarkeTrak V: “Why my hearing aids are in the drawer” The consumers' perspective , 2000 .

[36]  Xenofon Fafoutis,et al.  A Behavioral Study on the Effects of Rock Music on Auditory Attention , 2013, HBU.

[37]  Edward R. Vrscay,et al.  SSIM-inspired image denoising using sparse representations , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[38]  W. Passchier,et al.  Noise exposure and public health. , 2000, Environmental health perspectives.

[39]  Xenofon Fafoutis,et al.  On cross-language consonant identification in second language noise. , 2015, The Journal of the Acoustical Society of America.

[40]  Pasquale Bottalico,et al.  Subjective and objective speech intelligibility investigations in primary school classrooms. , 2012, The Journal of the Acoustical Society of America.

[41]  Matthew H. Davis,et al.  Hierarchical Processing in Spoken Language Comprehension , 2003, The Journal of Neuroscience.

[42]  Jörn Anemüller,et al.  Automatic acoustic siren detection in traffic noise by part-based models , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[43]  Kristin J. Van Engen,et al.  Sentence recognition in native- and foreign-language multi-talker background noise. , 2007, The Journal of the Acoustical Society of America.

[44]  Enrico Del Re,et al.  A real-time siren detector to improve safety of guide in traffic environment , 2008, 2008 16th European Signal Processing Conference.

[45]  Anna Warzybok,et al.  The multilingual matrix test: Principles, applications, and comparison across languages: A review , 2015, International journal of audiology.