Listening in complex acoustic scenes

Being able to pick out particular sounds, such as speech, against a background of other sounds represents one of the key tasks performed by the auditory system. Understanding how this happens is important because speech recognition in noise is particularly challenging for older listeners and for people with hearing impairments. Central to this ability is the capacity of neurons to adapt to the statistics of sounds reaching the ears, which helps to generate noise-tolerant representations of sounds in the brain. In more complex auditory scenes, such as a cocktail party — where the background noise comprises other voices, sound features associated with each source have to be grouped together and segregated from those belonging to other sources. This depends on precise temporal coding and modulation of cortical response properties when attending to a particular speaker in a multi-talker environment. Furthermore, the neural processing underlying auditory scene analysis is shaped by experience over multiple timescales.

[1]  W. H. Sumby,et al.  Visual contribution to speech intelligibility in noise , 1954 .

[2]  Jonathan Z. Simon,et al.  The Auditory System at the Cocktail Party , 2017 .

[3]  Neil C. Rabinowitz,et al.  Constructing Noise-Invariant Representations of Sound in the Auditory Pathway , 2013, PLoS biology.

[4]  Barbara G Shinn-Cunningham,et al.  Normal hearing is not enough to guarantee robust encoding of suprathreshold features important in everyday communication , 2011, Proceedings of the National Academy of Sciences.

[5]  Josh H McDermott,et al.  Ecological origins of perceptual grouping principles in the auditory system , 2019, Proceedings of the National Academy of Sciences.

[6]  Nai Ding,et al.  Prior Knowledge Guides Speech Segregation in Human Auditory Cortex , 2019, Cerebral cortex.

[7]  Enrique A. Lopez-Poveda,et al.  The Influence of Cochlear Mechanical Dysfunction, Temporal Processing Deficits, and Age on the Intelligibility of Audible Speech in Noise for Hearing-Impaired Listeners , 2016, Trends in hearing.

[8]  B. Willmore,et al.  Contrast gain control occurs independently of both parvalbumin-positive interneuron activity and shunting inhibition in auditory cortex , 2020, Journal of neurophysiology.

[9]  Jan Rennies,et al.  Energetic and Informational Components of Speech-on-Speech Masking in Binaural Speech Intelligibility and Perceived Listening Effort , 2019, Trends in hearing.

[10]  B J Malone,et al.  Background noise exerts diverse effects on the cortical encoding of foreground sounds. , 2017, Journal of neurophysiology.

[11]  Josh H McDermott,et al.  Schema learning for the cocktail party problem , 2018, Proceedings of the National Academy of Sciences.

[12]  Matthew S. Tata,et al.  Theta-band phase tracking in the two-talker problem , 2014, Brain and Language.

[13]  Kenneth E Hancock,et al.  Electrophysiological markers of cochlear function correlate with hearing-in-noise performance among audiometrically normal subjects. , 2020, Journal of neurophysiology.

[14]  Matthias H Hennig,et al.  The Sound of Silence: Ionic Mechanisms Encoding Sound Termination , 2011, Neuron.

[15]  Giancarlo Valente,et al.  Cortical tracking of multiple streams outside the focus of attention in naturalistic auditory scenes , 2018, NeuroImage.

[16]  Hideki Kawahara,et al.  Inharmonic speech reveals the role of harmonicity in the cocktail party problem , 2018, Nature Communications.

[17]  Alan R Palmer,et al.  Responses in the inferior colliculus of the guinea pig to concurrent harmonic series and the effect of inactivation of descending controls. , 2010, Journal of neurophysiology.

[18]  Johannes C. Dahmen,et al.  Adaptation to Stimulus Statistics in the Perception and Neural Representation of Auditory Space , 2010, Neuron.

[19]  B. Grothe,et al.  Cooperative population coding facilitates efficient sound-source separability by adaptation to input statistics , 2019, PLoS biology.

[20]  Brian C J Moore,et al.  Using Acoustic Information to Perceive Room Size: Effects of Blindness, Room Reverberation Time, and Stimulus , 2013, Perception.

[21]  Max F. K. Happel,et al.  A Role for Auditory Corticothalamic Feedback in the Perception of Complex Sounds , 2017, The Journal of Neuroscience.

[22]  B. Delgutte,et al.  Dynamic Range Adaptation to Sound Level Statistics in the Auditory Nerve , 2009, The Journal of Neuroscience.

[23]  Paul V. Watkins,et al.  Specialized neuronal adaptation for preserving input sensitivity , 2008, Nature Neuroscience.

[24]  William A. Ainsworth,et al.  Effects of preceding noise on the perception of voiced plosives , 2005 .

[25]  Nima Mesgarani,et al.  Speaker-independent auditory attention decoding without access to clean speech sources , 2019, Science Advances.

[26]  Jon Barker,et al.  The foreign language cocktail party problem: Energetic and informational masking effects in non-native speech perception. , 2008, The Journal of the Acoustical Society of America.

[27]  Bahar Khalighinejad,et al.  Adaptation of the human auditory cortex to changing background noise , 2019, Nature Communications.

[28]  Josh H McDermott,et al.  Statistics of natural reverberation enable perceptual separation of sound and space , 2016, Proceedings of the National Academy of Sciences.

[29]  J. Linden,et al.  Mind the Gap: Two Dissociable Mechanisms of Temporal Processing in the Auditory System , 2016, The Journal of Neuroscience.

[30]  Nima Mesgarani,et al.  Joint Representation of Spatial and Phonetic Features in the Human Core Auditory Cortex. , 2018, Cell reports.

[31]  Bruce A. Schneider,et al.  Does the degree of linguistic experience (native versus nonnative) modulate the degree to which listeners can benefit from a delay between the onset of the maskers and the onset of the target speech? , 2016, Hearing Research.

[32]  Robert J. Zatorre,et al.  Neural Correlates of Early Sound Encoding and their Relationship to Speech-in-Noise Perception , 2017, Front. Neurosci..

[33]  Mitchell Steinschneider,et al.  Neural Representation of Concurrent Harmonic Sounds in Monkey Primary Auditory Cortex: Implications for Models of Auditory Scene Analysis , 2014, The Journal of Neuroscience.

[34]  S. Mishra,et al.  The role of medial efferents in human auditory development: Efferent inhibition predicts frequency discrimination in noise for children. , 2020, Journal of neurophysiology.

[35]  Jonathan Z. Simon,et al.  Adaptive Temporal Encoding Leads to a Background-Insensitive Cortical Representation of Speech , 2013, The Journal of Neuroscience.

[36]  Emily J. Allen,et al.  Representations of Pitch and Timbre Variation in Human Auditory Cortex , 2017, The Journal of Neuroscience.

[37]  Aïcha Bouzid,et al.  A comparison of several computational auditory scene analysis (CASA) techniques for monaural speech segregation , 2015, Brain Informatics.

[38]  S. Shamma,et al.  Temporal coherence and attention in auditory scene analysis , 2011, Trends in Neurosciences.

[39]  M. Ericson,et al.  Informational and energetic masking effects in the perception of multiple simultaneous talkers. , 2001, The Journal of the Acoustical Society of America.

[40]  Barbara G. Shinn-Cunningham,et al.  Influence of Task-Relevant and Task-Irrelevant Feature Continuity on Selective Auditory Attention , 2012, Journal of the Association for Research in Otolaryngology.

[41]  E. C. Cherry Some Experiments on the Recognition of Speech, with One and with Two Ears , 1953 .

[42]  B. Willmore,et al.  Contrast gain control in mouse auditory cortex , 2018, Journal of neurophysiology.

[43]  N. Harper,et al.  Meta-adaptation in the auditory midbrain under cortical influence , 2016, Nature Communications.

[44]  A. Zador,et al.  Balanced inhibition underlies tuning and sharpens spike timing in auditory cortex , 2003, Nature.

[45]  Stuart Rosen,et al.  The neural processing of masked speech: evidence for different mechanisms in the left and right temporal lobes. , 2009, The Journal of the Acoustical Society of America.

[46]  Li I. Zhang,et al.  Topography and synaptic shaping of direction selectivity in primary auditory cortex , 2003, Nature.

[47]  Claude Alain,et al.  Representation of concurrent acoustic objects in primary auditory cortex. , 2004, The Journal of the Acoustical Society of America.

[48]  C. Darwin,et al.  The Quarterly Journal of Experimental Psychology Section a Human Experimental Psychology Perceptual Grouping of Speech Components Differing in Fundamental Frequency and Onset-time Perceptual Grouping of Speech Components Differing in Fundamental Frequency and Onset-time , 2022 .

[49]  Alon Fishbach,et al.  Neural model for physiological responses to frequency and amplitude transitions uncovers topographical order in the auditory cortex. , 2003, Journal of neurophysiology.

[50]  M. Sachs,et al.  Effect of electrical stimulation of the crossed olivocochlear bundle on auditory nerve response to tones in noise. , 1987, Journal of neurophysiology.

[51]  P. Chadderton,et al.  Comodulation Enhances Signal Detection via Priming of Auditory Cortical Circuits , 2016, The Journal of Neuroscience.

[52]  Yang Zhang,et al.  Investigating Influences of Medial Olivocochlear Efferent System on Central Auditory Processing and Listening in Noise: A Behavioral and Event-Related Potential Study , 2020, Brain sciences.

[53]  H. Voigt,et al.  Acoustic and current-pulse responses of identified neurons in the dorsal cochlear nucleus of unanesthetized, decerebrate gerbils. , 1999, Journal of neurophysiology.

[54]  J. Simon,et al.  Emergence of neural encoding of auditory objects while listening to competing speakers , 2012, Proceedings of the National Academy of Sciences.

[55]  B. Willmore,et al.  Hearing in noisy environments: noise invariance and contrast gain control , 2014, The Journal of physiology.

[56]  Enrique A. Lopez-Poveda,et al.  Adaptation to Noise in Human Speech Recognition Depends on Noise-Level Statistics and Fast Dynamic-Range Compression , 2020, The Journal of Neuroscience.

[57]  Shanelle Canavan,et al.  The near non-existence of "pure" energetic masking release for speech: Extension to spectro-temporal modulation and glimpsing. , 2016, The Journal of the Acoustical Society of America.

[58]  Francesca Rocchi,et al.  Neuronal adaptation to sound statistics in the inferior colliculus of behaving macaques does not reduce the effectiveness of the masking noise. , 2018, Journal of neurophysiology.

[59]  Ryan G. Natan,et al.  Cortical Interneurons Differentially Shape Frequency Tuning following Adaptation. , 2017, Cell reports.

[60]  Timothy D. Griffiths,et al.  ‘Normal’ hearing thresholds and fundamental auditory grouping processes predict difficulties with speech-in-noise perception , 2019, Scientific Reports.

[61]  I. Johnsrude,et al.  Aging Affects Adaptation to Sound-Level Statistics in Human Auditory Cortex , 2018, The Journal of Neuroscience.

[62]  N. Mesgarani,et al.  Selective cortical representation of attended speaker in multi-talker speech perception , 2012, Nature.

[63]  Stephen V. David,et al.  Mechanisms of noise robust representation of speech in primary auditory cortex , 2014, Proceedings of the National Academy of Sciences.

[64]  M. Wojtczak,et al.  Exploring the Role of Medial Olivocochlear Efferents on the Detection of Amplitude Modulation for Tones Presented in Noise , 2019, Journal of the Association for Research in Otolaryngology.

[65]  B. Moore,et al.  Thresholds for hearing mistuned partials as separate tones in harmonic complexes. , 1986, The Journal of the Acoustical Society of America.

[66]  Christoph E Schreiner,et al.  Effects of Signal-to-Noise Ratio on Auditory Cortical Frequency Processing , 2016, The Journal of Neuroscience.

[67]  Christian Lorenzi,et al.  Noise-Sensitive But More Precise Subcortical Representations Coexist with Robust Cortical Encoding of Natural Vocalizations , 2020, The Journal of Neuroscience.

[68]  Nima Mesgarani,et al.  Hierarchical Encoding of Attended Auditory Objects in Multi-talker Speech Perception , 2019, Neuron.

[69]  N. Kraus,et al.  Training to improve hearing speech in noise: biological mechanisms. , 2012, Cerebral cortex.

[70]  Kenneth E. Hancock,et al.  Audiomotor Perceptual Training Enhances Speech Intelligibility in Background Noise , 2017, Current Biology.

[71]  Almudena Eustaquio-Martín,et al.  Adaptation to Noise in Human Speech Recognition Unrelated to the Medial Olivocochlear Reflex , 2018, The Journal of Neuroscience.

[72]  Claude Alain,et al.  Turning down the noise: The benefit of musical training on the aging auditory brain , 2014, Hearing Research.

[73]  Tetsuaki Kawase,et al.  Effects of contralateral noise on the measurement of auditory threshold. , 2003, The Tohoku journal of experimental medicine.

[74]  Wiktor Mlynarski,et al.  Ecological origins of perceptual grouping principles in the auditory system , 2019, Proceedings of the National Academy of Sciences.

[75]  Adrian K. C. Lee,et al.  Integration of Visual Information in Auditory Cortex Promotes Auditory Scene Analysis through Multisensory Binding , 2017, Neuron.

[76]  Tammo Houtgast,et al.  Auditory distance perception in rooms , 1999, Nature.

[77]  Timothy D. Griffiths,et al.  ‘Normal’ hearing thresholds and fundamental auditory grouping processes predict difficulties with speech-in-noise perception , 2019, Scientific Reports.

[78]  J. Simon,et al.  Neural coding of continuous speech in auditory cortex during monaural and dichotic listening. , 2012, Journal of neurophysiology.

[79]  Neil C. Rabinowitz,et al.  Contrast Gain Control in Auditory Cortex , 2011, Neuron.

[80]  Josh H McDermott,et al.  Invariance to background noise as a signature of non-primary auditory cortex , 2019, Nature Communications.

[81]  B. Shinn-Cunningham Cortical and Sensory Causes of Individual Differences in Selective Attention Ability Among Listeners With Normal Hearing Thresholds. , 2017, Journal of speech, language, and hearing research : JSLHR.

[82]  I. Dean,et al.  Neural population coding of sound level adapts to stimulus statistics , 2005, Nature Neuroscience.

[83]  M. Wehr,et al.  Nonoverlapping Sets of Synapses Drive On Responses and Off Responses in Auditory Cortex , 2010, Neuron.

[84]  Kerry M. M. Walker,et al.  Multiplexed and Robust Representations of Sound Features in Auditory Cortex , 2011, The Journal of Neuroscience.

[85]  Andrew J. King,et al.  Neural circuits underlying auditory contrast gain control and their perceptual implications , 2020, Nature Communications.

[86]  Gregory B. Cogan,et al.  Visual Input Enhances Selective Speech Envelope Tracking in Auditory Cortex at a “Cocktail Party” , 2013, The Journal of Neuroscience.

[87]  Christoph E Schreiner,et al.  Auditory Cortical Plasticity Dependent on Environmental Noise Statistics , 2020, Cell reports.

[88]  Mari Nakamura,et al.  White Noise Background Improves Tone Discrimination by Suppressing Cortical Tuning Curves. , 2019, Cell reports.