Attention is required for knowledge-based sequential grouping of syllables into words

How the brain sequentially groups sensory events into temporal chunks and how this process is modulated by attention are fundamental questions in cognitive neuroscience. Sequential grouping includes bottom-up primitive grouping and top-down knowledge-based grouping. In speech perception, grouping acoustic features into syllables can rely on bottom-up acoustic continuity cues but grouping syllables into words critically relies on the listener’s lexical knowledge. This study investigates whether top-down attention is required to apply lexical knowledge to group syllables into words, by concurrently monitoring neural entrainment to syllables and words using electroencephalography (EEG). When attention is directed to a competing speech stream or cross-modally to a silent movie, neural entrainment to syllables is weakened but neural entrainment to words largely diminishes. These results strongly suggest that knowledge-based grouping of syllables into words requires top-down attention and is a bottleneck for the neural processing of unattended speech.

[1]  Wen Zhang,et al.  Time-domain analysis of neural tracking of hierarchical linguistic structures , 2017, NeuroImage.

[2]  Josh H. McDermott,et al.  Attentive Tracking of Sound Sources , 2015, Current Biology.

[3]  J. Fodor,et al.  The Modularity of Mind: An Essay on Faculty Psychology , 1984 .

[4]  Joachim Gross,et al.  Phase-Locked Responses to Speech in Human Auditory Cortex are Enhanced During Comprehension , 2012, Cerebral cortex.

[5]  R. Freyman,et al.  Effect of Priming on Energetic and Informational Masking in a Same–Different Task , 2012, Ear and hearing.

[6]  N. Mesgarani,et al.  Selective cortical representation of attended speaker in multi-talker speech perception , 2012, Nature.

[7]  D. Poeppel,et al.  Cortical Tracking of Hierarchical Linguistic Structures in Connected Speech , 2015, Nature Neuroscience.

[8]  S. Shamma,et al.  Temporal coherence and attention in auditory scene analysis , 2011, Trends in Neurosciences.

[9]  Elissa L. Newport,et al.  Segmenting nonsense: an event-related potential index of perceived onsets in continuous speech , 2002, Nature Neuroscience.

[10]  Lucia Melloni,et al.  Brain Oscillations during Spoken Sentence Processing , 2012, Journal of Cognitive Neuroscience.

[11]  M. Berger,et al.  High Gamma Power Is Phase-Locked to Theta Oscillations in Human Neocortex , 2006, Science.

[12]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[13]  S. Shamma On the role of space and time in auditory processing , 2001, Trends in Cognitive Sciences.

[14]  Marco Buiatti,et al.  Investigating the neural correlates of continuous speech computation with frequency-tagged neuroelectric responses , 2009, NeuroImage.

[15]  Michael M Merzenich,et al.  Successive-signal biasing for a learned sound sequence , 2010, Proceedings of the National Academy of Sciences.

[16]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[17]  R. K. Abolfazlian,et al.  The Cocktail Party Listener , 1994 .

[18]  G. McCarthy,et al.  Language-related field potentials in the anterior-medial temporal lobe: II. Effects of word type and semantic priming , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[19]  S. Dehaene,et al.  Unconscious Masked Priming Depends on Temporal Attention , 2002, Psychological science.

[20]  C. Eulitz,et al.  Top-down knowledge supports the retrieval of lexical information from degraded speech , 2007, Brain Research.

[21]  Antoine J. Shahin,et al.  Attentional Gain Control of Ongoing Cortical Speech Representations in a “Cocktail Party” , 2010, The Journal of Neuroscience.

[22]  Kai Lu,et al.  Temporal coherence structure rapidly shapes neuronal interactions , 2017, Nature Communications.

[23]  Yoshitaka Nakajima,et al.  Auditory Scene Analysis: The Perceptual Organization of Sound Albert S. Bregman , 1992 .

[24]  Henning Scheich,et al.  Formation of associations in auditory cortex by slow changes of tonic firing , 2011, Hearing Research.

[25]  Virginia Best,et al.  Auditory Object Formation and Selection , 2017 .

[26]  L. A. Jeffress Cerebral mechanisms in behavior : the Hixon symposium , 1951 .

[27]  S. Kelly,et al.  A supramodal accumulation-to-bound signal that determines perceptual decisions in humans , 2012, Nature Neuroscience.

[28]  Angela D. Friederici,et al.  Brain potentials indicate immediate use of prosodic cues in natural speech processing , 1999, Nature Neuroscience.

[29]  Ying-Yee Kong,et al.  Differential modulation of auditory responses to attended and unattended speech in different listening conditions , 2014, Hearing Research.

[30]  S. Dehaene,et al.  Cortical representation of the constituent structure of sentences , 2011, Proceedings of the National Academy of Sciences.

[31]  Philippe Peigneux,et al.  Auditory Magnetoencephalographic Frequency-Tagged Responses Mirror the Ongoing Segmentation Processes Underlying Statistical Learning , 2016, Brain Topography.

[32]  John J. Foxe,et al.  Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG. , 2015, Cerebral cortex.

[33]  Matthew H. Davis,et al.  Lexical Influences on Auditory Streaming , 2013, Current Biology.

[34]  Mark F. Bear,et al.  Learned spatiotemporal sequence recognition and prediction in primary visual cortex , 2014, Nature Neuroscience.

[35]  J. Fodor The Modularity of mind. An essay on faculty psychology , 1986 .

[36]  D. Poeppel,et al.  Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party” , 2013, Neuron.

[37]  Lars Meyer,et al.  Linguistic Bias Modulates Interpretation of Speech via Neural Delta-Band Oscillations , 2016, Cerebral cortex.

[38]  David Poeppel,et al.  Cortical oscillations and speech processing: emerging computational principles and operations , 2012, Nature Neuroscience.

[39]  Stanislas Dehaene,et al.  Neurophysiological dynamics of phrase-structure building during sentence processing , 2017, Proceedings of the National Academy of Sciences.

[40]  J. Simon,et al.  Emergence of neural encoding of auditory objects while listening to competing speakers , 2012, Proceedings of the National Academy of Sciences.

[41]  Luc H. Arnal,et al.  Cortical oscillations and sensory predictions , 2012, Trends in Cognitive Sciences.

[42]  A. Friederici Towards a neural basis of auditory sentence processing , 2002, Trends in Cognitive Sciences.

[43]  M. Mishkin,et al.  Early stages of melody processing: stimulus-sequence and task-dependent neuronal activity in monkey auditory cortical fields A1 and R. , 2008, Journal of neurophysiology.

[44]  U. Goswami,et al.  Speech rhythm and temporal structure: Converging perspectives? , 2013 .

[45]  K. Lashley The problem of serial order in behavior , 1951 .

[46]  Josh H McDermott,et al.  Recovering sound sources from embedded repetition , 2011, Proceedings of the National Academy of Sciences.

[47]  S. David,et al.  Does attention play a role in dynamic receptive field adaptation to changing acoustic salience in A1? , 2007, Hearing Research.

[48]  R. Näätänen,et al.  The mismatch negativity (MMN) in basic research of central auditory processing: A review , 2007, Clinical Neurophysiology.

[49]  J. Fodor The Modularity of mind. An essay on faculty psychology , 1986 .

[50]  I. Winkler,et al.  The role of attention in the formation of auditory streams , 2007, Perception & psychophysics.

[51]  Andrea E Martin,et al.  A mechanism for the cortical computation of hierarchical linguistic structure , 2017, PLoS biology.

[52]  T W Picton,et al.  Human auditory steady-state evoked potentials during selective attention. , 1987, Electroencephalography and clinical neurophysiology.

[53]  Keith Johnson,et al.  Phonetic Feature Encoding in Human Superior Temporal Gyrus , 2014, Science.

[54]  Anne Cutler,et al.  Native Listening: Language Experience and the Recognition of Spoken Words , 2012 .

[55]  C. Schroeder,et al.  Low-frequency neuronal oscillations as instruments of sensory selection , 2009, Trends in Neurosciences.

[56]  György Buzsáki,et al.  Neural Syntax: Cell Assemblies, Synapsembles, and Readers , 2010, Neuron.

[57]  Jonathan Z. Simon,et al.  The Auditory System at the Cocktail Party , 2017 .

[58]  Karl J. Friston,et al.  Brain responses in humans reveal ideal observer-like sensitivity to complex acoustic patterns , 2016, Proceedings of the National Academy of Sciences.

[59]  Albert S. Bregman,et al.  The Auditory Scene. (Book Reviews: Auditory Scene Analysis. The Perceptual Organization of Sound.) , 1990 .

[60]  David Poeppel,et al.  Visual speech speeds up the neural processing of auditory speech. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[61]  Ankoor S. Shah,et al.  An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. , 2005, Journal of neurophysiology.

[62]  Michael F. Bunting,et al.  The cocktail party phenomenon revisited: The importance of working memory capacity , 2001, Psychonomic bulletin & review.

[63]  B. Shinn-Cunningham Object-based auditory and visual attention , 2008, Trends in Cognitive Sciences.

[64]  Edmund C. Lalor,et al.  Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing , 2015, Current Biology.

[65]  J. Lisman,et al.  The Theta-Gamma Neural Code , 2013, Neuron.

[66]  M. Kutas,et al.  Semantic processing and memory for attended and unattended words in dichotic listening: behavioral and electrophysiological evidence. , 1995, Journal of experimental psychology. Human perception and performance.

[67]  J. Rauschecker,et al.  Perceptual Organization of Tone Sequences in the Auditory Cortex of Awake Macaques , 2005, Neuron.

[68]  E. Vogel,et al.  Word meanings can be accessed but not reported during the attentional blink , 1996, Nature.

[69]  Gregor Thut,et al.  Lip movements entrain the observers’ low-frequency brain oscillations to facilitate speech intelligibility , 2016, eLife.

[70]  Terence W. Picton,et al.  Effects of Attention on Neuroelectric Correlates of Auditory Stream Segregation , 2006, Journal of Cognitive Neuroscience.

[71]  Jessica M. Foxton,et al.  Effects of attention and unilateral neglect on auditory stream segregation. , 2001, Journal of experimental psychology. Human perception and performance.

[72]  N. Cowan,et al.  The cocktail party phenomenon revisited: how frequent are attention shifts to one's name in an irrelevant auditory channel? , 1995, Journal of experimental psychology. Learning, memory, and cognition.

[73]  Kirill V. Nourski,et al.  Representation of speech in human auditory cortex: Is it special? , 2013, Hearing Research.

[74]  E. C. Cherry Some Experiments on the Recognition of Speech, with One and with Two Ears , 1953 .

[75]  W. Luh,et al.  Abstract linguistic structure correlates with temporal activity during naturalistic comprehension , 2016, Brain and Language.