The Role of High-Level Processes for Oscillatory Phase Entrainment to Speech Sound

Constantly bombarded with input, the brain has the need to filter out relevant information while ignoring the irrelevant rest. A powerful tool may be represented by neural oscillations which entrain their high-excitability phase to important input while their low-excitability phase attenuates irrelevant information. Indeed, the alignment between brain oscillations and speech improves intelligibility and helps dissociating speakers during a “cocktail party”. Although well-investigated, the contribution of low- and high-level processes to phase entrainment to speech sound has only recently begun to be understood. Here, we review those findings, and concentrate on three main results: (1) Phase entrainment to speech sound is modulated by attention or predictions, likely supported by top-down signals and indicating higher-level processes involved in the brain’s adjustment to speech. (2) As phase entrainment to speech can be observed without systematic fluctuations in sound amplitude or spectral content, it does not only reflect a passive steady-state “ringing” of the cochlea, but entails a higher-level process. (3) The role of intelligibility for phase entrainment is debated. Recent results suggest that intelligibility modulates the behavioral consequences of entrainment, rather than directly affecting the strength of entrainment in auditory regions. We conclude that phase entrainment to speech reflects a sophisticated mechanism: several high-level processes interact to optimally align neural oscillations with predicted events of high relevance, even when they are hidden in a continuous stream of background noise.

[1]  Anne Hauswald,et al.  Investigating ongoing brain oscillations and their influence on conscious perception – network states and the window to consciousness , 2014, Front. Psychol..

[2]  D. Poeppel,et al.  Auditory Cortex Tracks Both Auditory and Visual Stimulus Dynamics Using Low-Frequency Neuronal Phase Modulation , 2010, PLoS biology.

[3]  Rufin VanRullen,et al.  On the cyclic nature of perception in vision versus audition , 2014, Philosophical Transactions of the Royal Society B: Biological Sciences.

[4]  S. Makeig,et al.  A 40-Hz auditory potential recorded from the human scalp. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[5]  John J. Foxe,et al.  Ready, Set, Reset: Stimulus-Locked Periodicity in Behavioral Performance Demonstrates the Consequences of Cross-Sensory Phase Reset , 2011, The Journal of Neuroscience.

[6]  B. Ross,et al.  Internalized Timing of Isochronous Sounds Is Represented in Neuromagnetic Beta Oscillations , 2012, The Journal of Neuroscience.

[7]  Rufin VanRullen,et al.  Selective Perceptual Phase Entrainment to Speech Rhythm in the Absence of Spectral Energy Fluctuations , 2015, The Journal of Neuroscience.

[8]  John J. Foxe,et al.  Oscillatory Sensory Selection Mechanisms during Intersensory Attention to Rhythmic Auditory and Visual Inputs: A Human Electrocorticographic Investigation , 2011, The Journal of Neuroscience.

[9]  David Poeppel,et al.  Acoustic landmarks drive delta–theta oscillations to enable speech comprehension by facilitating perceptual parsing , 2014, NeuroImage.

[10]  Michael J. Crosse,et al.  Congruent Visual Speech Enhances Cortical Entrainment to Continuous Auditory Speech in Noise-Free Conditions , 2015, The Journal of Neuroscience.

[11]  Charles E. Schroeder,et al.  Motor contributions to the temporal precision of auditory attention , 2014, Nature Communications.

[12]  Matthew H. Davis,et al.  Neural Oscillations Carry Speech Rhythm through to Comprehension , 2012, Front. Psychology.

[13]  David Poeppel,et al.  Cortical oscillations and speech processing: emerging computational principles and operations , 2012, Nature Neuroscience.

[14]  P. Schyns,et al.  Speech Rhythms and Multiplexed Oscillatory Sensory Coding in the Human Brain , 2013, PLoS biology.

[15]  C. Gilbert,et al.  Top-down influences on visual processing , 2013, Nature Reviews Neuroscience.

[16]  Christian Keitel,et al.  Stimulus-Driven Brain Oscillations in the Alpha Range: Entrainment of Intrinsic Rhythms or Frequency-Following Response? , 2014, The Journal of Neuroscience.

[17]  E. C. Cmm,et al.  on the Recognition of Speech, with , 2008 .

[18]  Gregory B. Cogan,et al.  Visual Input Enhances Selective Speech Envelope Tracking in Auditory Cortex at a “Cocktail Party” , 2013, The Journal of Neuroscience.

[19]  Jonathan Z Simon,et al.  The encoding of auditory objects in auditory cortex: insights from magnetoencephalography. , 2015, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[20]  Luc H. Arnal,et al.  Transitions in neural oscillations reflect prediction errors generated in audiovisual speech , 2011, Nature Neuroscience.

[21]  Sylvie Nozaradan,et al.  Exploring how musical rhythm entrains brain activity with electroencephalogram frequency-tagging , 2014, Philosophical Transactions of the Royal Society B: Biological Sciences.

[22]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[23]  David Poeppel,et al.  The effects of selective attention and speech acoustics on neural speech-tracking in a multi-talker scene , 2015, Cortex.

[24]  R. Romo,et al.  Conversion of sensory signals into perceptual decisions , 2013, Progress in Neurobiology.

[25]  Ramesh Srinivasan,et al.  Suppression of competing speech through entrainment of cortical oscillations. , 2013, Journal of neurophysiology.

[26]  A. Puce,et al.  Neuronal oscillations and visual amplification of speech , 2008, Trends in Cognitive Sciences.

[27]  J. Simon,et al.  Cortical entrainment to continuous speech: functional roles and interpretations , 2014, Front. Hum. Neurosci..

[28]  J. Obleser,et al.  Frequency modulation entrains slow neural oscillations and optimizes human listening behavior , 2012, Proceedings of the National Academy of Sciences.

[29]  P. Heil,et al.  Detection of Near-Threshold Sounds is Independent of EEG Phase in Common Frequency Bands , 2013, Front. Psychol..

[30]  Gregory Hickok,et al.  The Rhythm of Perception , 2015, Psychological science.

[31]  J. Rauschecker,et al.  Phoneme and word recognition in the auditory ventral stream , 2012, Proceedings of the National Academy of Sciences.

[32]  Jonathan Z. Simon,et al.  Adaptive Temporal Encoding Leads to a Background-Insensitive Cortical Representation of Speech , 2013, The Journal of Neuroscience.

[33]  J. Simon,et al.  Emergence of neural encoding of auditory objects while listening to competing speakers , 2012, Proceedings of the National Academy of Sciences.

[34]  Christoph Kayser,et al.  Natural asynchronies in audiovisual communication signals regulate neuronal multisensory interactions in voice-sensitive cortex , 2014, Proceedings of the National Academy of Sciences.

[35]  Robin A. A. Ince,et al.  Frontal Top-Down Signals Increase Coupling of Auditory Low-Frequency Oscillations to Continuous Speech in Human Listeners , 2015, Current Biology.

[36]  Peter Lakatos,et al.  Dynamics of Active Sensing and perceptual selection , 2010, Current Opinion in Neurobiology.

[37]  P. Schyns,et al.  Entrainment of Perceptually Relevant Brain Oscillations by Non-Invasive Rhythmic Stimulation of the Human Brain , 2011, Front. Psychology.

[38]  Huan Luo,et al.  Behavioral Oscillations in Attention: Rhythmic α Pulses Mediated through θ Band , 2014, The Journal of Neuroscience.

[39]  Schreiber,et al.  Measuring information transfer , 2000, Physical review letters.

[40]  J. Obleser,et al.  Entrained neural oscillations in multiple frequency bands comodulate behavior , 2014, Proceedings of the National Academy of Sciences.

[41]  E. T. Possing,et al.  Human temporal lobe activation by speech and nonspeech sounds. , 2000, Cerebral cortex.

[42]  S. Huettel,et al.  The functional neuroanatomy of decision making: Prefrontal control of thought and action , 2012, Brain Research.

[43]  C. Schroeder,et al.  Neuronal oscillations as a mechanistic substrate of auditory temporal prediction , 2015, Annals of the New York Academy of Sciences.

[44]  Stuart Rosen,et al.  Spectral and temporal cues to pitch in noise-excited vocoder simulations of continuous-interleaved-sampling cochlear implants. , 2002, The Journal of the Acoustical Society of America.

[45]  E Ahissar,et al.  Speech comprehension is correlated with temporal response patterns recorded from auditory cortex , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[46]  J. Simon,et al.  Neural coding of continuous speech in auditory cortex during monaural and dichotic listening. , 2012, Journal of neurophysiology.

[47]  C. Schroeder,et al.  The Spectrotemporal Filter Mechanism of Auditory Selective Attention , 2013, Neuron.

[48]  David Poeppel,et al.  Visual speech speeds up the neural processing of auditory speech. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Jonathan Z. Simon,et al.  Robust cortical entrainment to the speech envelope relies on the spectro-temporal fine structure , 2014, NeuroImage.

[50]  Oded Ghitza,et al.  The theta-syllable: a unit of speech information defined by cortical function , 2013, Front. Psychol..

[51]  C. Braun,et al.  Prestimulus oscillatory power and connectivity patterns predispose conscious somatosensory perception , 2014, Proceedings of the National Academy of Sciences.

[52]  D. Poeppel,et al.  Temporal context in speech processing and attentional stream selection: A behavioral and neural perspective , 2012, Brain and Language.

[53]  D. Poeppel,et al.  Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party” , 2013, Neuron.

[54]  P. Fries A mechanism for cognitive dynamics: neuronal communication through neuronal coherence , 2005, Trends in Cognitive Sciences.

[55]  S. Kastner,et al.  Attention in the real world: toward understanding its neural basis , 2014, Trends in Cognitive Sciences.

[56]  G. Buzsáki,et al.  Neuronal Oscillations in Cortical Networks , 2004, Science.

[57]  Benedikt Zoefel,et al.  Investigating the Rhythm of Attention on a Fine-Grained Scale: Evidence from Reaction Times , 2014, The Journal of Neuroscience.

[58]  Joachim Gross,et al.  Phase-Locked Responses to Speech in Human Auditory Cortex are Enhanced During Comprehension , 2012, Cerebral cortex.

[59]  J. Gross,et al.  Steady-State Visual Evoked Potentials Can Be Explained by Temporal Superposition of Transient Event-Related Responses , 2011, PloS one.

[60]  C. Schroeder,et al.  Layer Specific Sharpening of Frequency Tuning by Selective Attention in Primary Auditory Cortex , 2014, The Journal of Neuroscience.

[61]  O. Jensen,et al.  University of Birmingham Attention Modulates TMS-Locked Alpha Oscillations in the Visual Cortex , 2015 .

[62]  J. Peelle,et al.  Prediction and constraint in audiovisual speech perception , 2015, Cortex.

[63]  Matthew H. Davis,et al.  Effortful Listening: The Processing of Degraded Speech Depends Critically on Attention , 2012, The Journal of Neuroscience.

[64]  C. Schroeder,et al.  Low-frequency neuronal oscillations as instruments of sensory selection , 2009, Trends in Neurosciences.

[65]  Nicolas Brunel,et al.  Author's Personal Copy Understanding the Relationships between Spike Rate and Delta/gamma Frequency Bands of Lfps and Eegs Using a Local Cortical Network Model , 2022 .

[66]  S. Scott,et al.  Identification of a pathway for intelligible speech in the left temporal lobe. , 2000, Brain : a journal of neurology.

[67]  C. Miniussi,et al.  The Functional Importance of Rhythmic Activity in the Brain , 2012, Current Biology.

[68]  G. Karmos,et al.  Transient cortical excitation at the onset of visual fixation. , 2008, Cerebral cortex.

[69]  Fred Cummins,et al.  Oscillators and Syllables: A Cautionary Note , 2012, Front. Psychology.

[70]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[71]  Oded Ghitza,et al.  Behavioral evidence for the role of cortical θ oscillations in determining auditory channel capacity for speech , 2014, Front. Psychol..

[72]  Luc H. Arnal Predicting “When” Using the Motor System’s Beta-Band Oscillations , 2012, Front. Hum. Neurosci..

[73]  G. Karmos,et al.  Entrainment of Neuronal Oscillations as a Mechanism of Attentional Selection , 2008, Science.

[74]  Oded Ghitza,et al.  Linking Speech Perception and Neurophysiology: Speech Decoding Guided by Cascaded Oscillators Locked to the Input Rhythm , 2011, Front. Psychology.

[75]  Benedikt Zoefel,et al.  EEG oscillations entrain their phase to high-level features of speech sound , 2016, NeuroImage.

[76]  David Poeppel,et al.  Towards a New Neurobiology of Language , 2012, The Journal of Neuroscience.

[77]  Ankoor S. Shah,et al.  An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. , 2005, Journal of neurophysiology.

[78]  T. Hackett,et al.  Predictive motor control of sensory dynamics in auditory active sensing , 2015, Current Opinion in Neurobiology.

[79]  Usha Goswami,et al.  Neural entrainment to rhythmic speech in children with developmental dyslexia , 2013, Front. Hum. Neurosci..

[80]  Karl J. Friston,et al.  A theory of cortical responses , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[81]  N. Mesgarani,et al.  Selective cortical representation of attended speaker in multi-talker speech perception , 2012, Nature.

[82]  B. Hangya,et al.  Phase Entrainment of Human Delta Oscillations Can Mediate the Effects of Expectation on Reaction Speed , 2010, The Journal of Neuroscience.

[83]  D. Poeppel,et al.  Phase Patterns of Neuronal Responses Reliably Discriminate Speech in Human Auditory Cortex , 2007, Neuron.

[84]  Floris P. de Lange,et al.  Local Entrainment of Alpha Oscillations by Visual Stimuli Causes Cyclic Modulation of Perception , 2014, The Journal of Neuroscience.

[85]  Daniel C. Krawczyk,et al.  Contributions of the prefrontal cortex to the neural basis of human decision making , 2002, Neuroscience & Biobehavioral Reviews.

[86]  Kirill V. Nourski,et al.  Representation of speech in human auditory cortex: Is it special? , 2013, Hearing Research.

[87]  Rufin VanRullen,et al.  The Psychophysics of Brain Rhythms , 2011, Front. Psychology.

[88]  Keith Johnson,et al.  Phonetic Feature Encoding in Human Superior Temporal Gyrus , 2014, Science.

[89]  John J. Foxe,et al.  Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG. , 2015, Cerebral cortex.

[90]  Antoine J. Shahin,et al.  Attentional Gain Control of Ongoing Cortical Speech Representations in a “Cocktail Party” , 2010, The Journal of Neuroscience.

[91]  David Poeppel,et al.  Cortical Oscillations in Auditory Perception and Speech: Evidence for Two Temporal Windows in Human Auditory Cortex , 2012, Front. Psychology.

[92]  Anne-Lise Giraud,et al.  The contribution of frequency-specific activity to hierarchical information processing in the human auditory cortex , 2014, Nature Communications.

[93]  David Poeppel,et al.  Discrimination of speech stimuli based on neuronal response phase patterns depends on acoustics but not comprehension. , 2010, Journal of neurophysiology.

[94]  Björn Herrmann,et al.  Neural Oscillations in Speech: Don't be Enslaved by the Envelope , 2012, Front. Hum. Neurosci..

[95]  P. Fries,et al.  Attention Samples Stimuli Rhythmically , 2012, Current Biology.

[96]  M. Rushworth,et al.  Valuation and decision-making in frontal cortex: one or many serial or parallel systems? , 2012, Current Opinion in Neurobiology.

[97]  N. Weisz,et al.  Not so different after all: The same oscillatory processes support different types of attention , 2015, Brain Research.

[98]  D. Poeppel,et al.  The cortical organization of speech processing , 2007, Nature Reviews Neuroscience.

[99]  E. Adrian,et al.  Brain Rhythms , 1944, Nature.

[100]  Luc H. Arnal,et al.  Dual Neural Routing of Visual Facilitation in Speech Processing , 2009, The Journal of Neuroscience.

[101]  John J. Foxe,et al.  Neuro-Oscillatory Phase Alignment Drives Speeded Multisensory Response Times: An Electro-Corticographic Investigation , 2015, The Journal of Neuroscience.

[102]  S. Debener,et al.  Look now and hear what's coming: On the functional role of cross-modal phase reset , 2014, Hearing Research.

[103]  H Spekreijse,et al.  Modulations of primary visual cortex activity representing attentive and conscious scene perception. , 2000, Frontiers in bioscience : a journal and virtual library.

[104]  Luc H. Arnal,et al.  Cortical oscillations and sensory predictions , 2012, Trends in Cognitive Sciences.

[105]  C. Schroeder,et al.  The Leading Sense: Supramodal Control of Neurophysiological Context by Attention , 2009, Neuron.

[106]  C. Schroeder,et al.  Tuning of the Human Neocortex to the Temporal Dynamics of Attended Events , 2011, The Journal of Neuroscience.

[107]  Luc H. Arnal,et al.  Delta-Beta Coupled Oscillations Underlie Temporal Prediction Accuracy. , 2015, Cerebral cortex.

[108]  Garreth Prendergast,et al.  The Role of Phase-locking to the Temporal Envelope of Speech in Auditory Perception and Speech Intelligibility , 2015, Journal of Cognitive Neuroscience.