Phase Alignment of Low-Frequency Neural Activity to the Amplitude Envelope of Speech Reflects Evoked Responses to Acoustic Edges, Not Oscillatory Entrainment

The amplitude envelope of speech is crucial for accurate comprehension, and several studies have shown that the phase of neural activity in the theta-delta bands (1 - 10 Hz) tracks the phase of the speech amplitude envelope during listening, a process referred to as envelope tracking. However, the mechanisms underlying envelope tracking have been heavily debated. A dominant model posits that envelope tracking reflects continuous entrainment of endogenous low-frequency oscillations to the speech envelope. However, it has proven challenging to distinguish this from the alternative model that envelope tracking reflects a convolution of evoked responses to acoustic landmarks within the envelope. To address this, we recorded magnetoencephalography while participants listened to natural and slowed speech. First, we found that peaks in the rate of envelope change, acoustic edge landmarks, induced a strong evoked response. We then contrasted central diverging predictions of evoked response and oscillatory models regarding the spectral and temporal extent of neural phase locking in the theta-delta range at different speech rates. Our analyses revealed transient theta phase locking at both speech rates and delta phase locking in slow speech. This pattern was predicted by the evoked response model but not by oscillatory entrainment. Furthermore, the encoding of acoustic edge magnitudes was invariant to contextual speech rate, demonstrating speech rate normalization of acoustic edge representations. Taken together, our results show that an evoked response model provides a better account of neural phase locking to the speech envelope than oscillatory entrainment.

[1]  Jeesun Kim,et al.  Exploring the Role of Brain Oscillations in Speech Perception in Noise: Intelligibility of Isochronously Retimed Speech , 2016, Front. Hum. Neurosci..

[2]  P. Heil,et al.  Parallels between timing of onset responses of single neurons in cat and of evoked magnetic fields in human auditory cortex. , 2000, Journal of neurophysiology.

[3]  David Poeppel,et al.  An oscillator model better predicts cortical entrainment to music , 2019, Proceedings of the National Academy of Sciences.

[4]  Anne-Lise Giraud,et al.  θ-Band and β-Band Neural Activity Reflects Independent Syllable Tracking and Comprehension of Time-Compressed Speech , 2017, The Journal of Neuroscience.

[5]  Didier Dormont,et al.  Deciphering logopenic primary progressive aphasia: a clinical, imaging and biomarker investigation. , 2013, Brain : a journal of neurology.

[6]  P. Schyns,et al.  Speech Rhythms and Multiplexed Oscillatory Sensory Coding in the Human Brain , 2013, PLoS biology.

[7]  Anne Kösem,et al.  An Entrained Rhythm's Frequency, Not Phase, Influences Temporal Sampling of Speech , 2017, INTERSPEECH.

[8]  Christopher K. Kovach,et al.  Temporal Envelope of Time-Compressed Speech Represented in the Human Auditory Cortex , 2009, The Journal of Neuroscience.

[9]  John J. Foxe,et al.  Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG. , 2015, Cerebral cortex.

[10]  Oded Ghitza,et al.  The theta-syllable: a unit of speech information defined by cortical function , 2013, Front. Psychol..

[11]  Kensuke Sekihara,et al.  Evaluation of a dual signal subspace projection algorithm in magnetoencephalographic recordings from patients with intractable epilepsy and vagus nerve stimulators , 2019, NeuroImage.

[12]  Joachim Gross,et al.  Phase-Locked Responses to Speech in Human Auditory Cortex are Enhanced During Comprehension , 2012, Cerebral cortex.

[13]  Matthew H. Davis,et al.  Phase Entrainment of Brain Oscillations Causally Modulates Neural Responses to Intelligible Speech , 2018, Current Biology.

[14]  Robert Oostenveld,et al.  FieldTrip: Open Source Software for Advanced Analysis of MEG, EEG, and Invasive Electrophysiological Data , 2010, Comput. Intell. Neurosci..

[15]  Matthew H. Davis,et al.  Perceptual learning of degraded speech by minimizing prediction error , 2016, Proceedings of the National Academy of Sciences.

[16]  Karl J. Friston,et al.  Active listening , 2020, Hearing Research.

[17]  David Poeppel,et al.  Acoustic landmarks drive delta–theta oscillations to enable speech comprehension by facilitating perceptual parsing , 2014, NeuroImage.

[18]  Jan Kujala,et al.  Cortical Tracking of Global and Local Variations of Speech Rhythm during Connected Natural Speech Perception , 2018, Journal of Cognitive Neuroscience.

[19]  E. Chang,et al.  A speech envelope landmark for syllable encoding in human superior temporal gyrus , 2018, Science Advances.

[20]  L. Deouell,et al.  Neural mechanisms of rhythm-based temporal prediction: Delta phase-locking reflects temporal predictability but not rhythmic entrainment , 2017, PLoS biology.

[21]  C. Kayser,et al.  Neural Entrainment and Attentional Selection in the Listening Brain , 2019, Trends in Cognitive Sciences.

[22]  H. Heinze,et al.  Pulsed transcranial electric brain stimulation enhances speech comprehension , 2020, Brain Stimulation.

[23]  Benedikt Zoefel,et al.  Speech Entrainment: Rhythmic Predictions Carried by Neural Oscillations , 2018, Current Biology.

[24]  Lars Meyer,et al.  Linguistic Bias Modulates Interpretation of Speech via Neural Delta-Band Oscillations , 2016, Cerebral cortex.

[25]  Riitta Salmelin,et al.  Cortical entrainment: what we can learn from studying naturalistic speech perception , 2018, Language, Cognition and Neuroscience.

[26]  Sophie K. Scott,et al.  The point of P-centres , 1998 .

[27]  Edmund C Lalor,et al.  Semantic Context Enhances the Early Auditory Encoding of Natural Speech , 2019, The Journal of Neuroscience.

[28]  C. A. Harsin Acoustics of perceptual centers in speech , 1993 .

[29]  Oded Ghitza,et al.  Linking Speech Perception and Neurophysiology: Speech Decoding Guided by Cascaded Oscillators Locked to the Input Rhythm , 2011, Front. Psychology.

[30]  Steven Greenberg,et al.  On the Possible Role of Brain Rhythms in Speech Perception: Intelligibility of Time-Compressed Speech with Periodic and Aperiodic Insertions of Silence , 2009, Phonetica.

[31]  David Poeppel,et al.  Cortical oscillations and speech processing: emerging computational principles and operations , 2012, Nature Neuroscience.

[32]  Kensuke Sekihara,et al.  Dual signal subspace projection (DSSP): a novel algorithm for removing large interference in biomagnetic measurements , 2016, Journal of neural engineering.

[33]  Luc H. Arnal,et al.  Proactive Sensing of Periodic and Aperiodic Auditory Patterns , 2018, Trends in Cognitive Sciences.

[34]  Eva Reinisch,et al.  Speaker-specific processing and local context information: The case of speaking rate , 2016 .

[35]  Richard B. Ivry,et al.  Double dissociation of single-interval and rhythmic temporal prediction in cerebellar degeneration and Parkinson’s disease , 2018, Proceedings of the National Academy of Sciences.

[36]  L. Elliot Hong,et al.  Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech , 2018, Current Biology.

[37]  R. Plomp,et al.  Effect of reducing slow temporal modulations on speech reception. , 1994, The Journal of the Acoustical Society of America.

[38]  L. Nilsson,et al.  Ageing, cognition, and neuroscience , 2009 .

[39]  R. Knight,et al.  Neural entrainment and network resonance in support of top-down guided attention. , 2019, Current opinion in psychology.

[40]  Sylvain Baillet,et al.  Two Distinct Neural Timescales for Predictive Speech Processing , 2019, Neuron.

[41]  Lars Meyer,et al.  Synchronous, but not entrained: exogenous and endogenous cortical rhythms of speech and language processing , 2019, Language, Cognition and Neuroscience.

[42]  Benedikt Zoefel,et al.  The Involvement of Endogenous Neural Oscillations in the Processing of Rhythmic Input: More Than a Regular Repetition of Evoked Neural Responses , 2018, Front. Neurosci..

[43]  Matthew H. Davis,et al.  Perception of Rhythmic Speech Is Modulated by Focal Bilateral Transcranial Alternating Current Stimulation , 2020, Journal of Cognitive Neuroscience.

[44]  Erik Edwards,et al.  A Spatial Map of Onset and Sustained Responses to Speech in the Human Superior Temporal Gyrus , 2018, Current Biology.

[45]  Joachim Gross,et al.  Simple Acoustic Features Can Explain Phoneme-Based Predictions of Cortical Responses to Speech , 2019, Current Biology.

[46]  Alexander T Sack,et al.  Oscillatory phase shapes syllable perception , 2015, Proceedings of the National Academy of Sciences.

[47]  Eric Moulines,et al.  Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..

[48]  P. Mermelstein Automatic segmentation of speech into syllabic units. , 1975, The Journal of the Acoustical Society of America.

[49]  D. Poeppel,et al.  Cortical Tracking of Hierarchical Linguistic Structures in Connected Speech , 2015, Nature Neuroscience.

[50]  Laura Gwilliams,et al.  Hierarchical oscillators in speech comprehension: a commentary on Meyer, Sun, and Martin (2019) , 2020 .

[51]  E Ahissar,et al.  Speech comprehension is correlated with temporal response patterns recorded from auditory cortex , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[52]  David Poeppel,et al.  Auditory M50 and M100 responses to broadband noise: functional implications , 2004, Neuroreport.

[53]  R. Oostenveld,et al.  Nonparametric statistical testing of EEG- and MEG-data , 2007, Journal of Neuroscience Methods.

[54]  S. Haegens,et al.  Rhythmic facilitation of sensory processing: A critical review , 2017, Neuroscience & Biobehavioral Reviews.

[55]  Atsuko Takashima,et al.  Neural Entrainment Determines the Words We Hear , 2017, Current Biology.

[56]  Hermann Ackermann,et al.  Magnetic brain activity phase-locked to the envelope, the syllable onsets, and the fundamental frequency of a perceived speech signal. , 2012, Psychophysiology.

[57]  D. Abrams,et al.  Right-Hemisphere Auditory Cortex Is Dominant for Coding Syllable Patterns in Speech , 2008, The Journal of Neuroscience.

[58]  D. Poeppel,et al.  Phase Patterns of Neuronal Responses Reliably Discriminate Speech in Human Auditory Cortex , 2007, Neuron.

[59]  L. Holt,et al.  Perceptual effects of preceding nonspeech rate on temporal properties of speech categories , 2005, Perception & psychophysics.

[60]  R. Plomp,et al.  Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.

[61]  Matthew H. Davis,et al.  Neural Oscillations Carry Speech Rhythm through to Comprehension , 2012, Front. Psychology.

[62]  Anne-Lise Giraud,et al.  Combining predictive coding and neural oscillations enables online syllable recognition in natural speech , 2020, Nature Communications.

[63]  J. Gross,et al.  Steady-State Visual Evoked Potentials Can Be Explained by Temporal Superposition of Transient Event-Related Responses , 2011, PloS one.

[64]  Edmund C. Lalor,et al.  Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing , 2015, Current Biology.