Neural Entrainment to Speech Modulates Speech Intelligibility

Speech is crucial for communication in everyday life. Speech-brain entrainment, the alignment of neural activity to the slow temporal fluctuations (envelope) of acoustic speech input, is a ubiquitous element of current theories of speech processing. Associations between speech-brain entrainment and acoustic speech signal, listening task, and speech intelligibility have been observed repeatedly. However, a methodological bottleneck has prevented so far clarifying whether speech-brain entrainment contributes functionally to (i.e., causes) speech intelligibility or is merely an epiphenomenon of it. To address this long-standing issue, we experimentally manipulated speech-brain entrainment without concomitant acoustic and task-related variations, using a brain stimulation approach that enables modulating listeners' neural activity with transcranial currents carrying speech-envelope information. Results from two experiments involving a cocktail-party-like scenario and a listening situation devoid of aural speech-amplitude envelope input reveal consistent effects on listeners' speech-recognition performance, demonstrating a causal role of speech-brain entrainment in speech intelligibility. Our findings imply that speech-brain entrainment is critical for auditory speech comprehension and suggest that transcranial stimulation with speech-envelope-shaped currents can be utilized to modulate speech comprehension in impaired listening conditions.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  David Poeppel,et al.  The effects of selective attention and speech acoustics on neural speech-tracking in a multi-talker scene , 2015, Cortex.

[3]  G. Stickney,et al.  On the dichotomy in auditory perception between temporal envelope and fine structure cues. , 2004, The Journal of the Acoustical Society of America.

[4]  L. V. Noorden Temporal coherence in the perception of tone sequences , 1975 .

[5]  Simon Hanslmayr,et al.  Probing the causal role of prestimulus interregional synchrony for perceptual integration via tACS , 2016, Scientific Reports.

[6]  Lars Riecke,et al.  4-Hz Transcranial Alternating Current Stimulation Phase Modulates Hearing , 2015, Brain Stimulation.

[7]  Garreth Prendergast,et al.  The Role of Phase-locking to the Temporal Envelope of Speech in Auditory Perception and Speech Intelligibility , 2015, Journal of Cognitive Neuroscience.

[8]  Roi Cohen Kadosh,et al.  Not all brains are created equal: the relevance of individual differences in responsiveness to transcranial electrical stimulation , 2014, Front. Syst. Neurosci..

[9]  J. Minckler,et al.  A note on the gross configurations of the human auditory cortex , 1976, Brain and Language.

[10]  Uta Noppeney,et al.  When sentences live up to your expectations , 2016, NeuroImage.

[11]  Brian C J Moore,et al.  Contribution of very low amplitude-modulation rates to intelligibility in a competing-speech task (L). , 2009, The Journal of the Acoustical Society of America.

[12]  Hartwig R. Siebner,et al.  Combining non-invasive transcranial brain stimulation with neuroimaging and electrophysiology: Current approaches and future perspectives , 2016, NeuroImage.

[13]  D. Poeppel,et al.  Cortical entrainment to music and its modulation by expertise , 2015, Proceedings of the National Academy of Sciences.

[14]  Eric Moulines,et al.  Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..

[15]  Alexandre Hyafil,et al.  Speech encoding by coupled cortical theta and gamma oscillations , 2015, eLife.

[16]  D. Poeppel,et al.  Temporal context in speech processing and attentional stream selection: A behavioral and neural perspective , 2012, Brain and Language.

[17]  Lars Riecke,et al.  Stimulus Presentation at Specific Neuronal Oscillatory Phases Experimentally Controlled with tACS: Implementation and Applications , 2016, Front. Cell. Neurosci..

[18]  J. Jefferys,et al.  Effects of uniform extracellular DC electric fields on excitability in rat hippocampal slices in vitro , 2004, The Journal of physiology.

[19]  Christian Lorenzi,et al.  The ability of listeners to use recovered envelope cues from speech fine structure. , 2006, The Journal of the Acoustical Society of America.

[20]  Edmund C. Lalor,et al.  Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing , 2015, Current Biology.

[21]  D. McCormick,et al.  Endogenous Electric Fields May Guide Neocortical Network Activity , 2010, Neuron.

[22]  Rufin VanRullen,et al.  The Role of High-Level Processes for Oscillatory Phase Entrainment to Speech Sound , 2015, Front. Hum. Neurosci..

[23]  B. Moore An Introduction to the Psychology of Hearing: Sixth Edition , 2012 .

[24]  R. Plomp,et al.  Effect of reducing slow temporal modulations on speech reception. , 1994, The Journal of the Acoustical Society of America.

[25]  P. Schyns,et al.  Speech Rhythms and Multiplexed Oscillatory Sensory Coding in the Human Brain , 2013, PLoS biology.

[26]  Matthew H. Davis,et al.  Neural Oscillations Carry Speech Rhythm through to Comprehension , 2012, Front. Psychology.

[27]  David Poeppel,et al.  Cortical oscillations and speech processing: emerging computational principles and operations , 2012, Nature Neuroscience.

[28]  Oded Ghitza,et al.  On the Role of Theta-Driven Syllabic Parsing in Decoding Speech: Intelligibility of Speech with a Manipulated Modulation Spectrum , 2012, Front. Psychology.

[29]  Jon Andoni Duñabeitia,et al.  Differential oscillatory encoding of foreign speech , 2015, Brain and Language.

[30]  Oded Ghitza,et al.  Linking Speech Perception and Neurophysiology: Speech Decoding Guided by Cascaded Oscillators Locked to the Input Rhythm , 2011, Front. Psychology.

[31]  Benedikt Zoefel,et al.  EEG oscillations entrain their phase to high-level features of speech sound , 2016, NeuroImage.

[32]  Gregor Thut,et al.  Lip movements entrain the observers’ low-frequency brain oscillations to facilitate speech intelligibility , 2016, eLife.

[33]  Ramesh Srinivasan,et al.  Suppression of competing speech through entrainment of cortical oscillations. , 2013, Journal of neurophysiology.

[34]  D. Poeppel,et al.  Temporal window of integration in auditory-visual speech perception , 2007, Neuropsychologia.

[35]  L. Parra,et al.  Low frequency transcranial electrical stimulation does not entrain sleep rhythms measured by human intracranial recordings , 2017, Nature Communications.

[36]  D. Poeppel,et al.  Auditory Cortex Tracks Both Auditory and Visual Stimulus Dynamics Using Low-Frequency Neuronal Phase Modulation , 2010, PLoS biology.

[37]  J. Simon,et al.  Cortical entrainment to continuous speech: functional roles and interpretations , 2014, Front. Hum. Neurosci..

[38]  Á. Pascual-Leone,et al.  Contribution of axonal orientation to pathway-dependent modulation of excitatory transmission by direct current stimulation in isolated rat hippocampus. , 2012, Journal of neurophysiology.

[39]  D. T. Ives,et al.  Optimal Combination of Neural Temporal Envelope and Fine Structure Cues to Explain Speech Identification in Background Noise , 2014, The Journal of Neuroscience.

[40]  B. Grothe,et al.  Modulation of auditory percepts by transcutaneous electrical stimulation , 2017, Hearing Research.

[41]  Anahita Basirat,et al.  High-frequency neural activity predicts word parsing in ambiguous speech streams. , 2016, Journal of neurophysiology.

[42]  David Poeppel,et al.  Detection of auditory (cross-spectral) and auditory-visual (cross-modal) synchrony , 2004, Speech Commun..

[43]  Marco Buiatti,et al.  Investigating the neural correlates of continuous speech computation with frequency-tagged neuroelectric responses , 2009, NeuroImage.

[44]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[45]  Christopher K. Kovach,et al.  Temporal Envelope of Time-Compressed Speech Represented in the Human Auditory Cortex , 2009, The Journal of Neuroscience.

[46]  Robin A A Ince,et al.  Irregular Speech Rate Dissociates Auditory Cortical Entrainment, Evoked Responses, and Frontal Alpha , 2015, The Journal of Neuroscience.

[47]  E Ahissar,et al.  Speech comprehension is correlated with temporal response patterns recorded from auditory cortex , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Katharina S. Rufener,et al.  Modulating Human Auditory Processing by Transcranial Electrical Stimulation , 2016, Front. Cell. Neurosci..

[49]  Lars Meyer,et al.  Linguistic Bias Modulates Interpretation of Speech via Neural Delta-Band Oscillations , 2016, Cerebral cortex.

[50]  B. Moore An Introduction to the Psychology of Hearing , 1977 .

[51]  W. Paulus Transcranial electrical stimulation (tES – tDCS; tRNS, tACS) methods , 2011, Neuropsychological rehabilitation.

[52]  J. Peelle,et al.  Prediction and constraint in audiovisual speech perception , 2015, Cortex.

[53]  Joachim Gross,et al.  Phase-Locked Responses to Speech in Human Auditory Cortex are Enhanced During Comprehension , 2012, Cerebral cortex.

[54]  C. Schroeder,et al.  Low-frequency neuronal oscillations as instruments of sensory selection , 2009, Trends in Neurosciences.

[55]  Rufin VanRullen,et al.  Selective Perceptual Phase Entrainment to Speech Rhythm in the Absence of Spectral Energy Fluctuations , 2015, The Journal of Neuroscience.

[56]  Alexander Opitz,et al.  Spatiotemporal structure of intracranial electric fields induced by transcranial electric stimulation in humans and nonhuman primates , 2016, Scientific Reports.

[57]  Matthew H. Davis,et al.  Transcranial electric stimulation for the investigation of speech perception and comprehension , 2016, Language, cognition and neuroscience.

[58]  Lars Riecke,et al.  Endogenous Delta/Theta Sound-Brain Phase Entrainment Accelerates the Buildup of Auditory Streaming , 2015, Current Biology.

[59]  F. Fröhlich,et al.  Transcranial Alternating Current Stimulation Modulates Large-Scale Cortical Network Activity by Network Resonance , 2013, The Journal of Neuroscience.

[60]  Steven Greenberg,et al.  Multi-time resolution analysis of speech: evidence from psychophysics , 2015, Front. Neurosci..

[61]  Usha Goswami,et al.  A Rhythmic Musical Intervention for Poor Readers: A Comparison of Efficacy With a Letter‐Based Intervention , 2013 .

[62]  D. Poeppel,et al.  Cortical Tracking of Hierarchical Linguistic Structures in Connected Speech , 2015, Nature Neuroscience.

[63]  Michael J. Crosse,et al.  Congruent Visual Speech Enhances Cortical Entrainment to Continuous Auditory Speech in Noise-Free Conditions , 2015, The Journal of Neuroscience.

[64]  John J. Foxe,et al.  The timing and laminar profile of converging inputs to multisensory areas of the macaque neocortex. , 2002, Brain research. Cognitive brain research.

[65]  Aniruddh D. Patel,et al.  Temporal modulations in speech and music , 2017, Neuroscience & Biobehavioral Reviews.

[66]  Uta Noppeney,et al.  Audiovisual asynchrony detection in human speech. , 2011, Journal of experimental psychology. Human perception and performance.

[67]  Gregory B. Cogan,et al.  Visual Input Enhances Selective Speech Envelope Tracking in Auditory Cortex at a “Cocktail Party” , 2013, The Journal of Neuroscience.

[68]  C. Kayser,et al.  Rhythmic Auditory Cortex Activity at Multiple Timescales Shapes Stimulus–Response Gain and Background Firing , 2015, The Journal of Neuroscience.

[69]  J. Verhoeven,et al.  Speech Rate in a Pluricentric Language: A Comparison Between Dutch in Belgium and the Netherlands , 2004, Language and speech.

[70]  B. Krekelberg,et al.  Transcranial Alternating Current Stimulation Attenuates Visual Motion Adaptation , 2014, The Journal of Neuroscience.

[71]  A. Puce,et al.  Neuronal oscillations and visual amplification of speech , 2008, Trends in Cognitive Sciences.

[72]  N. Mesgarani,et al.  Selective cortical representation of attended speaker in multi-talker speech perception , 2012, Nature.

[73]  R. Plomp,et al.  Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.

[74]  David Poeppel,et al.  The Tracking of Speech Envelope in the Human Cortex , 2013, PloS one.

[75]  Jean-Luc Schwartz,et al.  No, There Is No 150 ms Lead of Visual Speech on Auditory Speech, but a Range of Audiovisual Asynchronies Varying from Small Audio Lead to Large Audio Lag , 2014, PLoS Comput. Biol..

[76]  B. Hangya,et al.  Phase Entrainment of Human Delta Oscillations Can Mediate the Effects of Expectation on Reaction Speed , 2010, The Journal of Neuroscience.

[77]  Nicolas Grimault,et al.  Streaming of vowel sequences based on fundamental frequency in a cochlear-implant simulation. , 2008, The Journal of the Acoustical Society of America.

[78]  T Houtgast,et al.  Method for the selection of sentence materials for efficient measurement of the speech reception threshold. , 1999, The Journal of the Acoustical Society of America.

[79]  Erik Edwards,et al.  Syllabic (∼2–5 Hz) and fluctuation (∼1–10 Hz) ranges in speech and auditory processing , 2013, Hearing Research.

[80]  J. Simon,et al.  Emergence of neural encoding of auditory objects while listening to competing speakers , 2012, Proceedings of the National Academy of Sciences.

[81]  O Ghitza,et al.  On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception. , 2001, The Journal of the Acoustical Society of America.

[82]  C. Schroeder,et al.  The Leading Sense: Supramodal Control of Neurophysiological Context by Attention , 2009, Neuron.

[83]  B. Nolan Boosting slow oscillations during sleep potentiates memory , 2008 .

[84]  Martin Meyer,et al.  40Hz-Transcranial alternating current stimulation (tACS) selectively modulates speech perception. , 2016, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[85]  S. Debener,et al.  Look now and hear what's coming: On the functional role of cross-modal phase reset , 2014, Hearing Research.

[86]  Lucia Melloni,et al.  Brain Oscillations during Spoken Sentence Processing , 2012, Journal of Cognitive Neuroscience.

[87]  Virginie van Wassenhove,et al.  Distinct contributions of low- and high-frequency neural oscillations to speech comprehension , 2017 .

[88]  D. Poeppel,et al.  Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party” , 2013, Neuron.

[89]  Carsten H. Wolters,et al.  Good vibrations: Oscillatory phase shapes perception , 2012, NeuroImage.

[90]  Matthew H. Davis,et al.  Predictive Top-Down Integration of Prior Knowledge during Speech Perception , 2012, The Journal of Neuroscience.