The Role of Phase-locking to the Temporal Envelope of Speech in Auditory Perception and Speech Intelligibility

The temporal envelope of speech is important for speech intelligibility. Entrainment of cortical oscillations to the speech temporal envelope is a putative mechanism underlying speech intelligibility. Here we used magnetoencephalography (MEG) to test the hypothesis that phase-locking to the speech temporal envelope is enhanced for intelligible compared with unintelligible speech sentences. Perceptual “pop-out” was used to change the percept of physically identical tone-vocoded speech sentences from unintelligible to intelligible. The use of pop-out dissociates changes in phase-locking to the speech temporal envelope arising from acoustical differences between un/intelligible speech from changes in speech intelligibility itself. Novel and bespoke whole-head beamforming analyses, based on significant cross-correlation between the temporal envelopes of the speech stimuli and phase-locked neural activity, were used to localize neural sources that track the speech temporal envelope of both intelligible and unintelligible speech. Location-of-interest analyses were carried out in a priori defined locations to measure the representation of the speech temporal envelope for both un/intelligible speech in both the time domain (cross-correlation) and frequency domain (coherence). Whole-brain beamforming analyses identified neural sources phase-locked to the temporal envelopes of both unintelligible and intelligible speech sentences. Crucially there was no difference in phase-locking to the temporal envelope of speech in the pop-out condition in either the whole-brain or location-of-interest analyses, demonstrating that phase-locking to the speech temporal envelope is not enhanced by linguistic information.

[1]  David Poeppel,et al.  Cortical oscillations and speech processing: emerging computational principles and operations , 2012, Nature Neuroscience.

[2]  Sophie K Scott,et al.  The neurobiology of speech perception and production--can functional imaging tell us anything we did not already know? , 2012, Journal of communication disorders.

[3]  David Poeppel,et al.  Acoustic landmarks drive delta–theta oscillations to enable speech comprehension by facilitating perceptual parsing , 2014, NeuroImage.

[4]  Gary G. R. Green,et al.  Non-parametric statistical thresholding of baseline free MEG beamformer images , 2011, NeuroImage.

[5]  D. Abrams,et al.  Right-Hemisphere Auditory Cortex Is Dominant for Coding Syllable Patterns in Speech , 2008, The Journal of Neuroscience.

[6]  A Kohlrausch,et al.  Intrinsic envelope fluctuations and modulation-detection thresholds for narrow-band noise carriers. , 1999, The Journal of the Acoustical Society of America.

[7]  Oded Ghitza,et al.  On the Role of Theta-Driven Syllabic Parsing in Decoding Speech: Intelligibility of Speech with a Manipulated Modulation Spectrum , 2012, Front. Psychology.

[8]  W. Drongelen,et al.  Localization of brain electrical activity via linearly constrained minimum variance spatial filtering , 1997, IEEE Transactions on Biomedical Engineering.

[9]  J. Hegdé,et al.  A Link between Visual Disambiguation and Visual Memory , 2010, The Journal of Neuroscience.

[10]  Matthew H. Davis,et al.  Lexical information drives perceptual learning of distorted speech: evidence from the comprehension of noise-vocoded sentences. , 2005, Journal of experimental psychology. General.

[11]  A. Macleod,et al.  Quantifying the contribution of vision to speech perception in noise. , 1987, British journal of audiology.

[12]  E. Liebenthal,et al.  Short-Term Reorganization of Auditory Analysis Induced by Phonetic Experience , 2003, Journal of Cognitive Neuroscience.

[13]  Willy Serniclaes,et al.  Neural correlates of switching from auditory to speech perception , 2005, NeuroImage.

[14]  Matthew H. Davis,et al.  Predictive Top-Down Integration of Prior Knowledge during Speech Perception , 2012, The Journal of Neuroscience.

[15]  David Poeppel,et al.  The analysis of speech in different temporal integration windows: cerebral lateralization as 'asymmetric sampling in time' , 2003, Speech Commun..

[16]  Joachim Gross,et al.  Phase-Locked Responses to Speech in Human Auditory Cortex are Enhanced During Comprehension , 2012, Cerebral cortex.

[17]  D. Poeppel,et al.  Phase Patterns of Neuronal Responses Reliably Discriminate Speech in Human Auditory Cortex , 2007, Neuron.

[18]  Paul J. Abbas,et al.  Direct Recordings from the Auditory Cortex in a Cochlear Implant User , 2013, Journal of the Association for Research in Otolaryngology.

[19]  Christopher K. Kovach,et al.  Temporal Envelope of Time-Compressed Speech Represented in the Human Auditory Cortex , 2009, The Journal of Neuroscience.

[20]  Rufin VanRullen,et al.  The Role of High-Level Processes for Oscillatory Phase Entrainment to Speech Sound , 2015, Front. Hum. Neurosci..

[21]  Luc H. Arnal,et al.  Asymmetric Function of Theta and Gamma Activity in Syllable Processing: An Intra-Cortical Study , 2012, Front. Psychology.

[22]  V Ball,et al.  Lip-reading the BKB sentence lists: corrections for list and practice effects. , 1993, British journal of audiology.

[23]  Terry M. Peters,et al.  3D statistical neuroanatomical models from 305 MRI volumes , 1993, 1993 IEEE Conference Record Nuclear Science Symposium and Medical Imaging Conference.

[24]  Gary G. R. Green,et al.  Representations of the temporal envelope of sounds in human auditory cortex: Can the results from invasive intracortical “depth” electrode recordings be replicated using non-invasive MEG “virtual electrodes”? , 2013, NeuroImage.

[25]  F. Carducci,et al.  Automatic alignment of EEG/MEG and MRI data sets , 2001, Clinical Neurophysiology.

[26]  Oded Ghitza,et al.  Linking Speech Perception and Neurophysiology: Speech Decoding Guided by Cascaded Oscillators Locked to the Input Rhythm , 2011, Front. Psychology.

[27]  B. Moore,et al.  Benefit of high-rate envelope cues in vocoder processing: effect of number of channels and spectral region. , 2008, The Journal of the Acoustical Society of America.

[28]  Thomas E. Nichols,et al.  Nonparametric permutation tests for functional neuroimaging: A primer with examples , 2002, Human brain mapping.

[29]  Anders M. Dale,et al.  Multimodal imaging of repetition priming: Using fMRI, MEG, and intracranial EEG to reveal spatiotemporal profiles of word processing , 2010, NeuroImage.

[30]  R L Buckner,et al.  Spatiotemporal Maps of Brain Activity Underlying Word Generation and Their Modification during Repetition Priming , 2001, The Journal of Neuroscience.

[31]  Gary G. R. Green,et al.  Spatiotemporal reconstruction of the auditory steady-state response to frequency modulation using magnetoencephalography , 2010, NeuroImage.

[32]  David Poeppel,et al.  Discrimination of speech stimuli based on neuronal response phase patterns depends on acoustics but not comprehension. , 2010, Journal of neurophysiology.

[33]  Mitchell Steinschneider,et al.  Coding of repetitive transients by auditory cortex on Heschl's gyrus. , 2009, Journal of neurophysiology.

[34]  Richard L Freyman,et al.  Speech intelligibility in cochlear implant simulations: Effects of carrier type, interfering noise, and subject experience. , 2007, The Journal of the Acoustical Society of America.

[35]  Mark Hymers,et al.  Examining the Effects of One- and Three-Dimensional Spatial Filtering Analyses in Magnetoencephalography , 2011, PloS one.

[36]  B. Moore,et al.  Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. , 1983, The Journal of the Acoustical Society of America.

[37]  Matthew H. Davis,et al.  Neural Oscillations Carry Speech Rhythm through to Comprehension , 2012, Front. Psychology.

[38]  R. Plomp,et al.  Effect of reducing slow temporal modulations on speech reception. , 1994, The Journal of the Acoustical Society of America.

[39]  R M Leahy,et al.  A sensor-weighted overlapping-sphere head model and exhaustive head model comparison for MEG. , 1999, Physics in medicine and biology.

[40]  M. Berger,et al.  High Gamma Power Is Phase-Locked to Theta Oscillations in Human Neocortex , 2006, Science.

[41]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[42]  P. Morosan,et al.  Probabilistic Mapping and Volume Measurement of Human Primary Auditory Cortex , 2001, NeuroImage.

[43]  P Sterzer,et al.  Contributions of sensory input, auditory search and verbal comprehension to cortical activity during speech processing. , 2004, Cerebral cortex.

[44]  S. Rosen Temporal information in speech: acoustic, auditory and linguistic aspects. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[45]  M-X Huang,et al.  Commonalities and Differences Among Vectorized Beamformers in Electromagnetic Source Imaging , 2003, Brain Topography.

[46]  E Ahissar,et al.  Speech comprehension is correlated with temporal response patterns recorded from auditory cortex , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Ankoor S. Shah,et al.  An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. , 2005, Journal of neurophysiology.

[48]  D. Poeppel,et al.  Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party” , 2013, Neuron.

[49]  D. Poeppel,et al.  The cortical organization of speech processing , 2007, Nature Reviews Neuroscience.

[50]  Brian N. Pasley,et al.  Reconstructing Speech from Human Auditory Cortex , 2012, PLoS biology.