Neural Entrainment to Rhythmically Presented Auditory, Visual, and Audio-Visual Speech in Children

Auditory cortical oscillations have been proposed to play an important role in speech perception. It is suggested that the brain may take temporal “samples” of information from the speech stream at different rates, phase resetting ongoing oscillations so that they are aligned with similar frequency bands in the input (“phase locking”). Information from these frequency bands is then bound together for speech perception. To date, there are no explorations of neural phase locking and entrainment to speech input in children. However, it is clear from studies of language acquisition that infants use both visual speech information and auditory speech information in learning. In order to study neural entrainment to speech in typically developing children, we use a rhythmic entrainment paradigm (underlying 2 Hz or delta rate) based on repetition of the syllable “ba,” presented in either the auditory modality alone, the visual modality alone, or as auditory-visual speech (via a “talking head”). To ensure attention to the task, children aged 13 years were asked to press a button as fast as possible when the “ba” stimulus violated the rhythm for each stream type. Rhythmic violation depended on delaying the occurrence of a “ba” in the isochronous stream. Neural entrainment was demonstrated for all stream types, and individual differences in standardized measures of language processing were related to auditory entrainment at the theta rate. Further, there was significant modulation of the preferred phase of auditory entrainment in the theta band when visual speech cues were present, indicating cross-modal phase resetting. The rhythmic entrainment paradigm developed here offers a method for exploring individual differences in oscillatory phase locking during development. In particular, a method for assessing neural entrainment and cross-modal phase resetting would be useful for exploring developmental learning difficulties thought to involve temporal sampling, such as dyslexia.

[1]  B. Dodd,et al.  Language and Cognition: Evidence from Disordered Language , 2007 .

[2]  D. Poeppel,et al.  Speech Perception from a Neurophysiological Perspective , 2012 .

[3]  D. Poeppel,et al.  Temporal context in speech processing and attentional stream selection: A behavioral and neural perspective , 2012, Brain and Language.

[4]  Steven Greenberg,et al.  On the Possible Role of Brain Rhythms in Speech Perception: Intelligibility of Time-Compressed Speech with Periodic and Aperiodic Insertions of Silence , 2009, Phonetica.

[5]  J. McClellan,et al.  Chebyshev Approximation for Nonrecursive Digital Filters with Linear Phase , 1972 .

[6]  D. Lewkowicz,et al.  Narrowing of intersensory speech perception in infancy , 2009, Proceedings of the National Academy of Sciences.

[7]  Steven Greenberg,et al.  Temporal properties of spontaneous speech - a syllable-centric perspective , 2003, J. Phonetics.

[8]  Jarmo A. Hämäläinen,et al.  Reduced phase locking to slow amplitude modulation in adults with dyslexia: An MEG study , 2012, NeuroImage.

[9]  D. Poeppel,et al.  Phase Patterns of Neuronal Responses Reliably Discriminate Speech in Human Auditory Cortex , 2007, Neuron.

[10]  David Poeppel,et al.  The neuromagnetic response to spoken sentences: Co-modulation of theta band amplitude and phase , 2012, NeuroImage.

[11]  U. Goswami,et al.  Rise time and formant transition duration in the discrimination of speech sounds: the Ba-Wa distinction in developmental dyslexia. , 2011, Developmental science.

[12]  U. Goswami A temporal sampling framework for developmental dyslexia , 2011, Trends in Cognitive Sciences.

[13]  G. Buzsáki,et al.  Temporal structure in spatially organized neuronal ensembles: a role for interneuronal networks , 1995, Current Opinion in Neurobiology.

[14]  Jeffery A. Jones,et al.  Visual Prosody and Speech Intelligibility , 2004, Psychological science.

[15]  G. Buzsáki,et al.  Neuronal Oscillations in Cortical Networks , 2004, Science.

[16]  C. Schroeder,et al.  Tuning of the Human Neocortex to the Temporal Dynamics of Attended Events , 2011, The Journal of Neuroscience.

[17]  Usha Goswami,et al.  Auditory Processing of Amplitude Envelope Rise Time in Adults Diagnosed With Developmental Dyslexia , 2007 .

[18]  D. Poeppel,et al.  Speech perception at the interface of neurobiology and linguistics , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[19]  H. Levitt Transformed up-down methods in psychoacoustics. , 1971, The Journal of the Acoustical Society of America.

[20]  Marcelo A. Montemurro,et al.  Spike-Phase Coding Boosts and Stabilizes Information Carried by Spatial and Temporal Spike Patterns , 2009, Neuron.

[21]  C. Schroeder,et al.  Low-frequency neuronal oscillations as instruments of sensory selection , 2009, Trends in Neurosciences.

[22]  John J. Foxe,et al.  Oscillatory Sensory Selection Mechanisms during Intersensory Attention to Rhythmic Auditory and Visual Inputs: A Human Electrocorticographic Investigation , 2011, The Journal of Neuroscience.

[23]  P. Kuhl Early language acquisition: cracking the speech code , 2004, Nature Reviews Neuroscience.

[24]  Denis Burnham,et al.  Auditory-visual speech integration by prelinguistic infants: perception of an emergent consonant in the McGurk effect. , 2004, Developmental psychobiology.

[25]  D. Abrams,et al.  Abnormal Cortical Processing of the Syllable Rate of Speech in Poor Readers , 2009, The Journal of Neuroscience.

[26]  Oded Ghitza,et al.  Linking Speech Perception and Neurophysiology: Speech Decoding Guided by Cascaded Oscillators Locked to the Input Rhythm , 2011, Front. Psychology.

[27]  Franck Ramus,et al.  Altered Low-Gamma Sampling in Auditory Cortex Accounts for the Three Main Facets of Dyslexia , 2011, Neuron.

[28]  G. Karmos,et al.  Entrainment of Neuronal Oscillations as a Mechanism of Attentional Selection , 2008, Science.

[29]  B. Hangya,et al.  Phase Entrainment of Human Delta Oscillations Can Mediate the Effects of Expectation on Reaction Speed , 2010, The Journal of Neuroscience.

[30]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[31]  R. B. Reilly,et al.  FASTER: Fully Automated Statistical Thresholding for EEG artifact Rejection , 2010, Journal of Neuroscience Methods.

[32]  R. Plomp,et al.  Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.

[33]  T W Picton,et al.  Potentials evoked by the sinusoidal modulation of the amplitude or frequency of a tone. , 1987, The Journal of the Acoustical Society of America.

[34]  Ankoor S. Shah,et al.  An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. , 2005, Journal of neurophysiology.

[35]  Charles H. Brown,et al.  The Influence of Natural Scene Dynamics on Auditory Cortical Activity , 2010, The Journal of Neuroscience.

[36]  R. Plomp,et al.  Effect of reducing slow temporal modulations on speech reception. , 1994, The Journal of the Acoustical Society of America.

[37]  Asif A. Ghazanfar,et al.  The Natural Statistics of Audiovisual Speech , 2009, PLoS Comput. Biol..

[38]  M. Tomasello,et al.  Variability in early communicative development. , 1994, Monographs of the Society for Research in Child Development.

[39]  D. Wechsler Wechsler Intelligence Scale for Children , 2020, Definitions.

[40]  R. Drullman The significance of temporal modulation frequencies for speech intelligibility. Part I: Perspective , 2005 .

[41]  D. Poeppel,et al.  The cortical organization of speech processing , 2007, Nature Reviews Neuroscience.

[42]  S. Debener,et al.  Cross-Modal Phase Reset Predicts Auditory Task Performance in Humans , 2011, The Journal of Neuroscience.

[43]  A. Puce,et al.  Neuronal oscillations and visual amplification of speech , 2008, Trends in Cognitive Sciences.

[44]  David Poeppel,et al.  The analysis of speech in different temporal integration windows: cerebral lateralization as 'asymmetric sampling in time' , 2003, Speech Commun..

[45]  J. Werker,et al.  Cross-language speech perception: Evidence for perceptual reorganization during the first year of life , 1984 .

[46]  Patrick Chauvel,et al.  Temporal envelope processing in the human left and right auditory cortices. , 2004, Cerebral cortex.

[47]  D. Poeppel,et al.  Auditory Cortex Tracks Both Auditory and Visual Stimulus Dynamics Using Low-Frequency Neuronal Phase Modulation , 2010, PLoS biology.

[48]  D. Lewkowicz,et al.  Intersensory Perception at Birth: Newborns Match Nonhuman Primate Faces and Voices. , 2010, Infancy : the official journal of the International Society on Infant Studies.

[49]  D. Abrams,et al.  Right-Hemisphere Auditory Cortex Is Dominant for Coding Syllable Patterns in Speech , 2008, The Journal of Neuroscience.

[50]  Ulla Richardson,et al.  Auditory processing skills and phonological representation in dyslexic children. , 2004, Dyslexia.