Neural Correlates of Interindividual Differences in Children's Audiovisual Speech Perception

Children use information from both the auditory and visual modalities to aid in understanding speech. A dramatic illustration of this multisensory integration is the McGurk effect, an illusion in which an auditory syllable is perceived differently when it is paired with an incongruent mouth movement. However, there are significant interindividual differences in McGurk perception: some children never perceive the illusion, while others always do. Because converging evidence suggests that the posterior superior temporal sulcus (STS) is a critical site for multisensory integration, we hypothesized that activity within the STS would predict susceptibility to the McGurk effect. To test this idea, we used BOLD fMRI in 17 children aged 6–12 years to measure brain responses to the following three audiovisual stimulus categories: McGurk incongruent, non-McGurk incongruent, and congruent syllables. Two separate analysis approaches, one using independent functional localizers and another using whole-brain voxel-based regression, showed differences in the left STS between perceivers and nonperceivers. The STS of McGurk perceivers responded significantly more than that of nonperceivers to McGurk syllables, but not to other stimuli, and perceivers' hemodynamic responses in the STS were significantly prolonged. In addition to the STS, weaker differences between perceivers and nonperceivers were observed in the fusiform face area and extrastriate visual cortex. These results suggest that the STS is an important source of interindividual variability in children's audiovisual speech perception.

[1]  Michael Wilson,et al.  MRC psycholinguistic database: Machine-usable dictionary, version 2.00 , 1988 .

[2]  Thomas E. Nichols,et al.  Thresholding of Statistical Maps in Functional Neuroimaging Using the False Discovery Rate , 2002, NeuroImage.

[3]  Bertrand Thirion,et al.  Early Maturation of the Linguistic Dorsal Pathway in Human Infants , 2011, The Journal of Neuroscience.

[4]  L A Thompson,et al.  Developmental changes in visual and auditory contributions to speech perception. , 1986, Journal of experimental child psychology.

[5]  Linda Polka,et al.  A developmental study of audiovisual speech perception using the McGurk paradigm , 1994 .

[6]  W. K. Simmons,et al.  Circular analysis in systems neuroscience: the dangers of double dipping , 2009, Nature Neuroscience.

[7]  W. Grodd,et al.  Language comprehension vs. language production: Age effects on fMRI activation , 2011, Brain and Language.

[8]  Alan C. Evans,et al.  A new anatomical landmark for reliable identification of human area V5/MT: a quantitative analysis of sulcal patterning. , 2000, Cerebral cortex.

[9]  Paul M Matthews,et al.  The Role of the Posterior Fusiform Gyrus in Reading , 2006, Journal of Cognitive Neuroscience.

[10]  F. Lin,et al.  Primary and multisensory cortical activity is correlated with audiovisual percepts , 2009, Human brain mapping.

[11]  Luigi Cattaneo,et al.  Automatic audiovisual integration in speech perception , 2005, Experimental Brain Research.

[12]  Gregory McCarthy,et al.  Polysensory interactions along lateral temporal regions evoked by audiovisual speech. , 2003, Cerebral cortex.

[13]  Qingyang Li,et al.  Emotional perception: Meta-analyses of face and natural scene processing , 2011, NeuroImage.

[14]  Kristina M. Visscher,et al.  Functional Neuroanatomical Differences Between Adults and School-Age Children in the Processing of Single Words , 2002, Science.

[15]  Margaret Ross,et al.  Comparison of Speechreading Training Regimens , 2007, Perceptual and motor skills.

[16]  D. V. Cramon,et al.  Investigating the post-stimulus undershoot of the BOLD signal—a simultaneous fMRI and fNIRS study , 2006, NeuroImage.

[17]  A. Braun,et al.  Auditory lexical decision, categorical perception, and FM direction discrimination differentially engage left and right auditory cortex , 2004, Neuropsychologia.

[18]  Lee M. Miller,et al.  Behavioral/systems/cognitive Perceptual Fusion and Stimulus Coincidence in the Cross- Modal Integration of Speech , 2022 .

[19]  Conny F. Schmidt,et al.  Face perception is mediated by a distributed cortical network , 2005, Brain Research Bulletin.

[20]  Barry Horwitz,et al.  The elusive concept of brain connectivity , 2003, NeuroImage.

[21]  A. Meltzoff,et al.  The bimodal perception of speech in infancy. , 1982, Science.

[22]  M A Schmuckler,et al.  The McGurk effect in infants. , 1995, Perception & psychophysics.

[23]  Jeffery A. Jones,et al.  Multisensory Integration Sites Identified by Perception of Spatial Wavelet Filtered Visual Speech Gesture Information , 2004, Journal of Cognitive Neuroscience.

[24]  R. Campbell,et al.  Evidence from functional magnetic resonance imaging of crossmodal binding in the human heteromodal cortex , 2000, Current Biology.

[25]  H. Pashler,et al.  Puzzlingly High Correlations in fMRI Studies of Emotion, Personality, and Social Cognition 1 , 2009, Perspectives on psychological science : a journal of the Association for Psychological Science.

[26]  Michael S. Beauchamp,et al.  Statistical criteria in fMRI studies of multisensory integration , 2005, Neuroinformatics.

[27]  F. Lepore,et al.  Speech and Non-Speech Audio-Visual Illusions: A Developmental Study , 2007, PloS one.

[28]  J. Gagné,et al.  An evaluation of CAST: a Computer-Aided Speechreading Training program. , 1991, Journal of speech and hearing research.

[29]  Efstathios D. Gennatas,et al.  Network-level structural covariance in the developing brain , 2010, Proceedings of the National Academy of Sciences.

[30]  Roy D Patterson,et al.  Functional imaging of the auditory processing applied to speech sounds , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[31]  P. Skudlarski,et al.  Disruption of posterior brain systems for reading in children with developmental dyslexia , 2002, Biological Psychiatry.

[32]  Whitney M. Weikum,et al.  Visual Language Discrimination in Infancy , 2007, Science.

[33]  A. Dale,et al.  Cortical Surface-Based Analysis II: Inflation, Flattening, and a Surface-Based Coordinate System , 1999, NeuroImage.

[34]  J. Werker,et al.  Matching phonetic information in lips and voice is robust in 4.5-month-old infants , 1999 .

[35]  P. Jusczyk,et al.  A precursor of language acquisition in young infants , 1988, Cognition.

[36]  R. Zatorre,et al.  Human temporal-lobe response to vocal sounds. , 2002, Brain research. Cognitive brain research.

[37]  R. Campbell,et al.  Reading Speech from Still and Moving Faces: The Neural Substrates of Visible Speech , 2003, Journal of Cognitive Neuroscience.

[38]  Audrey R. Nath,et al.  Dynamic Changes in Superior Temporal Sulcus Connectivity during Perception of Noisy Audiovisual Speech , 2011, The Journal of Neuroscience.

[39]  Y. Sugita,et al.  Auditory-visual speech perception examined by fMRI and PET , 2003, Neuroscience Research.

[40]  Michael S. Beauchamp,et al.  A new method for improving functional-to-structural MRI alignment using local Pearson correlation , 2009, NeuroImage.

[41]  D. Louis Collins,et al.  Unbiased average age-appropriate atlases for pediatric studies , 2011, NeuroImage.

[42]  Jacques Mehler,et al.  Infant Recognition of Mother's Voice , 1978, Perception.

[43]  D. Lewkowicz,et al.  The development of intersensory temporal perception: an epigenetic systems/limitations view. , 2000, Psychological bulletin.

[44]  J. Vroomen,et al.  Audiovisual speech recalibration in children* , 2008, Journal of Child Language.

[45]  Steven L. Small,et al.  Neural development of networks for audiovisual speech comprehension , 2010, Brain and Language.

[46]  J. Schwartz A reanalysis of McGurk data suggests that audiovisual fusion in speech perception is subject-dependent. , 2010, The Journal of the Acoustical Society of America.

[47]  P. McGuire,et al.  Neural systems underlying British Sign Language and audio-visual English processing in native users. , 2002, Brain : a journal of neurology.

[48]  F. Ramus,et al.  Language discrimination by human newborns and by cotton-top tamarin monkeys. , 2000, Science.

[49]  Bruce D. McCandliss,et al.  Neural systems predicting long-term outcome in dyslexia , 2010, Proceedings of the National Academy of Sciences.

[50]  James R. Booth,et al.  Development of brain networks involved in spoken word processing of Mandarin Chinese , 2011, NeuroImage.

[51]  Audrey R. Nath,et al.  fMRI-Guided Transcranial Magnetic Stimulation Reveals That the Superior Temporal Sulcus Is a Cortical Locus of the McGurk Effect , 2010, The Journal of Neuroscience.

[52]  Thomas A Zeffiro,et al.  Development of neural mechanisms for reading , 2003, Nature Neuroscience.

[53]  Karl J. Friston,et al.  A critique of functional localisers , 2006, NeuroImage.

[54]  P. Rochat Early Social Cognition : Understanding Others in the First Months of Life , 1999 .

[55]  Mark T. Wallace,et al.  The Development of Cortical Multisensory Integration , 2006, The Journal of Neuroscience.

[56]  L. Bahrick,et al.  The development of infant learning about specific face-voice relations. , 2005, Developmental psychology.

[57]  J. Gore,et al.  A comparison of bound and unbound audio-visual information processing in the human cerebral cortex. , 2002, Brain research. Cognitive brain research.

[58]  P. van Dijk,et al.  Neural responses to silent lipreading in normal hearing male and female subjects , 2006, The European journal of neuroscience.

[59]  E. Gibson,et al.  An Ecological Approach to Perceptual Learning and Development , 2000 .

[60]  D. Attwell,et al.  The physiology of developmental changes in BOLD functional imaging signals , 2011, Developmental Cognitive Neuroscience.

[61]  F. Gonzalez-Lima,et al.  Structural equation modeling and its application to network analysis in functional brain imaging , 1994 .

[62]  Denis Burnham,et al.  Auditory-visual speech integration by prelinguistic infants: perception of an emergent consonant in the McGurk effect. , 2004, Developmental psychobiology.

[63]  Tali Bitan,et al.  Developmental changes in brain regions involved in phonological and orthographic processing during spoken language processing , 2008, NeuroImage.

[64]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[65]  J MacDonald,et al.  Hearing by Eye: How Much Spatial Degradation can Be Tolerated? , 2000, Perception.

[66]  J. Mehler,et al.  Language discrimination by newborns: toward an understanding of the role of rhythm. , 1998, Journal of experimental psychology. Human perception and performance.

[67]  William P. Fifer,et al.  Two-day-olds prefer their native language , 1993 .

[68]  Nancy Kanwisher,et al.  Divide and conquer: A defense of functional localizers , 2006, NeuroImage.

[69]  et al.,et al.  The Effect of Template Choice on Morphometric Analysis of Pediatric Brain Data ☆ , 2022 .

[70]  Barbara Dodd,et al.  Lip reading in infants: Attention to speech presented in- and out-of-synchrony , 1979, Cognitive Psychology.

[71]  Ryan A. Stevenson,et al.  Audiovisual integration in human superior temporal sulcus: Inverse effectiveness and the neural processing of speech and object recognition , 2009, NeuroImage.

[72]  Steven E Petersen,et al.  Does human functional brain organization shift from diffuse to focal with development? , 2006, Developmental science.

[73]  Bruce D. McCandliss,et al.  Development of neural systems for reading. , 2007, Annual review of neuroscience.

[74]  N. Kanwisher,et al.  The fusiform face area: a cortical region specialized for the perception of faces , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[75]  James R. Booth,et al.  Development of Brain Mechanisms for Processing Orthographic and Phonologic Representations , 2004, Journal of Cognitive Neuroscience.

[76]  R. C. Oldfield The assessment and analysis of handedness: the Edinburgh inventory. , 1971, Neuropsychologia.

[77]  Kristen A. Lindgren,et al.  Effective and Structural Connectivity in the Human Auditory Cortex , 2008, The Journal of Neuroscience.

[78]  Marlene C. Richter,et al.  The shape of the fMRI BOLD response in children and adults changes systematically with age , 2003, NeuroImage.

[79]  G. Eden,et al.  The Neural Basis of Hyperlexic Reading An fMRI Case Study , 2004, Neuron.

[80]  D. Burnham,et al.  Impact of language on development of auditory-visual speech perception. , 2008, Developmental science.

[81]  Virginie van Wassenhove,et al.  Auditory-visual fusion in speech perception in children with cochlear implants. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[82]  Michael S. Beauchamp,et al.  A neural basis for interindividual differences in the McGurk effect, a multisensory speech illusion , 2012, NeuroImage.

[83]  R. Buxton,et al.  Dynamics of blood flow and oxygenation changes during brain activation: The balloon model , 1998, Magnetic resonance in medicine.

[84]  Anders M. Dale,et al.  Cortical Surface-Based Analysis I. Segmentation and Surface Reconstruction , 1999, NeuroImage.

[85]  B. Argall,et al.  Simplified intersubject averaging on the cortical surface using SUMA , 2006, Human brain mapping.

[86]  B. Argall,et al.  Integration of Auditory and Visual Information about Objects in Superior Temporal Sulcus , 2004, Neuron.

[87]  Cécile Colin,et al.  Top-down and bottom-up modulation of audiovisual integration in speech , 2005 .

[88]  S. Dupont,et al.  A study of the McGurk effect in 4 and 5-year-old French Canadian children , 2005 .

[89]  L. Bernstein,et al.  Enhanced speechreading in deaf adults: can short-term training/practice close the gap for hearing adults? , 2001, Journal of speech, language, and hearing research : JSLHR.

[90]  Harald C. Traue,et al.  Neural correlates of the perception of dynamic versus static facial expressions of emotion , 2011, Psycho-social medicine.

[91]  R. Goebel,et al.  Integration of Letters and Speech Sounds in the Human Brain , 2004, Neuron.

[92]  Michael S. Beauchamp,et al.  Touch, sound and vision in human superior temporal sulcus , 2008, NeuroImage.

[93]  S. Petersen,et al.  Developmental changes in human cerebral functional organization for word generation. , 2005, Cerebral cortex.

[94]  R W Cox,et al.  AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. , 1996, Computers and biomedical research, an international journal.