Different Neural Networks Are Involved in Audiovisual Speech Perception Depending on the Context

How are we able to easily and accurately recognize speech sounds despite the lack of acoustic invariance? One proposed solution is the existence of a neural representation of speech syllable perception that transcends its sensory properties. In the present fMRI study, we used two different audiovisual speech contexts both intended to identify brain areas whose levels of activation would be conditioned by the speech percept independent from its sensory source information. We exploited McGurk audiovisual fusion to obtain short oddball sequences of syllables that were either (a) acoustically different but perceived as similar or (b) acoustically identical but perceived as different. We reasoned that, if there is a single network of brain areas representing abstract speech perception, this network would show a reduction of activity when presented with syllables that are acoustically different but perceived as similar and an increase in activity when presented with syllables that are acoustically similar but perceived as distinct. Consistent with the long-standing idea that speech production areas may be involved in speech perception, we found that frontal areas were part of the neural network that showed reduced activity for sequences of perceptually similar syllables. Another network was revealed, however, when focusing on areas that exhibited increased activity for perceptually different but acoustically identical syllables. This alternative network included auditory areas but no left frontal activations. In addition, our findings point to the importance of subcortical structures much less often considered when addressing issues pertaining to perceptual representations.

[1]  V. Caggiano,et al.  The Mirror Neuron System , 2011, The Neuroscientist : a review journal bringing neurobiology, neurology and psychiatry.

[2]  M. Arbib Action to language via the mirror neuron system , 2006 .

[3]  Karl J. Friston,et al.  Evidence of Mirror Neurons in Human Inferior Frontal Gyrus , 2009, The Journal of Neuroscience.

[4]  P. Fonlupt,et al.  Cortical dynamics of a self driven choice: A MEG study during a card sorting task , 2010, Clinical Neurophysiology.

[5]  J. Changeux,et al.  A Neuronal Model of Predictive Coding Accounting for the Mismatch Negativity , 2012, The Journal of Neuroscience.

[6]  Ruth Campbell,et al.  The processing of audio-visual speech: empirical and neural bases , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[7]  A. Puce,et al.  Neuronal oscillations and visual amplification of speech , 2008, Trends in Cognitive Sciences.

[8]  Michael I. Jordan,et al.  Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[9]  E. Bullmore,et al.  Activation of auditory cortex during silent lipreading. , 1997, Science.

[10]  R. Sperry Neural basis of the spontaneous optokinetic response produced by visual inversion. , 1950, Journal of comparative and physiological psychology.

[11]  Philip Lieberman,et al.  Speech production, syntax comprehension, and cognitive deficits in Parkinson's disease , 1992, Brain and Language.

[12]  R. Hari,et al.  Seeing speech: visual information from lip movements modifies activity in the human auditory cortex , 1991, Neuroscience Letters.

[13]  M. Hallett,et al.  Neural Correlates of Auditory–Visual Stimulus Onset Asynchrony Detection , 2001, The Journal of Neuroscience.

[14]  P. Denes On the Motor Theory of Speech Perception , 1965 .

[15]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[16]  R. Näätänen,et al.  The mismatch negativity (MMN) in basic research of central auditory processing: A review , 2007, Clinical Neurophysiology.

[17]  A. Borst Seeing smells: imaging olfactory learning in bees , 1999, Nature Neuroscience.

[18]  A. Dale,et al.  The Retinotopy of Visual Spatial Attention , 1998, Neuron.

[19]  Alan C. Evans,et al.  Modulation of cerebral blood-flow in the human auditory cortex during speech: role of motor-to-sensory discharges , 1996, NeuroImage.

[20]  M. Iacoboni,et al.  Listening to speech activates motor areas involved in speech production , 2004, Nature Neuroscience.

[21]  G. Salamon,et al.  Mutism and auditory agnosia due to bilateral insular damage—Role of the insula in human communication , 1995, Neuropsychologia.

[22]  Steven L. Small,et al.  Listening to talking faces: motor cortical activation during speech perception , 2005, NeuroImage.

[23]  G. Rizzolatti,et al.  Understanding motor events: a neurophysiological study , 2004, Experimental Brain Research.

[24]  Mikko Sams,et al.  Processing of changes in visual speech in the human auditory cortex. , 2002, Brain research. Cognitive brain research.

[25]  D. Poeppel,et al.  Towards a functional neuroanatomy of speech perception , 2000, Trends in Cognitive Sciences.

[26]  Jeffery A. Jones,et al.  Multisensory Integration Sites Identified by Perception of Spatial Wavelet Filtered Visual Speech Gesture Information , 2004, Journal of Cognitive Neuroscience.

[27]  Steven L. Small,et al.  Abstract Coding of Audiovisual Speech: Beyond Sensory Representation , 2007, Neuron.

[28]  Barbara Dodd,et al.  The Role of Vision in the Perception of Speech , 1977, Perception.

[29]  F. Lin,et al.  Primary and multisensory cortical activity is correlated with audiovisual percepts , 2009, Human brain mapping.

[30]  P. Deltenre,et al.  Mismatch negativity evoked by the McGurk–MacDonald effect: a phonetic representation within short-term memory , 2002, Clinical Neurophysiology.

[31]  T. Vilis,et al.  Integration of target and effector information in human posterior parietal cortex for the planning of action. , 2005, Journal of neurophysiology.

[32]  D. Heeger,et al.  Topographic organization for delayed saccades in human posterior parietal cortex. , 2005, Journal of neurophysiology.

[33]  Audrey R. Nath,et al.  fMRI-Guided Transcranial Magnetic Stimulation Reveals That the Superior Temporal Sulcus Is a Cortical Locus of the McGurk Effect , 2010, The Journal of Neuroscience.

[34]  I. Winkler,et al.  Auditory processing that leads to conscious perception: a unique window to central auditory processing opened by the mismatch negativity and related responses. , 2011, Psychophysiology.

[35]  Mikko Sams,et al.  Visual Processing Affects the Neural Basis of Auditory Discrimination , 2008, Journal of Cognitive Neuroscience.

[36]  D. Massaro Preperceptual images, processing time, and perceptual units in auditory perception. , 1972, Psychological review.

[37]  Mikko Sams,et al.  Primary auditory cortex activation by visual speech , 2004 .

[38]  R. Brubaker Models for the perception of speech and visual form: Weiant Wathen-Dunn, ed.: Cambridge, Mass., The M.I.T. Press, I–X, 470 pages , 1968 .

[39]  W. H. Sumby,et al.  Visual contribution to speech intelligibility in noise , 1954 .

[40]  Benjamin O Turner,et al.  Annals of the New York Academy of Sciences Hemispheric Lateralization in Reasoning , 2022 .

[41]  Nikos Makris,et al.  Automatically parcellating the human cerebral cortex. , 2004, Cerebral cortex.

[42]  T. Allison,et al.  Temporal Cortex Activation in Humans Viewing Eye and Mouth Movements , 1998, The Journal of Neuroscience.

[43]  Tai Sing Lee,et al.  Hierarchical Bayesian inference in the visual cortex. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[44]  O. Bertrand,et al.  Visual Activation and Audiovisual Interactions in the Auditory Cortex during Speech Perception: Intracranial Recordings in Humans , 2008, The Journal of Neuroscience.

[45]  R. Blake,et al.  Brain Areas Involved in Perception of Biological Motion , 2000, Journal of Cognitive Neuroscience.

[46]  M. Sams,et al.  Primary auditory cortex activation by visual speech: an fMRI study at 3 T , 2005, Neuroreport.

[47]  P. Deltenre,et al.  Generalization of the generation of an MMN by illusory McGurk percepts: voiceless consonants , 2004, Clinical Neurophysiology.

[48]  A M Liberman,et al.  Perception of the speech code. , 1967, Psychological review.

[49]  Jeremy I. Skipper,et al.  Action to Language via the Mirror Neuron System: Lending a helping hand to hearing: another motor theory of speech perception , 2006 .

[50]  G. Rizzolatti,et al.  The mirror-neuron system. , 2004, Annual review of neuroscience.

[51]  N. Dronkers A new brain region for coordinating speech articulation , 1996, Nature.

[52]  A. Liberman,et al.  The motor theory of speech perception revised , 1985, Cognition.

[53]  R. Turner,et al.  Language Control in the Bilingual Brain , 2006, Science.

[54]  Mikko Sams,et al.  Perception of matching and conflicting audiovisual speech in dyslexic and fluent readers: An fMRI study at 3 T , 2006, NeuroImage.

[55]  A. Craig,et al.  How do you feel — now? The anterior insula and human awareness , 2009, Nature Reviews Neuroscience.

[56]  John J. Foxe,et al.  Seeing voices: High-density electrical mapping and source-analysis of the multisensory mismatch negativity evoked during the McGurk illusion , 2007, Neuropsychologia.

[57]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[58]  David A. Medler,et al.  Neural correlates of sensory and decision processes in auditory object identification , 2004, Nature Neuroscience.

[59]  Mikko Sams,et al.  Processing of audiovisual speech in Broca's area , 2005, NeuroImage.

[60]  K. Grill-Spector,et al.  Repetition and the brain: neural models of stimulus-specific effects , 2006, Trends in Cognitive Sciences.

[61]  Karl J. Friston,et al.  A theory of cortical responses , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.