Causal Inference in Audiovisual Perception

In our natural environment the senses are continuously flooded with a myriad of signals. To form a coherent representation of the world, the brain needs to integrate sensory signals arising from a common cause and segregate signals coming from separate causes. An unresolved question is how the brain solves this binding or causal inference problem and determines the causal structure of the sensory signals. In this functional magnetic resonance imaging (fMRI) study human observers (female and male) were presented with synchronous auditory and visual signals at the same location (i.e., common cause) or different locations (i.e., separate causes). On each trial, observers decided whether signals come from common or separate sources(i.e., “causal decisions”). To dissociate participants' causal inference from the spatial correspondence cues we adjusted the audiovisual disparity of the signals individually for each participant to threshold accuracy. Multivariate fMRI pattern analysis revealed the lateral prefrontal cortex as the only region that encodes predominantly the outcome of observers' causal inference (i.e., common vs separate causes). By contrast, the frontal eye field (FEF) and the intraparietal sulcus (IPS0–4) form a circuitry that concurrently encodes spatial (auditory and visual stimulus locations), decisional (causal inference), and motor response dimensions. These results suggest that the lateral prefrontal cortex plays a key role in inferring and making explicit decisions about the causal structure that generates sensory signals in our environment. By contrast, informed by observers' inferred causal structure, the FEF–IPS circuitry integrates auditory and visual spatial signals into representations that guide motor responses. SIGNIFICANCE STATEMENT In our natural environment, our senses are continuously flooded with a myriad of signals. Transforming this barrage of sensory signals into a coherent percept of the world relies inherently on solving the causal inference problem, deciding whether sensory signals arise from a common cause and should hence be integrated or else be segregated. This functional magnetic resonance imaging study shows that the lateral prefrontal cortex plays a key role in inferring the causal structure of the environment. Crucially, informed by the spatial correspondence cues and the inferred causal structure the frontal eye field and the intraparietal sulcus form a circuitry that integrates auditory and visual spatial signals into representations that guide motor responses.

[1]  Uta Noppeney,et al.  Integration of audiovisual spatial signals is not consistent with maximum likelihood estimation , 2019, Cortex.

[2]  Joost X. Maier,et al.  Multisensory Integration of Dynamic Faces and Voices in Rhesus Monkey Auditory Cortex , 2005 .

[3]  M. Wallace,et al.  A revised view of sensory cortical parcellation , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Luigi Acerbi,et al.  Bayesian comparison of explicit and implicit causal inference strategies in multisensory heading perception , 2017, bioRxiv.

[5]  Uta Noppeney,et al.  Reliability-Weighted Integration of Audiovisual Signals Can Be Modulated by Top-down Attention , 2018, eNeuro.

[6]  Uta Noppeney,et al.  How prior expectations shape multisensory perception , 2016, NeuroImage.

[7]  J W Belliveau,et al.  Borders of multiple visual areas in humans revealed by functional magnetic resonance imaging. , 1995, Science.

[8]  Uta Noppeney,et al.  Physical and Perceptual Factors Shape the Neural Mechanisms That Integrate Audiovisual Signals in Speech Comprehension , 2011, The Journal of Neuroscience.

[9]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[10]  D G Pelli,et al.  The VideoToolbox software for visual psychophysics: transforming numbers into movies. , 1997, Spatial vision.

[11]  S. Hillyard,et al.  Neural Basis of the Ventriloquist Illusion , 2007, Current Biology.

[12]  G. Recanzone,et al.  Serial and parallel processing in the primate auditory cortex revisited , 2010, Behavioural Brain Research.

[13]  P. Gribble,et al.  Temporal constraints on the McGurk effect , 1996, Perception & psychophysics.

[14]  U. Noppeney,et al.  Perceptual Decisions Formed by Accumulation of Audiovisual Evidence in Prefrontal Cortex , 2010, The Journal of Neuroscience.

[15]  G. Recanzone Interactions of auditory and visual stimuli in space and time , 2009, Hearing Research.

[16]  Karl J. Friston,et al.  A theory of cortical responses , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[17]  Randy L. Gollub,et al.  Multi-site characterization of an fMRI working memory paradigm: Reliability of activation indices , 2010, NeuroImage.

[18]  Uta Noppeney,et al.  Prior auditory information shapes visual category-selectivity in ventral occipito-temporal cortex , 2010, NeuroImage.

[19]  Elia Formisano,et al.  An anatomical and functional topography of human auditory cortical areas , 2014, Front. Neurosci..

[20]  Uta Noppeney,et al.  Audiovisual asynchrony detection in human speech. , 2011, Journal of experimental psychology. Human perception and performance.

[21]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[22]  Karl J. Friston,et al.  Multisubject fMRI Studies and Conjunction Analyses , 1999, NeuroImage.

[23]  D H Brainard,et al.  The Psychophysics Toolbox. , 1997, Spatial vision.

[24]  Momchil S. Tomov,et al.  Neural Computations Underlying Causal Structure Learning , 2017, The Journal of Neuroscience.

[25]  Daniel Senkowski,et al.  Multisensory processing and oscillatory gamma responses: effects of spatial selective attention , 2005, Experimental Brain Research.

[26]  Lee M. Miller,et al.  Speech Cues Contribute to Audiovisual Spatial Integration , 2011, PloS one.

[27]  M. Ernst,et al.  Humans integrate visual and haptic information in a statistically optimal fashion , 2002, Nature.

[28]  L. Krubitzer,et al.  Multisensory plasticity in congenitally deaf mice: How are cortical areas functionally specified? , 2006, Neuroscience.

[29]  Uta Noppeney,et al.  The contributions of transient and sustained response codes to audiovisual integration. , 2011, Cerebral cortex.

[30]  M. Sereno,et al.  Multisensory maps in parietal cortex☆ , 2014, Current Opinion in Neurobiology.

[31]  G. Calvert Crossmodal processing in the human brain: insights from functional neuroimaging studies. , 2001, Cerebral cortex.

[32]  G. Recanzone,et al.  Temporal and spatial dependency of the ventriloquism effect , 2001, Neuroreport.

[33]  D. Burr,et al.  The Ventriloquist Effect Results from Near-Optimal Bimodal Integration , 2004, Current Biology.

[34]  Denis G. Pelli,et al.  ECVP '07 Abstracts , 2007, Perception.

[35]  U. Noppeney,et al.  Audiovisual Synchrony Improves Motion Discrimination via Enhanced Connectivity between Early Visual and Auditory Areas , 2010, The Journal of Neuroscience.

[36]  Karl J. Friston,et al.  The effect of prior visual information on recognition of speech and sounds. , 2008, Cerebral cortex.

[37]  J. Kaiser,et al.  Object Familiarity and Semantic Congruency Modulate Responses in Cortical Audiovisual Integration Areas , 2007, The Journal of Neuroscience.

[38]  Ann-Christine Ehlis,et al.  The neural dynamics of hierarchical Bayesian causal inference in multisensory perception , 2019, Nature Communications.

[39]  Uta Noppeney,et al.  Distinct Computational Principles Govern Multisensory Integration in Primary Sensory and Association Cortices , 2016, Current Biology.

[40]  S. Kastner,et al.  Topographic maps in human frontal and parietal cortex , 2009, Trends in Cognitive Sciences.

[41]  Thomas E. Nichols,et al.  Nonparametric permutation tests for functional neuroimaging: A primer with examples , 2002, Human brain mapping.

[42]  D. Poeppel,et al.  Temporal window of integration in auditory-visual speech perception , 2007, Neuropsychologia.

[43]  M. Ernst,et al.  When Correlation Implies Causation in Multisensory Integration , 2012, Current Biology.

[44]  Joost X. Maier,et al.  Natural, Metaphoric, and Linguistic Auditory Direction Signals Have Distinct Influences on Visual Motion Processing , 2009, The Journal of Neuroscience.

[45]  Murray Mm,et al.  Convergence of Auditory, Visual, and Somatosensory Information in Ventral Prefrontal Cortex -- The Neural Bases of Multisensory Processes , 2012 .

[46]  M. Mesulam,et al.  From sensation to cognition. , 1998, Brain : a journal of neurology.

[47]  Jacob L Yates,et al.  The Role of the Lateral Intraparietal Area in (the Study of) Decision Making. , 2017, Annual review of neuroscience.

[48]  U. Noppeney,et al.  Superadditive responses in superior temporal sulcus predict audiovisual benefits in object categorization. , 2010, Cerebral cortex.

[49]  M. Wallace,et al.  Unifying multisensory signals across time and space , 2004, Experimental Brain Research.

[50]  U. Noppeney,et al.  To integrate or not to integrate: Temporal dynamics of hierarchical Bayesian causal inference , 2019, PLoS biology.

[51]  Ulrik R. Beierholm,et al.  Causal inference in perception , 2010, Trends in Cognitive Sciences.

[52]  Liang Wang,et al.  Probabilistic Maps of Visual Topography in Human Cortex. , 2015, Cerebral cortex.

[53]  Christoph Kayser,et al.  Do early sensory cortices integrate cross-modal information? , 2007, Brain Structure and Function.

[54]  Anders M. Dale,et al.  Cortical Surface-Based Analysis I. Segmentation and Surface Reconstruction , 1999, NeuroImage.

[55]  A. Ghazanfar,et al.  Is neocortex essentially multisensory? , 2006, Trends in Cognitive Sciences.

[56]  J. Driver,et al.  Multisensory Interplay Reveals Crossmodal Influences on ‘Sensory-Specific’ Brain Regions, Neural Responses, and Judgments , 2008, Neuron.

[57]  A. Faisal,et al.  Noise in the nervous system , 2008, Nature Reviews Neuroscience.

[58]  P. Goldman-Rakic,et al.  Cytoarchitectonic definition of prefrontal areas in the normal human cortex: II. Variability in locations of areas 9 and 46 and relationship to the Talairach Coordinate System. , 1995, Cerebral cortex.

[59]  J. Rauschecker,et al.  Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing , 2009, Nature Neuroscience.

[60]  U. Noppeney,et al.  Distinct Functional Contributions of Primary Sensory and Association Areas to Audiovisual Integration in Object Categorization , 2010, The Journal of Neuroscience.

[61]  J. Rieger,et al.  Audiovisual Temporal Correspondence Modulates Human Multisensory Superior Temporal Sulcus Plus Primary Sensory Cortices , 2007, The Journal of Neuroscience.

[62]  U. Noppeney,et al.  Cortical Hierarchies Perform Bayesian Causal Inference in Multisensory Perception , 2015, PLoS biology.

[63]  Christoph Kayser,et al.  Causal Inference in the Multisensory Brain , 2018, Neuron.

[64]  J. Rauschecker,et al.  Mechanisms and streams for processing of "what" and "where" in auditory cortex. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[65]  Anders M. Dale,et al.  Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature , 2010, NeuroImage.

[66]  Jacqueline Gottlieb,et al.  Spatial and non-spatial functions of the parietal cortex , 2010, Current Opinion in Neurobiology.

[67]  Uta Noppeney,et al.  Sensory reliability shapes perceptual inference via two mechanisms. , 2015, Journal of vision.

[68]  Marc O. Ernst,et al.  Correlation detection as a general mechanism for multisensory integration , 2016, Nature Communications.

[69]  Claude Alain,et al.  Assessing the auditory dual-pathway model in humans , 2004, NeuroImage.

[70]  Markus Siegel,et al.  Cortical information flow during flexible sensorimotor decisions , 2015, Science.

[71]  C. Schroeder,et al.  Neuronal Oscillations and Multisensory Interaction in Primary Auditory Cortex , 2007, Neuron.

[72]  Lotfi B Merabet,et al.  Visual Topography of Human Intraparietal Sulcus , 2007, The Journal of Neuroscience.

[73]  P. Bertelson,et al.  Cross-modal bias and perceptual fusion with auditory-visual spatial discordance , 1981, Perception & psychophysics.

[74]  Charles Spence,et al.  ‘When Birds of a Feather Flock Together’: Synesthetic Correspondences Modulate Audiovisual Integration in Non-Synesthetes , 2009, PloS one.

[75]  J. Lewald,et al.  Cross-modal perceptual integration of spatially and temporally disparate auditory and visual stimuli. , 2003, Brain research. Cognitive brain research.

[76]  Jean-Baptiste Poline,et al.  Analysis of a large fMRI cohort: Statistical and methodological issues for group analyses , 2007, NeuroImage.

[77]  Konrad Paul Kording,et al.  Causal Inference in Multisensory Perception , 2007, PloS one.

[78]  Jonathan W. Pillow,et al.  Dissociated functional significance of decision-related activity in the primate dorsal stream , 2016, Nature.

[79]  Wei Ji Ma,et al.  Causal inference of asynchronous audiovisual speech , 2013, Front. Psychol..

[80]  N. Prins Psychophysics: A Practical Introduction , 2009 .

[81]  T. Griffiths,et al.  Distinct Mechanisms for Processing Spatial Sequences and Pitch Sequences in the Human Auditory Brain , 2003, The Journal of Neuroscience.

[82]  P. Goldman-Rakic,et al.  Preface: Cerebral Cortex Has Come of Age , 1991 .

[83]  Uta Noppeney,et al.  Sensory and Striatal Areas Integrate Auditory and Visual Signals into Behavioral Benefits during Motion Discrimination , 2013, The Journal of Neuroscience.

[84]  V. Ekroll,et al.  Partial modal completion under occlusion: what do modal and amodal percepts represent? , 2015, Journal of vision.

[85]  John J. Foxe,et al.  Multisensory auditory-visual interactions during early sensory processing in humans: a high-density electrical mapping study. , 2002, Brain research. Cognitive brain research.

[86]  Rajesh P. N. Rao,et al.  Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. , 1999 .

[87]  J. Driver Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading , 1996, Nature.

[88]  Karl J. Friston,et al.  Assessing the significance of focal activations using their spatial extent , 1994, Human brain mapping.

[89]  Rainer Goebel,et al.  Top–down task effects overrule automatic multisensory responses to letter–sound pairs in auditory association cortex , 2006, NeuroImage.

[90]  R. Gregory The Most Expensive Painting in the World , 2007, Perception.

[91]  Uta Noppeney,et al.  Long-term music training tunes how the brain temporally binds signals from multiple senses , 2011, Proceedings of the National Academy of Sciences.

[92]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .