A biologically inspired neurocomputational model for audiovisual integration and causal inference

Recently, experimental and theoretical research has focused on the brain's abilities to extract information from a noisy sensory environment and how cross‐modal inputs are processed to solve the causal inference problem to provide the best estimate of external events. Despite the empirical evidence suggesting that the nervous system uses a statistically optimal and probabilistic approach in addressing these problems, little is known about the brain's architecture needed to implement these computations. The aim of this work was to realize a mathematical model, based on physiologically plausible hypotheses, to analyze the neural mechanisms underlying multisensory perception and causal inference. The model consists of three layers topologically organized: two encode auditory and visual stimuli, separately, and are reciprocally connected via excitatory synapses and send excitatory connections to the third downstream layer. This synaptic organization realizes two mechanisms of cross‐modal interactions: the first is responsible for the sensory representation of the external stimuli, while the second solves the causal inference problem. We tested the network by comparing its results to behavioral data reported in the literature. Among others, the network can account for the ventriloquism illusion, the pattern of sensory bias and the percept of unity as a function of the spatial auditory–visual distance, and the dependence of the auditory error on the causal inference. Finally, simulations results are consistent with probability matching as the perceptual strategy used in auditory–visual spatial localization tasks, agreeing with the behavioral data. The model makes untested predictions that can be investigated in future behavioral experiments.

[1]  John C. Middlebrooks,et al.  Distributed coding of sound locations in the auditory cortex , 2003, Biological Cybernetics.

[2]  Wei Ji Ma,et al.  Bayesian inference with probabilistic population codes , 2006, Nature Neuroscience.

[3]  J. C. Middlebrooks,et al.  Codes for sound-source location in nontonotopic auditory cortex. , 1998, Journal of neurophysiology.

[4]  Ulrik R. Beierholm,et al.  Probability Matching as a Computational Strategy Used in Perception , 2010, PLoS Comput. Biol..

[5]  Uta Noppeney,et al.  Sensory reliability shapes perceptual inference via two mechanisms. , 2015, Journal of vision.

[6]  S. Shimojo,et al.  Visual illusion induced by sound. , 2002, Brain research. Cognitive brain research.

[7]  Mauro Ursino,et al.  A neural network for learning the meaning of objects and words from a featural representation , 2015, Neural Networks.

[8]  Mauro Ursino,et al.  Recognition of Abstract Objects Via Neural Oscillators: Interaction Among Topological Organization, Associative Memory and Gamma Band Synchronization , 2009, IEEE Transactions on Neural Networks.

[9]  M. Ernst,et al.  Humans integrate visual and haptic information in a statistically optimal fashion , 2002, Nature.

[10]  Mauro Ursino,et al.  A Neural Network Model of Cortical Auditory-visual Interactions - A Neurocomputational Analysis of the Shams-Illusion , 2012, IJCCI.

[11]  M. Rasch,et al.  Decentralized Multisensory Information Integration in Neural Systems , 2016, The Journal of Neuroscience.

[12]  David R. Wozny,et al.  Computational Characterization of Visually Induced Auditory Spatial Adaptation , 2011, Front. Integr. Neurosci..

[13]  A. King,et al.  Visual–auditory spatial processing in auditory cortical neurons , 2008, Brain Research.

[14]  A. J. King,et al.  Role of auditory cortex in sound localization in the midsagittal plane. , 2007, Journal of neurophysiology.

[15]  Konrad Paul Kording,et al.  Causal Inference in Multisensory Perception , 2007, PloS one.

[16]  Endel Põder,et al.  Crowding with detection and coarse discrimination of simple visual features. , 2008, Journal of vision.

[17]  L. Ellerbroek,et al.  Serotype Distribution of Salmonella Isolates from Turkey Ground Meat and Meat Parts , 2013, BioMed research international.

[18]  W. Ma,et al.  Towards a neural implementation of causal inference in cue combination. , 2013, Multisensory research.

[19]  U. Noppeney,et al.  Cortical Hierarchies Perform Bayesian Causal Inference in Multisensory Perception , 2015, PLoS biology.

[20]  Marc O. Ernst,et al.  Correlation detection as a general mechanism for multisensory integration , 2016, Nature Communications.

[21]  G. Recanzone Interactions of auditory and visual stimuli in space and time , 2009, Hearing Research.

[22]  Nadia Bolognini,et al.  A neurocomputational analysis of the sound-induced flash illusion , 2014, NeuroImage.

[23]  M. Wallace,et al.  Unifying multisensory signals across time and space , 2004, Experimental Brain Research.

[24]  Giuliano Iurilli,et al.  Sound-Driven Synaptic Inhibition in Primary Visual Cortex , 2012, Neuron.

[25]  C. Schroeder,et al.  Neuronal mechanisms, response dynamics and perceptual functions of multisensory interactions in auditory cortex , 2009, Hearing Research.

[26]  P. Bertelson,et al.  Cross-modal bias and perceptual fusion with auditory-visual spatial discordance , 1981, Perception & psychophysics.

[27]  S. Shimojo,et al.  Illusions: What you see is what you hear , 2000, Nature.

[28]  Ulrik R Beierholm,et al.  Sound-induced flash illusion as an optimal percept , 2005, Neuroreport.

[29]  G. DeAngelis,et al.  Multisensory Integration in Macaque Visual Cortex Depends on Cue Reliability , 2008, Neuron.

[30]  A. Pouget,et al.  Probabilistic brains: knowns and unknowns , 2013, Nature Neuroscience.

[31]  Masato Okada,et al.  Recurrent network for multisensory integration-identification of common sources of audiovisual stimuli , 2013, Front. Comput. Neurosci..

[32]  Mikko Sams,et al.  Factors influencing audiovisual fission and fusion illusions. , 2004, Brain research. Cognitive brain research.

[33]  Mauro Ursino,et al.  A Neural Network Model of Ventriloquism Effect and Aftereffect , 2012, PloS one.

[34]  Robert A Jacobs,et al.  Bayesian integration of visual and auditory signals for spatial localization. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[35]  Luc H. Arnal,et al.  Dual Neural Routing of Visual Facilitation in Speech Processing , 2009, The Journal of Neuroscience.

[36]  Ladan Shams,et al.  Biases in Visual, Auditory, and Audiovisual Perception of Space , 2015, PLoS Comput. Biol..

[37]  D. Burr,et al.  The Ventriloquist Effect Results from Near-Optimal Bimodal Integration , 2004, Current Biology.

[38]  Christopher R Fetsch,et al.  Neural correlates of reliability-based cue weighting during multisensory integration , 2011, Nature Neuroscience.

[39]  Mauro Ursino,et al.  Neurocomputational approaches to modelling multisensory integration in the brain: A review , 2014, Neural Networks.

[40]  Argye E. Hillis,et al.  Neural Correlates of Modality-specific Spatial Extinction , 2006, Journal of Cognitive Neuroscience.

[41]  R. Zemel,et al.  Inference and computation with population codes. , 2003, Annual review of neuroscience.

[42]  Ulrik R. Beierholm,et al.  Causal inference in perception , 2010, Trends in Cognitive Sciences.

[43]  A. Ghazanfar,et al.  Is neocortex essentially multisensory? , 2006, Trends in Cognitive Sciences.

[44]  Albert Jin Chung,et al.  Perception of Body Ownership Is Driven by Bayesian Sensory Inference , 2015, PloS one.

[45]  M. Wallace,et al.  Visual Localization Ability Influences Cross-Modal Bias , 2003, Journal of Cognitive Neuroscience.

[46]  Jennifer K. Bizley,et al.  Visual influences on ferret auditory cortex , 2009, Hearing Research.

[47]  Mauro Ursino,et al.  A Neural Network Model Can Explain Ventriloquism Aftereffect and Its Generalization across Sound Frequencies , 2013, BioMed research international.

[48]  Mauro Ursino,et al.  Multisensory Bayesian Inference Depends on Synapse Maturation during Training: Theoretical Analysis and Neural Modeling Implementation , 2017, Neural Computation.

[49]  Brian Zingg,et al.  Cross-Modality Sharpening of Visual Cortical Processing through Layer-1-Mediated Inhibition and Disinhibition , 2016, Neuron.

[50]  David R. Wozny,et al.  Human trimodal perception follows optimal statistical inference. , 2008, Journal of vision.