Learning Multisensory Integration and Coordinate Transformation via Density Estimation

Sensory processing in the brain includes three key operations: multisensory integration—the task of combining cues into a single estimate of a common underlying stimulus; coordinate transformations—the change of reference frame for a stimulus (e.g., retinotopic to body-centered) effected through knowledge about an intervening variable (e.g., gaze position); and the incorporation of prior information. Statistically optimal sensory processing requires that each of these operations maintains the correct posterior distribution over the stimulus. Elements of this optimality have been demonstrated in many behavioral contexts in humans and other animals, suggesting that the neural computations are indeed optimal. That the relationships between sensory modalities are complex and plastic further suggests that these computations are learned—but how? We provide a principled answer, by treating the acquisition of these mappings as a case of density estimation, a well-studied problem in machine learning and statistics, in which the distribution of observed data is modeled in terms of a set of fixed parameters and a set of latent variables. In our case, the observed data are unisensory-population activities, the fixed parameters are synaptic connections, and the latent variables are multisensory-population activities. In particular, we train a restricted Boltzmann machine with the biologically plausible contrastive-divergence rule to learn a range of neural computations not previously demonstrated under a single approach: optimal integration; encoding of priors; hierarchical integration of cues; learning when not to integrate; and coordinate transformation. The model makes testable predictions about the nature of multisensory representations.

[1]  Michael S. Lewicki,et al.  Efficient coding of natural sounds , 2002, Nature Neuroscience.

[2]  R. Andersen,et al.  Dorsal Premotor Neurons Encode the Relative Position of the Hand, Eye, and Goal during Reach Planning , 2006, Neuron.

[3]  Philip N. Sabes,et al.  Flexible strategies for sensory integration during motor planning , 2005, Nature Neuroscience.

[4]  H. Bülthoff,et al.  Merging the senses into a robust percept , 2004, Trends in Cognitive Sciences.

[5]  Frank Bremmer,et al.  Attentional Modulation of Visual Receptive Fields in the Posterior Parietal Cortex of the Behaving Macaque , 1997 .

[6]  D. V. van Essen,et al.  Corticocortical connections of visual, sensorimotor, and multimodal processing areas in the parietal lobe of the macaque monkey , 2000, The Journal of comparative neurology.

[7]  M. Graziano Where is my arm? The relative role of vision and proprioception in the neuronal representation of limb position. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Gordon M. Redding,et al.  Applications of prism adaptation: a tutorial in theory and method , 2005, Neuroscience & Biobehavioral Reviews.

[9]  A. Sittig,et al.  Integration of proprioceptive and visual position-information: An experimentally supported model. , 1999, Journal of neurophysiology.

[10]  R. Andersen,et al.  Models of the Posterior Parietal Cortex Which Perform Multimodal Integration and Represent Space in Several Coordinate Frames , 2000, Journal of Cognitive Neuroscience.

[11]  Geoffrey E. Hinton,et al.  Self-organizing neural network that discovers surfaces in random-dot stereograms , 1992, Nature.

[12]  J. Lisman,et al.  A mechanism for the Hebb and the anti-Hebb processes underlying learning and memory. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[13]  M. Ernst,et al.  Humans integrate visual and haptic information in a statistically optimal fashion , 2002, Nature.

[14]  H Barlow,et al.  Redundancy reduction revisited , 2001, Network.

[15]  Andrew P Davison,et al.  Learning Cross-Modal Spatial Transformations through Spike Timing-Dependent Plasticity , 2006, The Journal of Neuroscience.

[16]  Konrad Paul Kording,et al.  Bayesian integration in sensorimotor learning , 2004, Nature.

[17]  Philip N. Sabes,et al.  Heterogeneous Representations in the Superior Parietal Lobule Are Common across Reaches to Visual and Proprioceptive Targets , 2011, The Journal of Neuroscience.

[18]  F. Bremmer,et al.  Spatial invariance of visual receptive fields in parietal cortex neurons , 1997, Nature.

[19]  Mriganka Sur,et al.  Role of afferent activity in the development of cortical specification. , 2002, Results and problems in cell differentiation.

[20]  M. Goldberg,et al.  Ventral intraparietal area of the macaque: congruent visual and somatic response properties. , 1998, Journal of neurophysiology.

[21]  Philip N. Sabes,et al.  Multisensory Integration during Motor Planning , 2003, The Journal of Neuroscience.

[22]  P. Földiák,et al.  The ‘Ideal Homunculus’: Statistical Inference from Neural Population Responses , 1993 .

[23]  Philip N. Sabes,et al.  How Each Movement Changes the Next: An Experimental and Theoretical Study of Fast Adaptive Priors in Reaching , 2011, The Journal of Neuroscience.

[24]  Geoffrey E. Hinton,et al.  Exponential Family Harmoniums with an Application to Information Retrieval , 2004, NIPS.

[25]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[26]  Alexandre Pouget,et al.  A computational perspective on the neural basis of multisensory spatial representations , 2002, Nature Reviews Neuroscience.

[27]  Jascha Sohl-Dickstein,et al.  Minimum Probability Flow Learning , 2009, ICML.

[28]  Paul B. Johnson,et al.  Visuomotor transformations underlying arm movements toward visual targets: a neural network model of cerebral cortical operations , 1992, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[29]  R. Andersen,et al.  The posterior parietal cortex: Sensorimotor interface for the planning and online control of visually guided movements , 2006, Neuropsychologia.

[30]  P. McCullagh,et al.  Generalized Linear Models , 1972, Predictive Analytics.

[31]  Bruno A. Olshausen,et al.  PROBABILISTIC FRAMEWORK FOR THE ADAPTATION AND COMPARISON OF IMAGE CODES , 1999 .

[32]  L F Abbott,et al.  Transfer of coded information from sensory to motor networks , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[33]  R. Held,et al.  PLASTICITY IN HUMAN SENSORIMOTOR CONTROL. , 1963, Science.

[34]  A. Pouget,et al.  Reference frames for representing visual and tactile locations in parietal cortex , 2005, Nature Neuroscience.

[35]  A. Pouget,et al.  Efficient computation and cue integration with noisy population codes , 2001, Nature Neuroscience.

[36]  Richard A. Andersen,et al.  Coding of the Reach Vector in Parietal Area 5d , 2012, Neuron.

[37]  W. Wildman,et al.  Theoretical Neuroscience , 2014 .

[38]  R. Held,et al.  MOVEMENT-PRODUCED STIMULATION IN THE DEVELOPMENT OF VISUALLY GUIDED BEHAVIOR. , 1963, Journal of comparative and physiological psychology.

[39]  Philip N. Sabes,et al.  Sensory transformations and the use of multiple reference frames for reach planning , 2009, Nature Neuroscience.

[40]  C. Galletti,et al.  The cortical connections of area V6: an occipito‐parietal network processing visual information , 2001, The European journal of neuroscience.

[41]  Christopher R Fetsch,et al.  Neural correlates of reliability-based cue weighting during multisensory integration , 2011, Nature Neuroscience.

[42]  Matthias Bethge,et al.  Natural Image Coding in V1: How Much Use Is Orientation Selectivity? , 2008, PLoS Comput. Biol..

[43]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[44]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[45]  Jascha Sohl-Dickstein,et al.  A new method for parameter estimation in probabilistic models: Minimum probability flow , 2011, Physical review letters.

[46]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[47]  M. Sur,et al.  Cross-modal plasticity in cortical development: differentiation and specification of sensory neocortex , 1990, Trends in Neurosciences.

[48]  Philip N. Sabes,et al.  Sensory integration for reaching: models of optimality in the context of behavior and the underlying neural circuits. , 2011, Progress in brain research.

[49]  K. Grant,et al.  Storage of a sensory pattern by anti-Hebbian synaptic plasticity in an electric fish. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[50]  D. Burr,et al.  The Ventriloquist Effect Results from Near-Optimal Bimodal Integration , 2004, Current Biology.

[51]  Zoubin Ghahramani,et al.  Factorial Learning and the EM Algorithm , 1994, NIPS.

[52]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[53]  E. Knudsen,et al.  Vision calibrates sound localization in developing barn owls , 1989, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[54]  F. Lacquaniti,et al.  Combination of hand and gaze signals during reaching: activity in parietal area 7 m of the monkey. , 1997, Journal of neurophysiology.

[55]  Wei Ji Ma,et al.  Bayesian inference with probabilistic population codes , 2006, Nature Neuroscience.

[56]  Eero P. Simoncelli,et al.  Noise characteristics and prior expectations in human visual speed perception , 2006, Nature Neuroscience.

[57]  Philip N. Sabes,et al.  Visual-shift adaptation is composed of separable sensory and task-dependent effects. , 2007, Journal of neurophysiology.

[58]  Robert A. Jacobs,et al.  A Rational Analysis of the Acquisition of Multisensory Representations , 2012, Cogn. Sci..

[59]  Francesco Lacquaniti,et al.  Multiple levels of representation of reaching in the parieto-frontal network. , 2003, Cerebral cortex.

[60]  Paul B. Johnson,et al.  Premotor and parietal cortex: corticocortical connectivity and combinatorial computations. , 1997, Annual review of neuroscience.

[61]  Si Wu,et al.  Computing with Continuous Attractors: Stability and Online Aspects , 2005, Neural Computation.

[62]  Lawrence H Snyder,et al.  Idiosyncratic and systematic aspects of spatial representations in the macaque parietal cortex , 2010, Proceedings of the National Academy of Sciences.

[63]  Michael I. Jordan,et al.  Generalization to Local Remappings of the Visuomotor Coordinate Transformation , 1996, The Journal of Neuroscience.

[64]  S. Zeki,et al.  A visuo‐somatomotor pathway through superior parietal cortex in the macaque monkey: cortical connections of areas V6 and V6A , 1998, The European journal of neuroscience.

[65]  Konrad Paul Kording,et al.  Causal Inference in Multisensory Perception , 2007, PloS one.

[66]  H. B. Barlow,et al.  Possible Principles Underlying the Transformations of Sensory Messages , 2012 .

[67]  D. Knill,et al.  The Bayesian brain: the role of uncertainty in neural coding and computation , 2004, Trends in Neurosciences.

[68]  C. Gross,et al.  Spatial maps for the control of movement , 1998, Current Opinion in Neurobiology.

[69]  Paul B. Johnson,et al.  Cortical networks for visual reaching: physiological and anatomical organization of frontal and parietal lobe arm regions. , 1996, Cerebral cortex.