Hebbian learning of hand-centred representations in a hierarchical neural network model of the primate visual system

A subset of neurons in the posterior parietal and premotor areas of the primate brain respond to the locations of visual targets in a hand-centred frame of reference. Such hand-centred visual representations are thought to play an important role in visually-guided reaching to target locations in space. In this paper we show how a biologically plausible, Hebbian learning mechanism may account for the development of localized hand-centred representations in a hierarchical neural network model of the primate visual system, VisNet. The hand-centered neurons developed in the model use an invariance learning mechanism known as continuous transformation (CT) learning. In contrast to previous theoretical proposals for the development of hand-centered visual representations, CT learning does not need a memory trace of recent neuronal activity to be incorporated in the synaptic learning rule. Instead, CT learning relies solely on a Hebbian learning rule, which is able to exploit the spatial overlap that naturally occurs between successive images of a hand-object configuration as it is shifted across different retinal locations due to saccades. Our simulations show how individual neurons in the network model can learn to respond selectively to target objects in particular locations with respect to the hand, irrespective of where the hand-object configuration occurs on the retina. The response properties of these hand-centred neurons further generalise to localised receptive fields in the hand-centred space when tested on novel hand-object configurations that have not been explored during training. Indeed, even when the network is trained with target objects presented across a near continuum of locations around the hand during training, the model continues to develop hand-centred neurons with localised receptive fields in hand-centred space. With the help of principal component analysis, we provide the first theoretical framework that explains the behavior of Hebbian learning in VisNet.

[1]  C. Colby Action-Oriented Spatial Reference Frames in Cortex , 1998, Neuron.

[2]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[3]  Edmund T. Rolls,et al.  The neuronal encoding of information in the brain , 2011, Progress in Neurobiology.

[4]  Richard A. Andersen,et al.  Coding of the Reach Vector in Parietal Area 5d , 2012, Neuron.

[5]  Byron M. Yu,et al.  Reference frames for reach planning in macaque dorsal premotor cortex. , 2007, Journal of neurophysiology.

[6]  Edmund T. Rolls,et al.  Position invariant recognition in the visual system with cluttered environments , 2000, Neural Networks.

[7]  R. Andersen,et al.  The posterior parietal cortex: Sensorimotor interface for the planning and online control of visually guided movements , 2006, Neuropsychologia.

[8]  Mauro Ursino,et al.  Visuotactile Representation of Peripersonal Space: A Neural Network Study , 2010, Neural Computation.

[9]  Richard A Andersen,et al.  Integration of target and hand position signals in the posterior parietal cortex: effects of workspace and hand vision. , 2012, Journal of neurophysiology.

[10]  E. Oja Simplified neuron model as a principal component analyzer , 1982, Journal of mathematical biology.

[11]  Steve W. C. Chang,et al.  Using a Compound Gain Field to Compute a Reach Plan , 2009, Neuron.

[12]  M. Arbib Interaction of multiple representations of space in the brain. , 1991 .

[13]  J. F. Soechting,et al.  Early stages in a sensorimotor transformation , 1992, Behavioral and Brain Sciences.

[14]  M. Goodale,et al.  Two visual systems re-viewed , 2008, Neuropsychologia.

[15]  Ehud Zohary,et al.  Is That Near My Hand? Multisensory Representation of Peripersonal Space in Human Intraparietal Sulcus , 2007, The Journal of Neuroscience.

[16]  C. Gross,et al.  Spatial maps for the control of movement , 1998, Current Opinion in Neurobiology.

[17]  Peter Földiák,et al.  Learning Invariance from Transformation Sequences , 1991, Neural Comput..

[18]  P Fattori,et al.  Body-centered, mixed, but not hand-centered coding of visual targets in the medial posterior parietal cortex during reaches in 3D space. , 2014, Cerebral cortex.

[19]  Terence D. Sanger,et al.  Optimal unsupervised learning in a single-layer linear feedforward neural network , 1989, Neural Networks.

[20]  Benjamin D. Evans,et al.  A Self-Organizing Model of the Visual Development of Hand-Centred Representations , 2013, PloS one.

[21]  Terrence J. Sejnowski,et al.  Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.

[22]  Yoshua Bengio,et al.  Towards Biologically Plausible Deep Learning , 2015, ArXiv.

[23]  Olivier D. Faugeras,et al.  A Constructive Mean-Field Analysis of Multi-Population Neural Networks with Random Synaptic Weights and Stochastic Inputs , 2008, Front. Comput. Neurosci..

[24]  M. Graziano Where is my arm? The relative role of vision and proprioception in the neuronal representation of limb position. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[25]  M. Graziano Is Reaching Eye-Centered, Body-Centered, Hand-Centered, or a Combination? , 2001, Reviews in the neurosciences.

[26]  A. V. van den Berg,et al.  Adjacent visual representations of self-motion in different reference frames , 2011, Proceedings of the National Academy of Sciences.

[27]  C. Galletti,et al.  Mixed Body/Hand Reference Frame for Reaching in 3D Space in Macaque Parietal Area PEc , 2016, Cerebral cortex.

[28]  Y. Rossetti,et al.  Coding of Visual Space during Motor Preparation: Approaching Objects Rapidly Modulate Corticospinal Excitability in Hand-Centered Coordinates , 2009, The Journal of Neuroscience.

[29]  Edmund T. Rolls,et al.  Invariant visual object recognition: biologically plausible approaches , 2015, Biological Cybernetics.

[30]  E. Rolls,et al.  Continuous transformation learning of translation invariant representations , 2010, Experimental Brain Research.

[31]  Gordon Pipa,et al.  SORN: A Self-Organizing Recurrent Neural Network , 2009, Front. Comput. Neurosci..

[32]  Edmund T. Rolls,et al.  Learning invariant object recognition in the visual system with continuous transformations , 2006, Biological Cybernetics.

[33]  Alain Berthoz,et al.  Multiple reference frames used by the human brain for spatial perception and memory , 2010, Experimental Brain Research.

[34]  Werner Graf,et al.  Proprioceptive pathways to posterior parietal areas MIP and LIPv from the dorsal column nuclei and the postcentral somatosensory cortex , 2011, The European journal of neuroscience.

[35]  Sergey Levine,et al.  Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection , 2016, Int. J. Robotics Res..

[36]  Jeffrey Dean,et al.  Large-Scale Deep Learning For Building Intelligent Computer Systems , 2016, WSDM.

[37]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[38]  Valeria I. Petkova,et al.  Integration of visual and tactile signals from the hand in the human brain: an FMRI study. , 2011, Journal of neurophysiology.

[39]  Christopher A. Buneo,et al.  Direct visuomotor transformations for reaching , 2002, Nature.

[40]  Valeria I. Petkova,et al.  fMRI Adaptation Reveals a Cortical Mechanism for the Coding of Space Near the Hand , 2011, The Journal of Neuroscience.

[41]  Rodrigo Quian Quiroga,et al.  The visual development of hand-centered receptive fields in a neural network model of the primate visual system trained with experimentally recorded human gaze changes , 2016, Network.

[42]  Thomas Serre,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  H. Sakata,et al.  Somatosensory properties of neurons in the superior parietal cortex (area 5) of the rhesus monkey. , 1973, Brain research.

[44]  D. V. van Essen,et al.  Mapping of architectonic subdivisions in the macaque monkey, with emphasis on parieto‐occipital cortex , 2000, The Journal of comparative neurology.

[45]  Philippe Gaussier,et al.  Spatio-Temporal Tolerance of Visuo-Tactile Illusions in Artificial Skin by Recurrent Neural Network with Spike-Timing-Dependent Plasticity , 2017, Scientific Reports.

[46]  G. Rizzolatti,et al.  Space and selective attention , 1994 .

[47]  Mauro Ursino,et al.  A neural network model of peri-hand space representation and its plastic properties related to tool use , 2008, 2008 8th IEEE International Conference on BioInformatics and BioEngineering.

[48]  Simon M. Stringer,et al.  The Development of Hand-Centered Visual Representations in the Primate Brain: A Computer Modeling Study Using Natural Visual Scenes , 2015, Front. Comput. Neurosci..

[49]  J F Soechting,et al.  Moving in three-dimensional space: frames of reference, vectors, and coordinate systems. , 1992, Annual review of neuroscience.

[50]  Edmund T. Rolls,et al.  Invariant Visual Object and Face Recognition: Neural and Computational Bases, and a Model, VisNet , 2012, Front. Comput. Neurosci..

[51]  Richard A. Andersen,et al.  A back-propagation programmed network that simulates response properties of a subset of posterior parietal neurons , 1988, Nature.

[52]  Niraj S. Desai,et al.  Activity-dependent scaling of quantal amplitude in neocortical neurons , 1998, Nature.

[53]  James M. Tromans,et al.  A Computational Model of the Development of Separate Representations of Facial Identity and Expression in the Primate Visual System , 2011, PloS one.

[54]  Anthony R. Dickinson,et al.  Limb-Specific Representation for Reaching in the Posterior Parietal Cortex , 2008, The Journal of Neuroscience.

[55]  Benjamin Kuipers,et al.  Learning to reach by building a representation of peri-personal space , 2016, 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids).

[56]  T. Sejnowski,et al.  Spatial Transformations in the Parietal Cortex Using Basis Functions , 1997, Journal of Cognitive Neuroscience.

[57]  Edmund T. Rolls,et al.  A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures , 2000, Neural Computation.

[58]  H. Ehrsson,et al.  That's Near My Hand! Parietal and Premotor Coding of Hand-Centered Space Contributes to Localization and Self-Attribution of the Hand , 2012, Journal of Neuroscience.

[59]  R. Andersen,et al.  Dorsal Premotor Neurons Encode the Relative Position of the Hand, Eye, and Goal during Reach Planning , 2006, Neuron.

[60]  R Linsker,et al.  From basic network principles to neural architecture: emergence of spatial-opponent cells. , 1986, Proceedings of the National Academy of Sciences of the United States of America.