Invariant Global Motion Recognition in the Dorsal Visual System: A Unifying Theory

The motion of an object (such as a wheel rotating) is seen as consistent independent of its position and size on the retina. Neurons in higher cortical visual areas respond to these global motion stimuli invariantly, but neurons in early cortical areas with small receptive fields cannot represent this motion, not only because of the aperture problem but also because they do not have invariant representations. In a unifying hypothesis with the design of the ventral cortical visual system, we propose that the dorsal visual system uses a hierarchical feedforward network architecture (V1, V2, MT, MSTd, parietal cortex) with training of the connections with a short-term memory trace associative synaptic modification rule to capture what is invariant at each stage. Simulations show that the proposal is computationally feasible, in that invariant representations of the motion flow fields produced by objects self-organize in the later layers of the architecture. The model produces invariant representations of the motion flow fields produced by global in-plane motion of an object, in-plane rotational motion, looming versus receding of the object, and object-based rotation about a principal axis. Thus, the dorsal and ventral visual systems may share some similar computational principles.

[1]  T. Poggio,et al.  Cognitive neuroscience: Neural mechanisms for the recognition of biological movements , 2003, Nature Reviews Neuroscience.

[2]  Edmund T. Rolls,et al.  A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures , 2000, Neural Computation.

[3]  E. Adelson,et al.  The analysis of moving visual patterns , 1985 .

[4]  Keiji Tanaka,et al.  Inferotemporal cortex and object vision. , 1996, Annual review of neuroscience.

[5]  E. Rolls,et al.  Neural networks and brain function , 1998 .

[6]  E. Rolls,et al.  Neurodynamics of biased competition and cooperation for attention: a model with spiking neurons. , 2005, Journal of neurophysiology.

[7]  D. Pandya,et al.  Afferent cortical connections and architectonics of the superior temporal sulcus and surrounding cortex in the rhesus monkey , 1978, Brain Research.

[8]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[9]  WenLu Yang,et al.  Computational model for perception of objects and motions , 2008, Science in China Series C: Life Sciences.

[10]  M. Tovée,et al.  The responses of neurons in the temporal cortex of primates, and face identification and detection , 1994, Experimental Brain Research.

[11]  H. Bülthoff,et al.  Effects of temporal association on recognition memory , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  R. Desimone Face-Selective Cells in the Temporal Cortex of Monkeys , 1991, Journal of Cognitive Neuroscience.

[13]  R. M. Siegel,et al.  Three-dimensional structure-from-motion selectivity in the anterior superior temporal polysensory area, STPa, of the behaving monkey. , 2005, Cerebral cortex.

[14]  Edmund T Rolls,et al.  Spatial scene representations formed by self‐organizing learning in a hippocampal extension of the ventral visual system , 2008, The European journal of neuroscience.

[15]  M. Alexander,et al.  Principles of Neural Science , 1981 .

[16]  E. Rolls,et al.  INVARIANT FACE AND OBJECT RECOGNITION IN THE VISUAL SYSTEM , 1997, Progress in Neurobiology.

[17]  Edmund T. Rolls,et al.  Invariant recognition of feature combinations in the visual system , 2002, Biological Cybernetics.

[18]  S. Schultz Principles of Neural Science, 4th ed. , 2001 .

[19]  Edmund T. Rolls,et al.  Position invariant recognition in the visual system with cluttered environments , 2000, Neural Networks.

[20]  Eric R. Kandel,et al.  Perception of motion, depth and form , 2000 .

[21]  K. H. Britten,et al.  Neuronal correlates of a perceptual decision , 1989, Nature.

[22]  M. Tovée,et al.  Processing speed in the cerebral cortex and the neurophysiology of visual masking , 1994, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[23]  Stefano Panzeri,et al.  Information in the Neuronal Representation of Individual Stimuli in the Primate Temporal Visual Cortex , 1997, Journal of Computational Neuroscience.

[24]  Leslie G. Ungerleider Two cortical visual systems , 1982 .

[25]  E T Rolls,et al.  Invariant object recognition in the visual system with error correction and temporal difference learning , 2001, Network.

[26]  Margaret E. Sereno,et al.  Learning to See Rotation and Dilation with a Hebb Rule , 1990, NIPS.

[27]  M. Graziano,et al.  Tuning of MST neurons to spiral motions , 1994, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[28]  T J Sejnowski,et al.  Learning viewpoint-invariant face representations from visual experience in an attractor network. , 1998, Network.

[29]  R A Andersen,et al.  The Analysis of Complex Motion Patterns by Form/Cue Invariant MSTd Neurons , 1996, The Journal of Neuroscience.

[30]  Peter Földiák,et al.  Learning Invariance from Transformation Sequences , 1991, Neural Comput..

[31]  R A Andersen,et al.  Multimodal integration for the representation of space in the posterior parietal cortex. , 1997, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[32]  Edmund T. Rolls,et al.  Invariant Object Recognition in the Visual System with Novel Views of 3D Objects , 2002, Neural Computation.

[33]  B. McNaughton,et al.  Perception, memory, and emotion : frontiers in neuroscience , 1996 .

[34]  G. Orban,et al.  Responses of macaque STS neurons to optic flow components: a comparison of areas MT and MST. , 1994, Journal of neurophysiology.

[35]  M. Tarr,et al.  Visual Object Recognition , 1996, ISTCS.

[36]  J. Movshon,et al.  Adaptive Temporal Integration of Motion in Direction-Selective Neurons in Macaque Visual Cortex , 2004, The Journal of Neuroscience.

[37]  L. Chalupa,et al.  The visual neurosciences , 2004 .

[38]  E T Rolls,et al.  Neurophysiological mechanisms underlying face processing within and beyond the temporal cortical visual areas. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[39]  H. Sakata,et al.  Parietal cortical neurons responding to rotary movement of visual stimulus in space , 2004, Experimental Brain Research.

[40]  Edmund T. Rolls,et al.  Learning invariant object recognition in the visual system with continuous transformations , 2006, Biological Cybernetics.

[41]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[42]  M. Hasselmo,et al.  Object-centered encoding by face-selective neurons in the cortex in the superior temporal sulcus of the monkey , 2004, Experimental Brain Research.

[43]  Apostolos P. Georgopoulos,et al.  Participation of primary motor cortical neurons in a distributed network during maze solution: representation of spatial parameters and time-course comparison with parietal area 7a , 2004, Experimental Brain Research.

[44]  A. Treves,et al.  The representational capacity of the distributed encoding of information provided by populations of neurons in primate temporal visual cortex , 1997, Experimental Brain Research.

[45]  E. Rolls The representation of information about faces in the temporal and frontal lobes , 2007, Neuropsychologia.

[46]  Martin I. Sereno,et al.  Learning the Solution to the Aperture Problem for Pattern Motion with a Hebb Rule , 1988, NIPS.

[47]  Gustavo Deco,et al.  Computational neuroscience of vision , 2002 .

[48]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[49]  E. Rolls Face processing in different brain areas, and critical band masking. , 2008, Journal of neuropsychology.

[50]  E. Rolls Functions of the Primate Temporal Lobe Cortical Visual Areas in Invariant Visual Object and Face Recognition , 2000, Neuron.

[51]  Edmund T. Rolls,et al.  Learning transform invariant object recognition in the visual system with multiple stimuli present during training , 2008, Neural Networks.