Paper information - A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures

VisNet2 is a model for investigating some aspects of invariant visual object recognition in the primate visual system. It is a four-layer feedforward network with convergence to each part of a layer from a small region of the preceding layer, with competition between the neurons within a layer, and with a trace learning rule to help it learn transform invariance. The trace rule is a modified Hebbian rule, which modifies synaptic weights according to both the current firing rates and the firing rates to recently seen stimuli. This enables neurons to learn to respond similarly to the gradually transforming inputs they receive, which over the short term are likely to be about the same object, given the statistics of normal visual inputs. First, we introduce for VisNet2 both single-neuron and multiple-neuron information-theoretic measures of its ability to respond to transformed stimuli. Second, using these measures, we show quantitatively that resetting the trace between stimuli is not necessary for good performance. Third, it is shown that the sigmoid activation functions used in VisNet2, which allow the sparseness of the representation to be controlled, allow good performance when using sparse distributed representations. Fourth, it is shown that VisNet2 operates well with medium-range lateral inhibition with a radius of the same order of size as the region of the preceding layer from which neurons receive inputs. Fifth, in an investigation of different learning rules for learning transform invariance, it is shown that VisNet2 operates better with a trace rule that incorporates in the trace only activity from the preceding presentations of a given stimulus, with no contribution to the trace from the current presentation, and that this is related to temporal difference learning.
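The difference between the two trace rules contrasted in the abstract can be illustrated with a minimal single-neuron sketch. The abstract does not state the equations, so the exponential trace `trace = (1 - eta) * y + eta * trace_prev` and the Hebbian-style update `w += alpha * post * x` are assumed forms. The names `trace_update`, `train_one_neuron`, `alpha`, `eta`, and `use_previous_trace` are hypothetical. The rectified linear firing rate and the weight normalisation are simplifications: they stand in for, and do not reproduce, the sigmoid activation and lateral inhibition of VisNet2.

```python
def trace_update(y_now, trace_prev, eta=0.8):
    """Exponentially decaying trace of postsynaptic firing (assumed form)."""
    return (1.0 - eta) * y_now + eta * trace_prev


def train_one_neuron(inputs, weights, alpha=0.1, eta=0.8, use_previous_trace=True):
    """Apply a trace rule to one neuron over a sequence of input vectors.

    use_previous_trace=True  -> the update uses only the trace from preceding
                                presentations (no current contribution).
    use_previous_trace=False -> the update uses the trace that already
                                includes the current firing rate.
    """
    w = list(weights)
    trace = 0.0  # no reset between stimuli is performed anywhere below
    for x in inputs:
        # Rectified linear firing rate: a simplification, not the VisNet2 sigmoid.
        y = max(0.0, sum(wi * xi for wi, xi in zip(w, x)))
        new_trace = trace_update(y, trace, eta)
        post = trace if use_previous_trace else new_trace
        # Hebbian-style update with the postsynaptic term replaced by a trace.
        w = [wi + alpha * post * xi for wi, xi in zip(w, x)]
        # Keep the weight vector at unit length (assumption for this sketch).
        norm = sum(wi * wi for wi in w) ** 0.5
        if norm > 0.0:
            w = [wi / norm for wi in w]
        trace = new_trace
    return w


# Two successive "views" of the same object activate different inputs.
views = [[1.0, 0.0], [0.0, 1.0]]
w_prev = train_one_neuron(views, [1.0, 0.0], use_previous_trace=True)
w_curr = train_one_neuron(views, [1.0, 0.0], use_previous_trace=False)
print(w_prev, w_curr)
```

In this toy sequence the neuron initially responds only to the first view. Because the trace carries that activity forward, the weight onto the input driven by the second view grows under both variants, which is the sense in which temporally adjacent inputs become associated with the same neuron.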
