Learning viewpoint-invariant face representations from visual experience in an attractor network.

In natural visual experience, different views of an object or face tend to appear in close temporal proximity as an animal manipulates the object or navigates around it, or as a face changes expression or pose. A set of simulations is presented which demonstrate how viewpoint-invariant representations of faces can be developed from visual experience by capturing the temporal relationships among the input patterns. The simulations explored the interaction of temporal smoothing of activity signals with Hebbian learning in both a feedforward layer and a second, recurrent layer of a network. The feedforward connections were trained by competitive Hebbian learning with temporal smoothing of the post-synaptic unit activities. The recurrent layer was a generalization of a Hopfield network with a low-pass temporal filter on all unit activities. The combination of basic Hebbian learning with temporal smoothing of unit activities produced an attractor network learning rule that associated temporally proximal input patterns into basins of attraction. These two mechanisms were demonstrated in a model that took grey-level images of faces as input. Following training on image sequences of faces as they changed pose, multiple views of a given face fell into the same basin of attraction, and the system acquired representations of faces that were approximately viewpoint-invariant.

[1]  James V. Stone Object recognition using spatiotemporal signatures , 1998, Vision Research.

[2]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[3]  Peter Földiák,et al.  Learning Invariance from Transformation Sequences , 1991, Neural Comput..

[4]  M. Tovée,et al.  Representational capacity of face coding in monkeys. , 1996, Cerebral cortex.

[5]  Terrence J. Sejnowski,et al.  Unsupervised Learning Of Invariant Representations Of Faces Through Temporal Association , 1996 .

[6]  Erkki Oja,et al.  Neural Networks, Principal Components, and Subspaces , 1989, Int. J. Neural Syst..

[7]  M. Tsodyks,et al.  The Enhanced Storage Capacity in Neural Networks with Low Activity Level , 1988 .

[8]  J. Kaas Plasticity of sensory and motor maps in adult mammals. , 1991, Annual review of neuroscience.

[9]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  P. C. Murphy,et al.  Cerebral Cortex , 2017, Cerebral Cortex.

[11]  A. J. Mistlin,et al.  Visual neurones responsive to faces , 1987, Trends in Neurosciences.

[12]  Y. Miyashita Neuronal correlate of visual associative long-term memory in the primate temporal cortex , 1988, Nature.

[13]  M. Young,et al.  Sparse population coding of faces in the inferotemporal cortex. , 1992, Science.

[14]  I. Biederman,et al.  Neural and psychophysical analysis of object and face recognition , 1998 .

[15]  G. Pike,et al.  Recognizing moving faces: The relative contribution of motion and perspective view information. , 1997 .

[16]  E. Rolls Learning mechanisms in the temporal lobe visual cortex , 1995, Behavioural Brain Research.

[17]  Marian Stewart Bartlett,et al.  Viewpoint Invariant Face Recognition using Independent Component Analysis and Attractor Networks , 1996, NIPS.

[18]  Niraj S. Desai,et al.  Activity-dependent scaling of quantal amplitude in neocortical neurons , 1998, Nature.

[19]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[20]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[21]  David Beymer,et al.  Face recognition under varying pose , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[22]  H H Bülthoff,et al.  How are three-dimensional objects represented in the brain? , 1994, Cerebral cortex.

[23]  Roland Baddeley,et al.  Optimal, Unsupervised Learning in Invariant Object Recognition , 1997, Neural Computation.

[24]  M. Stryker Temporal associations , 1991, Nature.

[25]  Suzanna Becker,et al.  Learning to Categorize Objects Using Temporal Coherence , 1992, NIPS.

[26]  K Tanaka,et al.  Neuronal mechanisms of object recognition. , 1993, Science.

[27]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[28]  David Zipser,et al.  Feature Discovery by Competive Learning , 1986, Cogn. Sci..

[29]  J. Leo van Hemmen,et al.  Temporal association , 1991 .

[30]  Geoffrey E. Hinton,et al.  Lesioning an attractor network: investigations of acquired dyslexia. , 1991, Psychological review.

[31]  D. Amit The Hebbian paradigm reintegrated: Local reverberations as internal representations , 1995, Behavioral and Brain Sciences.

[32]  Michael S. Landy,et al.  Nonlinear Model of Neural Responses in Cat Visual Cortex , 1991 .

[33]  R. Shepard,et al.  Mental Images and Their Transformations , 1982 .

[34]  Terence D. Sanger,et al.  Optimal unsupervised learning in a single-layer linear feedforward neural network , 1989, Neural Networks.

[35]  Vicki Bruce,et al.  Face Recognition: From Theory to Applications , 1999 .

[36]  William M. Jenkins,et al.  Neocortical representational dynamics in adult primates: Implications for neuropsychology , 1990, Neuropsychologia.

[37]  James V. Stone Learning Perceptually Salient Visual Parameters Using Spatiotemporal Smoothness Constraints , 1996, Neural Computation.

[38]  W J Freeman,et al.  Characterization of state transitions in spatially distributed, chaotic, nonlinear, dynamical systems in cerebral cortex , 1994, Integrative physiological and behavioral science : the official journal of the Pavlovian Society.

[39]  V. Bruce,et al.  Human Face Perception and Identification , 1998 .

[40]  N. Logothetis,et al.  Psychophysical and physiological evidence for viewer-centered object representations in the primate. , 1995, Cerebral cortex.

[41]  D. Heeger Nonlinear model of neural responses in cat visual cortex. , 1991 .

[42]  Néstor Parga,et al.  Transform-Invariant Recognition by Association in a Recurrent Network , 1998, Neural Computation.

[43]  E. Rolls,et al.  INVARIANT FACE AND OBJECT RECOGNITION IN THE VISUAL SYSTEM , 1997, Progress in Neurobiology.

[44]  Daniel J. Amit,et al.  Conversion of Temporal Correlations Between Stimuli to Spatial Correlations Between Attractors , 1999, Neural Computation.

[45]  V. Bruce,et al.  The role of movement in the recognition of famous faces , 1999, Memory & cognition.

[46]  Mark H. Johnson,et al.  Object Recognition and Sensitive Periods: A Computational Analysis of Visual Imprinting , 1994, Neural Computation.

[47]  GrossbergS. Adaptive pattern classification and universal recoding , 1976 .

[48]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[50]  E T Rolls,et al.  Neurophysiological mechanisms underlying face processing within and beyond the temporal cortical visual areas. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[51]  G. Edelman,et al.  Spatial signaling in the development and function of neural connections. , 1991, Cerebral cortex.

[52]  E. Gardner The space of interactions in neural network models , 1988 .