Paper information - Natural Basis Functions and Topographic Memory for Face Recognition

Natural Basis Functions and Topographic Memory for Face Recognition

Recent work regarding the statistics of natural images has revealed that the dominant eigenvectors of arbitrary natural images closely approximate various oriented derivative-of-Gaussian functions; these functions have also been shown to provide the best fit to the receptive field profiles of cells in the primate striate cortex. We propose a scheme for expression-invariant face recognition that employs a fixed set of these "natural" basis functions to generate multiscale iconic representations of human faces. Using a fixed set of basis functions obviates the need for recomputing eigenvectors (a step that was necessary in some previous approaches employing principal component analysis (PCA) for recognition) while at the same time retaining the redundancy-reducing properties of PCA. A face is represented by a set of iconic representations automatically extracted from an input image. The description thus obtained is stored in a topographically-organized sparse distributed memory that is based on a model of human long-term memory first proposed by Kanerva. We describe experimental results for an implementation of the method on a pipeline image processor that is capable of achieving near real-time recognition by exploiting the processor's frame-rate convolution capability for indexing purposes.

1 Introduction

The problem of object recognition has been a central subject in the field of computer vision. An especially interesting albeit difficult subproblem is that of recognizing human faces. In addition to the difficulties posed by changing viewing conditions, computational methods for face recognition have had to confront the fact that faces are complex non-rigid stimuli that defy easy geometric characterizations and form a dense cluster in the multidimensional space of input images. One of the most important issues in face recognition has therefore been the representation of faces.
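To make the idea of a multiscale iconic representation built from derivative-of-Gaussian basis functions more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the choice of derivative orders (first and second), the scales (1, 2, 4 pixels), the 3-sigma kernel radius, the border clamping, and all function names (`gaussian_derivative_kernel`, `filter_response`, `iconic_vector`) are assumptions made purely for illustration.

```python
import math


def gaussian_derivative_kernel(order_x, order_y, sigma, radius):
    """Sample a partial derivative of a 2-D Gaussian on a (2*radius+1)^2 grid.

    Only orders 0, 1 and 2 per axis are supported (an assumption of this sketch).
    """
    def factor(order, t):
        # Ratio between the 1-D Gaussian derivative and the Gaussian itself.
        if order == 0:
            return 1.0
        if order == 1:
            return -t / sigma ** 2
        if order == 2:
            return (t * t - sigma ** 2) / sigma ** 4
        raise ValueError("only derivative orders 0..2 are supported")

    kernel = []
    for y in range(-radius, radius + 1):
        row = []
        for x in range(-radius, radius + 1):
            g = math.exp(-(x * x + y * y) / (2 * sigma ** 2)) / (2 * math.pi * sigma ** 2)
            row.append(g * factor(order_x, x) * factor(order_y, y))
        kernel.append(row)
    return kernel


def filter_response(image, row, col, kernel):
    """Convolve the kernel with the image at a single point (row, col).

    The image is a list of rows; pixels outside the image are clamped to the border.
    """
    radius = len(kernel) // 2
    height, width = len(image), len(image[0])
    total = 0.0
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            r = min(max(row + dy, 0), height - 1)
            c = min(max(col + dx, 0), width - 1)
            # Flipped kernel indices make this a convolution, not a correlation.
            total += kernel[radius - dy][radius - dx] * image[r][c]
    return total


def iconic_vector(image, row, col, scales=(1.0, 2.0, 4.0)):
    """Stack first- and second-order Gaussian-derivative responses over several scales."""
    orders = [(1, 0), (0, 1), (2, 0), (1, 1), (0, 2)]  # (order_x, order_y), assumed set
    vector = []
    for sigma in scales:
        radius = int(math.ceil(3 * sigma))
        for order_x, order_y in orders:
            kernel = gaussian_derivative_kernel(order_x, order_y, sigma, radius)
            vector.append(filter_response(image, row, col, kernel))
    return vector


# Usage: a horizontal intensity ramp, sampled at the image centre.
ramp = [[float(c) for c in range(31)] for _ in range(31)]
vec = iconic_vector(ramp, 15, 15)
```

With these assumed settings the vector has 15 components (5 filters at 3 scales). On the unit-slope ramp, the x-derivative responses should come out close to 1 and the y-derivative responses close to 0, which is a quick sanity check that the kernels behave like derivative operators. Because the filter set is fixed, nothing has to be recomputed when new faces are added, which is the property the abstract contrasts with PCA-based approaches.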
Early schemes for face recognition utilized geometrical representations: prominent features such as eyes, nose, mouth, and chin were detected, and geometrical models of faces, given by feature vectors whose dimensions denoted, for instance, the relative positions of the facial features, were used for recognition [Bledsoe, 1966; Kanade, 1973]. Recently, researchers have reported successful results using photometric representations, i.e., representations that are computed directly from the intensity values of the input image. Some prominent examples include face representations based on biologically-motivated Gabor filter "jets" [Buhmann et al., 1990], randomly placed zeroth-order Gaussian kernels [Edelman et al. …

This paper explores the use of an iconic representation of human faces that exploits the dimensionality-reducing properties of PCA. However, unlike previous approaches employing …

Rajesh P. N. Rao | Dana H. Ballard

[1] Pentti Kanerva, et al. Sparse distributed memory and related models, 1993.

[2] Rajesh P. N. Rao, et al. Learning Saccadic Eye Movements Using Multiscale Spatial Filters, 1994, NIPS.

[3] Dennis Gabor, et al. Theory of communication, 1946.

[4] M. Turk, et al. Eigenfaces for Recognition, 1991, Journal of Cognitive Neuroscience.

[5] Leslie S. Smith, et al. The principal components of natural images, 1992.

[6] R. Young. Gaussian derivative theory of spatial vision: Analysis of cortical cell receptive field line-weighting profiles, 1985.

[7] Shree K. Nayar, et al. Visual learning and recognition of 3-D objects from appearance, 1995.

[8] Rajesh P. N. Rao, et al. An Active Vision Architecture Based on Iconic Representations, 1995, Artif. Intell.

[9] Alex Pentland, et al. View-based and modular eigenspaces for face recognition, 1994, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[10] J. Keeler. Comparison Between Kanerva's SDM and Hopfield-Type Neural Networks, 1988, Cogn. Sci.

[11] Rajesh P. N. Rao, et al. Seeing Behind Occlusions, 1994, ECCV.

[12] Rajesh P. N. Rao, et al. Object indexing using an iconic sparse distributed memory, 1995, Proceedings of IEEE International Conference on Computer Vision.

[13] Edward H. Adelson, et al. The Design and Use of Steerable Filters, 1991, IEEE Trans. Pattern Anal. Mach. Intell.

[14] D. Coombs. Real-Time Gaze Holding in Binocular Robot Vision, 1992.

[15] David Casasent, et al. Principal-Component Imagery For Statistical Pattern Recognition Correlators, 1982.

[16] Shimon Edelman, et al. Learning to Recognize Faces from Examples, 1992, ECCV.

[17] David Beymer, et al. Face recognition under varying pose, 1994, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[18] Pentti Kanerva, et al. Sparse Distributed Memory, 1988.

[19] Osamu Nakamura, et al. Identification of human faces based on isodensity maps, 1991, Pattern Recognit.

[20] Joachim M. Buhmann, et al. Size and distortion invariant object recognition by hierarchical graph matching, 1990, IJCNN International Joint Conference on Neural Networks.

[21] F. Girosi, et al. Networks for approximation and learning, 1990, Proc. IEEE.

[22] Terence D. Sanger, et al. Optimal unsupervised learning in a single-layer linear feedforward neural network, 1989, Neural Networks.