Object indexing using an iconic sparse distributed memory

A general-purpose object indexing technique is described that combines the virtues of principal component analysis with the favorable matching properties of high-dimensional spaces to achieve high-precision recognition. An object is represented by a set of high-dimensional iconic feature vectors comprised of the responses of derivatives of Gaussian filters at a range of orientations and scales. Since these filters can be shown to form the eigenvectors of arbitrary images containing both natural and man-made structures, they are well-suited for indexing in disparate domains. The indexing algorithm uses an active vision system in conjunction with a modified form of Kanerva's (1988, 1993) sparse distributed memory which facilitates interpolation between views and provides a convenient platform for learning the association between an object's appearance and its identity. The robustness of the indexing method was experimentally confirmed by subjecting the method to a range of viewing conditions and the accuracy was verified using a well-known model database containing a number of complex 3D objects under varying pose.<<ETX>>

[1]  Lawrence G. Roberts,et al.  Machine Perception of Three-Dimensional Solids , 1963, Outstanding Dissertations in the Computer Sciences.

[2]  D. Marr A theory of cerebellar cortex , 1969, The Journal of physiology.

[3]  E. Oja Simplified neuron model as a principal component analyzer , 1982, Journal of mathematical biology.

[4]  David Casasent,et al.  Principal-Component Imagery For Statistical Pattern Recognition Correlators , 1982 .

[5]  R. Young GAUSSIAN DERIVATIVE THEORY OF SPATIAL VISION: ANALYSIS OF CORTICAL CELL RECEPTIVE FIELD LINE-WEIGHTING PROFILES. , 1985 .

[6]  Charles R. Dyer,et al.  Model-based recognition in robot vision , 1986, CSUR.

[7]  J. Keeler Comparison Between Kanerva's SDM and Hopfield-Type Neural Networks , 1988, Cogn. Sci..

[8]  R. Bajcsy Active perception , 1988, Proc. IEEE.

[9]  Pentti Kanerva,et al.  Sparse Distributed Memory , 1988 .

[10]  Terence D. Sanger,et al.  Optimal unsupervised learning in a single-layer linear feedforward neural network , 1989, Neural Networks.

[11]  F. Girosi,et al.  Networks for approximation and learning , 1990, Proc. IEEE.

[12]  D G Stork,et al.  Do Gabor functions provide appropriate descriptions of visual cortical receptive fields? , 1990, Journal of the Optical Society of America. A, Optics and image science.

[13]  Dana H. Ballard,et al.  Animate Vision , 1991, Artif. Intell..

[14]  Gershon Buchsbaum,et al.  A computational model of spatiochromatic image coding in early vision , 1991, J. Vis. Commun. Image Represent..

[15]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[17]  Leslie S. Smith,et al.  The principal components of natural images , 1992 .

[18]  Allen Gersho,et al.  Competitive learning and soft competition for vector quantizer design , 1992, IEEE Trans. Signal Process..

[19]  D. Coombs Real-Time Gaze Holding in Binocular Robot Vision , 1992 .

[20]  D Mumford,et al.  On the computational architecture of the neocortex. II. The role of cortico-cortical loops. , 1992, Biological cybernetics.

[21]  Pentti Kanerva,et al.  Sparse distributed memory and related models , 1993 .

[22]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Rajesh P. N. Rao,et al.  Seeing Behind Occlusions , 1994, ECCV.

[24]  Rajesh P. N. Rao,et al.  An Active Vision Architecture Based on Iconic Representations , 1995, Artif. Intell..

[25]  Rajesh P. N. Rao,et al.  Natural Basis Functions and Topographic Memory for Face Recognition , 1995, IJCAI.