论文信息 - Object indexing using an iconic sparse distributed memory

Object indexing using an iconic sparse distributed memory

A general-purpose object indexing technique is described that combines the virtues of principal component analysis with the favorable matching properties of high-dimensional spaces to achieve high-precision recognition. An object is represented by a set of high-dimensional iconic feature vectors comprised of the responses of derivatives of Gaussian filters at a range of orientations and scales. Since these filters can be shown to form the eigenvectors of arbitrary images containing both natural and man-made structures, they are well-suited for indexing in disparate domains. The indexing algorithm uses an active vision system in conjunction with a modified form of Kanerva's (1988, 1993) sparse distributed memory which facilitates interpolation between views and provides a convenient platform for learning the association between an object's appearance and its identity. The robustness of the indexing method was experimentally confirmed by subjecting the method to a range of viewing conditions and the accuracy was verified using a well-known model database containing a number of complex 3D objects under varying pose.<<ETX>>

Rajesh P. N. Rao | Dana H. Ballard | D. Ballard

[1] Lawrence G. Roberts,et al. Machine Perception of Three-Dimensional Solids , 1963, Outstanding Dissertations in the Computer Sciences.

[2] D. Marr. A theory of cerebellar cortex , 1969, The Journal of physiology.

[3] E. Oja. Simplified neuron model as a principal component analyzer , 1982, Journal of mathematical biology.

[4] David Casasent,et al. Principal-Component Imagery For Statistical Pattern Recognition Correlators , 1982 .

[5] R. Young. GAUSSIAN DERIVATIVE THEORY OF SPATIAL VISION: ANALYSIS OF CORTICAL CELL RECEPTIVE FIELD LINE-WEIGHTING PROFILES. , 1985 .

[6] Charles R. Dyer,et al. Model-based recognition in robot vision , 1986, CSUR.

[7] J. Keeler. Comparison Between Kanerva's SDM and Hopfield-Type Neural Networks , 1988, Cogn. Sci..

[8] R. Bajcsy. Active perception , 1988, Proc. IEEE.

[9] Pentti Kanerva,et al. Sparse Distributed Memory , 1988 .

[10] Terence D. Sanger,et al. Optimal unsupervised learning in a single-layer linear feedforward neural network , 1989, Neural Networks.

[11] F. Girosi,et al. Networks for approximation and learning , 1990, Proc. IEEE.

[12] D G Stork,et al. Do Gabor functions provide appropriate descriptions of visual cortical receptive fields? , 1990, Journal of the Optical Society of America. A, Optics and image science.

[13] Dana H. Ballard,et al. Animate Vision , 1991, Artif. Intell..

[14] Gershon Buchsbaum,et al. A computational model of spatiochromatic image coding in early vision , 1991, J. Vis. Commun. Image Represent..

[15] Edward H. Adelson,et al. The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[16] M. Turk,et al. Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[17] Leslie S. Smith,et al. The principal components of natural images , 1992 .

[18] Allen Gersho,et al. Competitive learning and soft competition for vector quantizer design , 1992, IEEE Trans. Signal Process..

[19] D. Coombs. Real-Time Gaze Holding in Binocular Robot Vision , 1992 .

[20] D Mumford,et al. On the computational architecture of the neocortex. II. The role of cortico-cortical loops. , 1992, Biological cybernetics.

[21] Pentti Kanerva,et al. Sparse distributed memory and related models , 1993 .

[22] Alex Pentland,et al. View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[23] Rajesh P. N. Rao,et al. Seeing Behind Occlusions , 1994, ECCV.

[24] Rajesh P. N. Rao,et al. An Active Vision Architecture Based on Iconic Representations , 1995, Artif. Intell..

[25] Rajesh P. N. Rao,et al. Natural Basis Functions and Topographic Memory for Face Recognition , 1995, IJCAI.