Utilizing temporal associations for view-based 3-D object recognition

The authors propose an architecture for the recognition of three-dimensional objects on the basis of viewer-centered representations and temporal associations. Motivated by biological findings and by successful computational implementations they have chosen a viewer-centered representation scheme. In contrast to other implementations, special attention is paid to the temporal order of the views, which proves useful for learning and recognition purposes. Their recognition system combines different kinds of artificial neural networks into a four stage architecture: preprocessing by a Gaborjet transform is followed by an extended dynamic link matching algorithm which implements recognition and learning of the view classes. A STORE network records the temporal order of the views by transforming a sequence of view classes into an item-and-order coding. Subsequently, a Gaussian-ARTMAP architecture is used for the classification of the sequences and for their mapping onto object classes by means of supervised learning. The presented results demonstrate that the system is capable to autonomously learn and to discriminate similar objects. Additionally, the examples show how the utilization of the temporal context improves object recognition by making ambiguous views manageable and facilitating an increased insensitiveness against misclassifications.

[1]  Bärbel Mertsching,et al.  Visual Attention and Gaze Control for an Active Vision System , 1997, ICONIP.

[2]  M. Stryker Temporal associations , 1991, Nature.

[3]  Rolf P. Würtz,et al.  Multilayer dynamic link networks for establishing image point correspondences and visual object recognition , 1995 .

[4]  Keiji Tanaka,et al.  Inferotemporal cortex and object vision. , 1996, Annual review of neuroscience.

[5]  Y. Miyashita,et al.  Neural organization for the long-term memory of paired associates , 1991, Nature.

[6]  Christoph von der Malsburg,et al.  Recognizing Faces by Dynamic Link Matching , 1996, NeuroImage.

[7]  Allen M. Waxman,et al.  Adaptive 3-D Object Recognition from Multiple Views , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Christoph von der Malsburg,et al.  The Correlation Theory of Brain Function , 1994 .

[9]  Guy Wallis,et al.  Temporal Order in Human Object Recognition Learning , 1998 .

[10]  Stephen Grossberg,et al.  Working Memory Networks for Learning Temporal Order with Application to Three-Dimensional Visual Object Recognition , 1992, Neural Computation.

[11]  Y. Miyashita Neuronal correlate of visual associative long-term memory in the primate temporal cortex , 1988, Nature.

[12]  Alex Pentland,et al.  Recognition of Space-Time Gestures using a Distributed Representation , 1993 .

[13]  Stephen Grossberg,et al.  Fast Learning VIEWNET Architectures for Recognizing 3-D Objects from Multiple 2-D Views , 1995 .

[14]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[15]  J. Leo van Hemmen,et al.  Temporal association , 1991 .

[16]  Wolfgang Konen,et al.  A fast dynamic link matching algorithm for invariant pattern recognition , 1994, Neural Networks.

[17]  James R. Williamson,et al.  Gaussian ARTMAP: A Neural Network for Fast Incremental Learning of Noisy Multidimensional Maps , 1996, Neural Networks.