A Visual Object Recognition System Invariant to Scale and Rotation

We here address the problem of scale and orientation invariant object recognition, making use of a correspondence-based mechanism, in which the identity of an object represented by sensory signals is determined by matching it to a representation stored in memory. The sensory representation is in general affected by various transformations, notably scale and rotation, thus giving rise to the fundamental problem of invariant object recognition. We focus here on a neurally plausible mechanism that deals simultaneously with identification of the object and detection of the transformation, both types of information being important for visual processing. Our mechanism is based on macrocolumnar units. These evaluate identity- and transformation-specific feature similarities, performing competitive computation on the alternatives of their own subtask, and cooperate to make a coherent global decision for the identity, scale and rotation of the object.

[1]  David J. Freedman,et al.  Visual categorization and the primate prefrontal cortex: neurophysiology and behavior. , 2002, Journal of neurophysiology.

[2]  J. P. Jones,et al.  An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[3]  Christian Wolff,et al.  Dynamic Link Matching between Feature Columns for Different Scale and Orientation , 2007, ICONIP.

[4]  Laurenz Wiskott,et al.  Face recognition by dynamic link matching , 1996 .

[5]  H. D. Block,et al.  Analysis of a Four-Layer Series-Coupled Perceptron. II , 1962 .

[6]  Yasuomi D. Sato,et al.  A Neural System for Scale and Orientation Invariant Correspondence Finding , 2007 .

[7]  Thomas Serre,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Christoph von der Malsburg,et al.  Dynamic link architecture , 1998 .

[9]  Thomas Serre,et al.  A Theory of Object Recognition: Computations and Circuits in the Feedforward Path of the Ventral Stream in Primate Visual Cortex , 2005 .

[10]  David W. Arathorn,et al.  Map-Seeking Circuits in Visual Cognition: A Computational Mechanism for Biological and Machine Vision , 2002 .

[11]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[12]  David J. Freedman,et al.  Categorical representation of visual stimuli in the primate prefrontal cortex. , 2001, Science.

[13]  Jörg Lücke,et al.  Rapid Correspondence Finding in Networks of Cortical Columns , 2006, ICANN.

[14]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[15]  Dennis Gabor,et al.  Theory of communication , 1946 .

[16]  J. Orbach Principles of Neurodynamics. Perceptrons and the Theory of Brain Mechanisms. , 1962 .

[17]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[18]  Norbert Krüger,et al.  Face Recognition by Elastic Bunch Graph Matching , 1997, CAIP.

[19]  Marc Toussaint,et al.  Extracting Motion Primitives from Natural Handwriting Data , 2006, ICANN.

[20]  C. V. D. Malsburg,et al.  Frank Rosenblatt: Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms , 1986 .