论文信息 - Multistaged neural network architecture for position invariant shape recognition

Multistaged neural network architecture for position invariant shape recognition

This paper presents a pattern recognition system that self-organizes to recognize objects by shape as part of an Integrated Visual Network (IVN) for autonomous flight control. The system uses a multistaged hierarchical neural network that exhibits insensitivity to the location of the object in the visual field. The network''s three layers perform the functionally disjoint tasks of preprocessing (dynamic thresholding) invariance (position normalization) and recognition (identification of the shape). The Preprocessing stage uses a single layer of elements to dynamically threshold the grey level input image into a binary image. The Invariance stage is a multilayered neural network implementation of a modified Waish-Hadamard transform that generates a representation of the object that is invariant with respect to the object''s position. The Recognition stage is a modified version of Fukushima''s Neocognitron that identifies the position normalized representation by shape. The inclusion of the Preprocessing and Invariance stages allows reduction of the massively replicated processing structures used for translation invariance in the Neocognitron. This system offers roughly the same translation invariance capabilities as the Neocognitron with a dramatic reduction in the number of elements and the network''s interconnection complexity.

[1] Kunihiko Fukushima,et al. Neocognitron: A hierarchical neural network capable of visual pattern recognition , 1988, Neural Networks.

[2] E S. McVey,et al. Artificial Neural Computer For Image Tracking , 1989, Photonics West - Lasers and Applications in Science and Engineering.