An application of neural networks is the recognition of objects under translation, rotation, and scale change. Most existing networks for invariant object recognition require a huge number of connections and/or processing units. In this paper, we propose a new connectionist model for invariant object recognition for binary images with a reasonable network size. The network consists of five stages. The first stage shifts the object so that the centroid of the object coincides with the center of the image plane. The second stage is a variation of the polar- coordinate transformation used to obtain two N-dimensional representations of the input object. In this stage, the 0 axis is represented by the positions of the output units; therefore, any rotation of the original object becomes a cyclic shift of the output values of this stage. The third stage is a variation of the rapid transform, which provides invariant representations of cyclic-shift inputs. The next stage normalizes the outputs of the rapid transform to obtain scale invariance. The final stage is a nearest neighbor classifier. We tested the performance of the network for character recognition and good results were obtained with only one pattern per class in training.
[1]
Lilly Spirkovska,et al.
Connectivity strategies for higher-order neural networks applied to pattern recognition
,
1990,
1990 IJCNN International Joint Conference on Neural Networks.
[2]
Hans Burkhardt,et al.
On invariant sets of a certain class of fast translation-invariant transforms
,
1980
.
[3]
Alireza Khotanzad,et al.
Classification of invariant image representations using a neural network
,
1990,
IEEE Trans. Acoust. Speech Signal Process..
[4]
Bernard Widrow,et al.
Layered neural nets for pattern recognition
,
1988,
IEEE Trans. Acoust. Speech Signal Process..
[5]
D. Casasent,et al.
Position, rotation, and scale invariant optical correlation.
,
1976,
Applied optics.
[6]
Geoffrey E. Hinton,et al.
Learning internal representations by error propagation
,
1986
.