A rotation, scaling, and translation invariant pattern classification system

Abstract This paper describes a hybrid pattern classification system based on a pattern preprocessor and an artificial neural network classifier that can recognize patterns even when they are deformed by transformation of rotation, scaling, and translation or a combination of these. After a description of the system architecture we provide experimental results from three different classification domains: classification of letters in the English alphabet, classification of the letters in the Japanese Katakana alphabet, and classification of geometric figures. For the first problem, our system can recognize patterns deformed by a single transformation with well over 90% success ratio and with 89% success ratio when all three transformations are applied. For the second problem, the system performs very good for patterns deformed by scaling and translation but worse (about 75%) when rotations are involved. For the third problem, the success ratio is almost 100% when only a single transformation is applied and 88% when all three transformations are applied. The system is general purpose and has a reasonable noise tolerance.

[1]  King-Sun Fu,et al.  Shape Discrimination Using Fourier Descriptors , 1977, IEEE Trans. Syst. Man Cybern..

[2]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Heidar A. Malki,et al.  Using the Karhunen-Loe've transformation in the back-propagation training algorithm , 1991, IEEE Trans. Neural Networks.

[4]  D. Casasent,et al.  Position, rotation, and scale invariant optical correlation. , 1976, Applied optics.

[5]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[6]  Etienne Barnard,et al.  Shift invariance and the neocognitron , 1990, Neural Networks.

[7]  Ralph Roskies,et al.  Fourier Descriptors for Plane Closed Curves , 1972, IEEE Transactions on Computers.

[8]  I. Guyon,et al.  Handwritten digit recognition: applications of neural network chips and automatic learning , 1989, IEEE Communications Magazine.

[9]  James A. Pittman,et al.  Recognizing Hand-Printed Letters and Digits Using Backpropagation Learning , 1991, Neural Computation.

[10]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[11]  Alireza Khotanzad,et al.  Classification of invariant image representations using a neural network , 1990, IEEE Trans. Acoust. Speech Signal Process..

[12]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[13]  King-Sun Fu,et al.  Syntactic Pattern Recognition And Applications , 1968 .

[14]  H H Arsenault,et al.  Rotation-invariant digital pattern recognition using circular harmonic expansion. , 1982, Applied optics.

[15]  Takayuki Ito,et al.  Neocognitron: A neural network model for a mechanism of visual pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[16]  Yann LeCun,et al.  Handwritten zip code recognition with multilayer networks , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[17]  Anthony P. Reeves,et al.  Three-Dimensional Shape Analysis Using Moments and Fourier Descriptors , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Rama Chellappa,et al.  Stochastic models for closed boundary analysis: Representation and reconstruction , 1981, IEEE Trans. Inf. Theory.