Gradient-based contour encoding for character recognition

We describe novel feature extraction methods for recognizing single, isolated character images. The approach is flexible: the same algorithms can be used, without modification, for feature extraction in a variety of OCR problems, including handwritten, machine-printed, grayscale, binary, and low-resolution character recognition. We use the gradient representation as the basis for extracting low-level, structural, and stroke-type features. The algorithms require only a few simple arithmetic operations per image pixel, which makes them suitable for real-time applications. This paper describes the algorithms and presents experimental results obtained with artificial neural networks on several data sets. The results demonstrate high performance of these features when tested on data sets distinct from the training data.
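To make the idea of a per-pixel gradient representation concrete, the following is a minimal sketch rather than the algorithm of this paper. It assumes a 3x3 Sobel operator, eight direction bins centered on the compass directions, and a hypothetical function name `gradient_direction_histogram`; none of these choices is stated in the abstract. It shows that a gradient direction code can be obtained from a handful of additions and one angle computation per pixel, and that the same code path handles binary and grayscale input.

```python
import math


def gradient_direction_histogram(image, bins=8, min_magnitude=1e-9):
    """Count quantized gradient directions over the interior of a 2-D image.

    `image` is a list of equal-length rows of numbers (binary or grayscale).
    The Sobel operator and the bin count are illustrative assumptions.
    """
    rows, cols = len(image), len(image[0])
    hist = [0] * bins
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            # Horizontal Sobel response: right column minus left column.
            gx = (image[r - 1][c + 1] + 2 * image[r][c + 1] + image[r + 1][c + 1]
                  - image[r - 1][c - 1] - 2 * image[r][c - 1] - image[r + 1][c - 1])
            # Vertical Sobel response: lower row minus upper row.
            gy = (image[r + 1][c - 1] + 2 * image[r + 1][c] + image[r + 1][c + 1]
                  - image[r - 1][c - 1] - 2 * image[r - 1][c] - image[r - 1][c + 1])
            # Flat regions carry no direction information, so skip them.
            if math.hypot(gx, gy) <= min_magnitude:
                continue
            # Map the angle to [0, 2*pi) and shift by half a bin so that
            # each bin is centered on its nominal direction.
            angle = math.atan2(gy, gx) % (2 * math.pi)
            index = int((angle + math.pi / bins) / (2 * math.pi) * bins) % bins
            hist[index] += 1
    return hist


# Usage: a vertical edge (dark on the left, bright on the right).
vertical_edge = [[0, 0, 0, 1, 1, 1] for _ in range(5)]
print(gradient_direction_histogram(vertical_edge))
```

In a full recognizer, such direction codes would typically be accumulated per image region and passed to a classifier; the region layout and the classifier inputs are outside the scope of this sketch.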
