论文信息 - A Fourier-descriptor-based character recognition engine implemented under the Gamera open-source document-processing framework

A Fourier-descriptor-based character recognition engine implemented under the Gamera open-source document-processing framework

This paper discusses the implementation of an engine for performing optical character recognition of bi-tonal images using the Gamera framework, an existing open-source framework for building document analysis applications. The OCR engine uses features that are based on the Fourier descriptor to distinguish characters, and is designed to be able to handle character images that contain multiple boundaries. The algorithm works by assigning to each character image a signature that encodes the boundary types that are present in the image as well as the positional relationships that exist between them. Under this approach, only images having the same signature are comparable. Effectively, a meta-classifier is used which first computes the signature of an input image and then dispatches the image to an underlying neural network based classifier which is trained to distinguish between images having that signature. The performance of the OCR engine is evaluated on a set of sample images taken from the newspaper domain, and compares well with other OCR engines. The source code for this engine and all supporting modules is currently available upon request, and will eventually be made available through an open-source project on the sourceforge website.

Timothy L. Andersen | Jared Hopkins

[1] Anil K. Jain,et al. Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[2] Malayappan Shridhar,et al. High accuracy character recognition algorithm using fourier and topological descriptors , 1984, Pattern Recognit..

[3] Gösta H. Granlund,et al. Fourier Preprocessing for Hand Print Character Recognition , 1972, IEEE Transactions on Computers.

[4] Matti Pietikäinen,et al. An Experimental Comparison of Autoregressive and Fourier-Based Descriptors in 2D Shape Classification , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Ralph Roskies,et al. Fourier Descriptors for Plane Closed Curves , 1972, IEEE Transactions on Computers.

[6] Christos Faloutsos,et al. Efficient Similarity Search In Sequence Databases , 1993, FODO.

[7] Charles R. Giardina,et al. Elliptic Fourier features of a closed contour , 1982, Comput. Graph. Image Process..

[8] King-Sun Fu,et al. Shape Discrimination Using Fourier Descriptors , 1977, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Chun-Shin Lin,et al. New forms of shape invariants from elliptic fourier descriptors , 1987, Pattern Recognit..