Automated forms-processing software and services

While document-image systems for the management of collections of documents, such as forms, offer significant productivity improvements, the entry of information from documents remains a labor-intensive and costly task for most organizations. In this paper, we describe a software system for the machine reading of forms data from their scanned images. We describe its major components: form recognition and “dropout,” intelligent character recognition (ICR), and contextual checking. Finally, we describe applications for which our automated forms reader has been successfully used.

[1]  Michael J. Fischer,et al.  The String-to-String Correction Problem , 1974, JACM.

[2]  Ching Y. Suen,et al.  Historical review of OCR research and development , 1992, Proc. IEEE.

[3]  Jianchang Mao,et al.  A comparative study of different classifiers for handprinted character recognition , 1994 .

[4]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[5]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[6]  E. Lecolinet,et al.  Strategies in character segmentation: a survey , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[7]  Kohji Fukunaga,et al.  Introduction to Statistical Pattern Recognition-Second Edition , 1990 .

[8]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[9]  Adam Krzyżak,et al.  Methods of combining multiple classifiers and their applications to handwriting recognition , 1992, IEEE Trans. Syst. Man Cybern..

[10]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[11]  David J. Burr,et al.  Elastic Matching of Line Drawings , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[13]  Jianchang Mao,et al.  A two-stage multi-network OCR system with a soft pre-classifier and a network selector , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[14]  Patrick J. Grother,et al.  The First Census Optical Character Recognition Systems Conference | NIST , 1992 .

[15]  R. Hunter,et al.  International digital facsimile coding standards , 1980, Proceedings of the IEEE.

[16]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[17]  Hiroyasu Takahashi,et al.  A clustering method and radius tuning by end users , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.