Visual text recognition through contextual processing

Abstract In most of the works on contextual processing using a dictionary it is assumed that all the words of the document lie within the dictionary. It is also assumed that a clear word boundary exists. In practice neither is true. In this work we present a two pass contextual processing algorithm limited to the word level using a partial dictionary with an augmented dictionary approach, modified Viterbi algorithm and some heuristics based on pragmatic features. A character confusion matrix obtained through training and weighted with respect to dictionary words within the document is used to generate aliases for the input word. It has been tested with an omnifont character recogniser on documents of varying types. The overall performance of our system exceeds 98% correct character recognition (97% word recognition) which is better than that of other reported works.

[1]  Sargur N. Srihari,et al.  Experiments in Text Recognition with Binary n-Gram and Viterbi Algorithms , 1982, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Sargur N. Srihari,et al.  An Integrated Algorithm for Text Recognition: Comparison with a Cascaded Algorithm , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Theodosios Pavlidis,et al.  On the Recognition of Printed Characters of Any Font and Size , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Josef Raviv,et al.  Decision making in Markov chains applied to the problem of pattern recognition , 1967, IEEE Trans. Inf. Theory.

[5]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[6]  Godfried T. Toussaint,et al.  The use of context in pattern recognition , 1978, Pattern Recognit..

[7]  Ching Y. Suen,et al.  n-Gram Statistics for Natural Language Understanding and Text Processing , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Allen R. Hanson,et al.  A Contextual Postprocessing System for Error Correction Using Binary n-Grams , 1974, IEEE Transactions on Computers.

[9]  Sargur N. Srihari,et al.  Integrating diverse knowledge sources in text recognition , 1982, TOIS.

[10]  Rajjan Shinghal A hybrid algorithm for contextual text recognition , 1983, Pattern Recognit..

[11]  Godfried T. Toussaint,et al.  Experiments in Text Recognition with the Modified Viterbi Algorithm , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  R. Mahesh K. Sinha Some Characteristic Curves for Dictionary Organization with Digital Search , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  David L. Neuhoff,et al.  The Viterbi algorithm as an aid in text recognition (Corresp.) , 1975, IEEE Trans. Inf. Theory.

[14]  H. Kucera,et al.  Computational analysis of present-day American English , 1967 .

[15]  James L. Peterson,et al.  Computer programs for detecting and correcting spelling errors , 1980, CACM.

[16]  Barry Blesser,et al.  Interactive Recognition of Handprinted Characters for Computer Input , 1985, IEEE Computer Graphics and Applications.

[17]  James L. Peterson,et al.  A note on undetected typing errors , 1986, CACM.

[18]  Allen R. Hanson,et al.  Context in word recognition , 1976, Pattern Recognition.

[19]  Jonathan J. Hull Hypothesis Generation in a Computational Model for Visual Word Recognition , 1986, IEEE Expert.

[20]  Donald E. Knuth,et al.  Sorting and Searching , 1973 .

[21]  Godfried T. Toussaint,et al.  A bottom-up and top-down approach to using context in text recognition , 1979 .