Recognition of strings using nonstationary Markovian models: an application in ZIP code recognition

This paper presents nonstationary Markovian models and their application to recognition of strings of tokens, such as ZIP codes in the US mailstream. Unlike traditional approaches where digits are simply recognized in isolation, the novelty of our approach lies in the manner in which recognitions scores along with domain specific knowledge about the frequency distribution of various combination of digits are all integrated into one unified model. The domain knowledge is derived from postal directory files. This data feeds into the models as n-grams statistics that are seamlessly integrated with recognition scores of digit images. We present the recognition accuracy (90%) achieved on a set of 20,000 ZIP codes.

[1]  T. W. Anderson,et al.  Statistical Inference about Markov Chains , 1957 .

[2]  Sai-Sing. Lin,et al.  Statistical inference about Markov chains , 1966 .

[3]  P. L. Dobruschin The Description of a Random Field by Means of Conditional Probabilities and Conditions of Its Regularity , 1968 .

[4]  Godfried T. Toussaint,et al.  Experiments in Text Recognition with the Modified Viterbi Algorithm , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Malayappan Shridhar,et al.  Context-directed segmentation algorithm for handwritten numeral strings , 1987, Image Vis. Comput..

[6]  Ching Y. Suen,et al.  Structural classification and relaxation matching of totally unconstrained handwritten zip-code numbers , 1988, Pattern Recognit..

[7]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8]  Yang He,et al.  On optimal order in modeling sequence of letters in words of common language as a Markov chain , 1991, Pattern Recognit..

[9]  Fumitaka Kimura,et al.  Handwritten numerical recognition based on multiple algorithms , 1991, Pattern Recognit..

[10]  Jian Zhou,et al.  Off-Line Handwritten Word Recognition Using a Hidden Markov Model Type Stochastic Network , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Djamel Bouchaffra,et al.  Incorporating diverse information sources in handwriting recognition postprocessing , 1996, Int. J. Imaging Syst. Technol..

[12]  Geetha Srikantan,et al.  A multiple feature/resolution approach to handprinted digit and character recognition , 1996 .

[13]  Jonathan J. Hull Incorporating Language Syntax in Visual Text Recognition with a Statistical Model , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Djamel Bouchaffra,et al.  Incorporating diverse information sources in handwriting recognition postprocessing , 1996 .

[15]  Gyeonghwan Kim,et al.  A Lexicon Driven Approach to Handwritten Word Recognition for Real-Time Applications , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Venu Govindaraju,et al.  Segmentation and recognition of connected handwritten numeral strings , 1997, Pattern Recognit..

[17]  Yves Lecourtier,et al.  Optimal Order of Markov Models Applied to Bankchecks , 1997, Int. J. Pattern Recognit. Artif. Intell..