Spontaneous Handwriting Text Recognition and Classification Using Finite-State Models

Finite-state models are used to implement a handwritten text recognition and classification system for a real application entailing casual, spontaneous writing with large vocabulary. Handwritten short phrases which involve a wide variety of writing styles and contain many non-textual artifacts, are to be classified into a small number of predefined classes. To this end, two different types of statistical framework for phrase recognition-classification are considered, based on finite-state models. HMMs are used for text recognition process. Depending to the considered architecture, N-grams are used for performing text recognition and then text classification (serial approach) or for performing both simultaneously (integrated approach). The multinomial text classifier is also employed in the classification phase of the serial approach. Experimental results are reported which, given the extreme difficulty of the task, are encouraging.

[1]  Francisco Casacuberta,et al.  Offline Recognition of Syntax-Constrained Cursive Handwritten Text , 2000, SSPR/SPR.

[2]  Ching Y. Suen,et al.  Recognition of legal amounts on bank cheques , 1998, Pattern Analysis and Applications.

[3]  Ian H. Witten,et al.  The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.

[4]  Alejandro Héctor Toselli,et al.  Spontaneous handwriting recognition and classification , 2004, ICPR 2004.

[5]  W. B. Cavnar,et al.  N-gram-based text categorization , 1994 .

[6]  Hermann Ney,et al.  Integrated Handwriting Recognition And Interpretation Using Finite-State Models , 2004, Int. J. Pattern Recognit. Artif. Intell..

[7]  Horst Bunke,et al.  Using a Statistical Language Model to Improve the Performance of an HMM-Based Cursive Handwriting Recognition System , 2001, Int. J. Pattern Recognit. Artif. Intell..

[8]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[9]  Alejandro Héctor Toselli,et al.  Projection Profile Based Algorithm for Slant Removal , 2004, ICIAR.

[10]  Slava M. Katz,et al.  Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[11]  Mohamed S. Kamel,et al.  Image Analysis and Recognition , 2014, Lecture Notes in Computer Science.

[12]  Hermann Ney,et al.  Reversing and Smoothing the Multinomial Naive Bayes Text Classifier , 2002, PRIS.

[13]  Richard M. Schwartz,et al.  An Omnifont Open-Vocabulary OCR System for English and Arabic , 1999, IEEE Trans. Pattern Anal. Mach. Intell..