Variable length and context-dependent HMM letter form models for Arabic handwritten word recognition

We present an HMM-based recognizer for unconstrained Arabic handwritten words. The recognizer uses context-dependent HMMs with variable topology, combining both sources of information to model writing units more accurately. We propose an algorithm that adapts the topology of each HMM to the character it models. To model contextual units, we introduce a state-tying process based on decision tree clustering, which significantly reduces the number of parameters. The decision trees are built from a set of expert-based questions on how characters are written. The questions are divided into global questions, which yield larger clusters, and precise questions, which yield smaller ones. Experiments conducted on the OpenHaRT2010 database show that variable length topology and contextual information significantly improve the recognition rate.
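The abstract does not spell out how the topology is adapted, so the following is only a minimal sketch of one common heuristic, not the authors' algorithm: give each character model a number of states proportional to the average length, in feature frames, of its training samples. The function name `state_counts`, the parameters `frames_per_state`, `min_states` and `max_states`, and the sample data are all hypothetical and chosen for illustration.

```python
from statistics import mean


def state_counts(frame_lengths, frames_per_state=3.0, min_states=2, max_states=12):
    """Map each character label to a number of HMM states.

    frame_lengths: dict mapping a character label to the list of observed
    lengths (in frames) of its training samples. All defaults are
    illustrative assumptions, not values taken from the paper.
    """
    counts = {}
    for label, lengths in frame_lengths.items():
        avg = mean(lengths)
        # One state per `frames_per_state` frames on average, then clamp.
        n = round(avg / frames_per_state)
        counts[label] = max(min_states, min(max_states, n))
    return counts


# Hypothetical frame lengths: a narrow letter and a wide letter.
samples = {
    "alif": [4, 5, 6],
    "seen": [17, 18, 19],
}
counts = state_counts(samples)
print(counts)
```

Under these assumptions a narrow letter receives a short model and a wide letter a longer one, which is the intuition behind variable length topologies; a real system would tune the ratio and bounds on validation data.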
