Multi-stage HMM based Arabic text recognition with rescoring

In this paper, we present a multi-stage approach to handwritten Arabic text recognition using HMM where we separate the Arabic text image into core components and diacritics and recognize them separately using two separate HMM recognition systems. In the next stage, we combine the scores from both recognizers to make a final word hypothesis. This approach leads to huge reduction in the number of HMM models that need to be trained. Experiments conducted on a word recognition task using a publicly available benchmark database show the effectiveness of the technique. We achieve state-of-the-art results in addition to a compact model set for the recognition system.

[1]  Hany Ahmed,et al.  Effective technique for the recognition of offline Arabic handwritten words using hidden Markov models , 2013, International Journal on Document Analysis and Recognition (IJDAR).

[2]  Gernot A. Fink,et al.  Toward automatic video-based whiteboard reading , 2004, International Journal of Document Analysis and Recognition (IJDAR).

[3]  Gernot A. Fink,et al.  Novel Sub-character HMM Models for Arabic Text Recognition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[4]  Mohamed Cheriet,et al.  Arabic Cheque Processing System: Issues and Future Trends , 2007 .

[5]  Gernot A. Fink,et al.  Video-based whiteboard reading , 2005 .

[6]  Liangrui Peng,et al.  A Novel Baseline-independent Feature Set for Arabic Handwriting Recognition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[7]  Volker Märgner,et al.  ICDAR 2011 - Arabic Handwriting Recognition Competition , 2011, ICDAR.

[8]  Venu Govindaraju,et al.  Offline Arabic handwriting recognition: a survey , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Horst Bunke,et al.  Handwritten sentence recognition , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[10]  Sabri A. Mahmoud,et al.  Recognition : A Survey , 2013 .

[11]  Hermann Ney,et al.  RWTH OCR: A Large Vocabulary Optical Character Recognition System for Arabic Scripts , 2012 .

[12]  M. Pechwitz,et al.  IFN/ENIT: database of handwritten arabic words , 2002 .

[13]  PlötzThomas,et al.  Markov models for offline handwriting recognition: a survey , 2009 .

[14]  Nicole Vincent,et al.  Shape-Based Alphabet for Off-line Arabic Handwriting Recognition , 2007 .

[15]  Volker Märgner,et al.  Arabic Handwriting Recognition Competition , 2005, ICDAR.

[16]  Gernot A. Fink,et al.  Improvements in Sub-character HMM Model Based Arabic Text Recognition , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[17]  Chafic Mokbel,et al.  Combining Slanted-Frame Classifiers for Improved HMM-Based Arabic Handwriting Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Volker Märgner,et al.  ICFHR 2010 - Arabic Handwriting Recognition Competition , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[19]  Gernot A. Fink,et al.  Markov models for offline handwriting recognition: a survey , 2009, International Journal on Document Analysis and Recognition (IJDAR).