Recognizing handwritten Arabic script through efficient skeleton-based grapheme segmentation algorithm

To recognize unlimited set of handwritten Arabic words, an efficient segmentation algorithm is needed to segment these cursive words into a limited set of primal graphemes. We propose a rule-based segmentation algorithm that segments cursive words into graphemes through collecting special feature points from the word skeleton. The development of this algorithm is motivated by the need to solve problems and limitations available in the state-of-the-art algorithms in this area. The preliminary evaluation of the proposed algorithm is promising with over 96% accuracy on a sample subset of the IFN/ENIT database.

[1]  Mohammad S. Khorsheed,et al.  Off-Line Arabic Character Recognition – A Review , 2002, Pattern Analysis & Applications.

[2]  Liangrui Peng,et al.  Multi-queue merging scheme and its application in Arabic script segmentation , 2006, Second International Conference on Document Image Analysis for Libraries (DIAL'06).

[3]  Edward S. Deutsch,et al.  Thinning algorithms on rectangular, hexagonal, and triangular arrays , 1972, Commun. ACM.

[4]  A. Benyettou,et al.  Optimized segmentation techniques for arabic handwritten numeral character recognition , 2005, SITIS.

[5]  Robert Sabourin,et al.  Segmentation of Arabic cursive script , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[6]  M. Pechwitz,et al.  IFN/ENIT: database of handwritten arabic words , 2002 .

[7]  Venu Govindaraju,et al.  Segmentation and pre-recognition of Arabic handwriting , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[8]  Gheith A. Abandah,et al.  Analysis of Handwritten Arabic Letters Using Selected Feature Extraction Techniques , 2009, Int. J. Comput. Process. Orient. Lang..

[9]  Ashraf Elnagar,et al.  Handwriting segmentation of Arabic text , 2008 .

[10]  R. Smith,et al.  An Overview of the Tesseract OCR Engine , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[11]  Volker Märgner,et al.  Arabic Handwriting Recognition Competition , 2005, ICDAR.

[12]  Victoria J. Hodge,et al.  A Comparison of Standard Spell Checking Algorithms and a Novel Binary Neural Approach , 2003, IEEE Trans. Knowl. Data Eng..

[13]  Nicole Vincent,et al.  Shape-Based Alphabet for Off-line Arabic Handwriting Recognition , 2007 .

[14]  Ahmad Abdulkader,et al.  Two-Tier Approach for Arabic Offline Handwriting Recognition , 2006 .

[15]  Venu Govindaraju,et al.  Segmentation of Arabic Handwriting Based on both Contour and Skeleton Segmentation , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[16]  Noureddine Ellouze,et al.  A hybrid method for three segmentation level of handwritten Arabic script , 2009, MOCR '09.

[17]  Volker Märgner,et al.  ICDAR 2009-Arabic handwriting recognition competition , 2011, 2011 International Conference on Document Analysis and Recognition.

[18]  Mokhtar Sellami,et al.  Off-line handwritten Arabic character segmentation algorithm: ACSA , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.

[19]  Haikal El Abed,et al.  Baseline Extraction : Comparison of Six Methods on IFN / ENIT Database , 2008 .