Comparative Study of Two Segmentation Methods of Handwritten Arabic Text - MM-OIC and HT-MM

We present in this paper a comparative study of two segmentation methods of handwritten Arabic text. The first method is a combination of the Mathematical Morphology (MM) and the algorithm of construction of the Outer Isothetic Cover of a digital object (OIC) named MM-OIC. The second method uses the Hough Transform (HT) andMMto segment the handwriting Arabic script called HT-MM. These methods are applied for two levels of segmentation: text lines and Pieces Words. The two proposed methods are evaluated and compared to the three databases : IFN/ENIT-database, BSB and KSU online databases. We propose a concept for automatic evaluation of the results, based on label tools for the different parts of the used documents.

[1]  Syed Saqib Bukhari,et al.  Script-Independent Handwritten Textlines Segmentation Using Active Contours , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[2]  Haikal El Abed,et al.  Baseline Extraction : Comparison of Six Methods on IFN / ENIT Database , 2008 .

[3]  Laurence Likforman-Sulem,et al.  Overlapping and multi-touching text-line segmentation by Block Covering analysis , 2008, Pattern Analysis and Applications.

[4]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[5]  Yi Li,et al.  Script-Independent Text Line Segmentation in Freestyle Handwritten Documents , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Laurence Likforman-Sulem,et al.  Text Line Segmentation of Historical Arabic Documents , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[7]  Basilios Gatos,et al.  Handwritten Text Line Segmentation by Shredding Text into its Lines , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[8]  Abdel Belaïd,et al.  Noname manuscript No. (will be inserted by the editor) A General Approach for Multi-oriented Text Line Extraction of Handwritten Documents , 2011 .

[9]  Véronique Eglin,et al.  Text Lines and Snippets Extraction for 19th Century Handwriting Documents Layout Analysis , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[10]  Ahmad Abdulkader,et al.  Two-Tier Approach for Arabic Offline Handwriting Recognition , 2006 .

[11]  Partha Bhowmick,et al.  Word Segmentation and Baseline Detection in Handwritten Documents Using Isothetic Covers , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[12]  Christian Olivier,et al.  Multi-level Arabic Handwritten Words Recognition , 1998, SSPR/SPR.

[13]  Noureddine Ellouze,et al.  A hybrid method for three segmentation level of handwritten Arabic script , 2009, MOCR '09.

[14]  Partha Bhowmick,et al.  Construction of isothetic covers of a digital object: A combinatorial approach , 2010, J. Vis. Commun. Image Represent..