Over-segmentation of handwriting Arabic scripts using an efficient heuristic technique

As is well known, good segmentation is one reason for high accuracy of character recognition; this paper proposes and investigates a new technique for segmentation of handwritten Arabic scripts. A new Arabic heuristic segmenter (AHS) has been implemented. The AHS employs three new features to locate a Prospected Segmentation Point (PSP) based on shape of the word image, first, remove the punctuation marks (dots), second, ligature detection, and third, additional techniques. Remove the punctuation marks technique has been used to avoid the overlap of the ligature to decrease errors of "missed" and "bad" segmentation points. Ligature detection technique has been used to improve locate the segmentation points that calculated based on distance between local minima and maxima of histogram, the technique calculated based on the distance between foreground and background pixels of word image histogram. An additional technique that contains the average character width technique, and close/open holes detection technique has also been investigated to enhance the overall results of segmentation.

[1]  Brijesh Verma,et al.  An investigation of the modified direction feature for cursive character recognition , 2007, Pattern Recognit..

[2]  Sargur N. Srihari,et al.  Segmentation-Based And Segmentation-Free Methods for Spotting Handwritten Arabic Words , 2006 .

[3]  Eric Lecolinet,et al.  A Survey of Methods and Strategies in Character Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Ramzi A. Haraty,et al.  Segmenting Handwritten Arabic Text , 2002 .

[5]  R. Casey Moment normalization of handprinted characters , 1970 .

[6]  Berrin A. Yanikoglu,et al.  Segmentation of off-line cursive handwriting using linear programming , 1998, Pattern Recognit..

[7]  Ramzi A. Haraty,et al.  A neuro-heuristic approach for segmenting handwritten Arabic text , 2001, Proceedings ACS/IEEE International Conference on Computer Systems and Applications.

[8]  S.N. Nawaz,et al.  An approach to offline Arabic character recognition using neural networks , 2003, 10th IEEE International Conference on Electronics, Circuits and Systems, 2003. ICECS 2003. Proceedings of the 2003.

[9]  Venu Govindaraju,et al.  Offline Arabic handwriting recognition: a survey , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Graham Leedham,et al.  Knowledge-based English cursive script segmentation , 2000, Pattern Recognit. Lett..

[11]  A. Benyettou,et al.  Optimized segmentation techniques for arabic handwritten numeral character recognition , 2005, SITIS.

[12]  Michael Blumenstein,et al.  The neural-based segmentation of cursive words using enhanced heuristics , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).