Development of an efficient neural-based segmentation technique for Arabic handwriting recognition

Off-line Arabic handwriting recognition and segmentation has been a popular field of research for many years. It still remains an open problem. The challenging nature of handwriting recognition and segmentation has attracted the attention of researchers from industry and academic circles. Recognition and segmentation of Arabic handwritten script is a difficult task because the Arabic handwritten characters are naturally both cursive and unconstrained. The analysis of Arabic script is more complicated in comparison with English script. It is believed, good segmentation is one reason for high accuracy character recognition. This paper proposes and investigates four main segmentation techniques. First, a new feature-based Arabic heuristic segmentation AHS technique is proposed for the purpose of partitioning Arabic handwritten words into primitives (over-segmentations) that may then be processed further to provide the best segmentation. Second, a new feature extraction technique (modified direction features-MDF) with modifications in accordant with the characteristics of Arabic scripts is also investigated for the purpose of segmented character classification. Third, a novel neural-based technique for validating prospective segmentation points of Arabic handwriting is proposed and investigated based on direction features. In particular, the vital process of handwriting segmentation is examined in great detail. The classifier chosen for segmentation point validation is a feed-forward neural network trained with the back-propagation algorithm. Many experiments were performed, and their elapsed CPU times and accuracies were reported. Fourth, new fusion equations are proposed and investigation to examine and evaluate a prospective segmentation points by obtaining a fused value from three neural confidence values obtained from right and center character recognition outputs in addition to the segmentation point validation (SPV) output. Confidence values are assigned to each segmentation point located through feature detection. All techniques components are tested on a local benchmark database. High segmentation accuracy is reported in this research along with comparable results for character recognition and segmentation.

[1]  Ishwar K. Sethi,et al.  Off-line cursive handwriting segmentation , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[2]  Seong-Whan Lee,et al.  A New Methodology for Gray-Scale Character Segmentation and Recognition , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Brijesh Verma,et al.  Segmentation vs. non-segmentation based neural techniques for cursive word recognition: an experimental analysis , 2001, Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001.

[4]  Ramzi A. Haraty,et al.  A neuro-heuristic approach for segmenting handwritten Arabic text , 2001, Proceedings ACS/IEEE International Conference on Computer Systems and Applications.

[5]  Brijesh Verma,et al.  An investigation of the modified direction feature for cursive character recognition , 2007, Pattern Recognit..

[6]  Yoshinao Aoki,et al.  On the segmentation of multi-font printed Uygur scripts , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[7]  Sherif Sami El-Dabi,et al.  Arabic character recognition system: A statistical approach for recognizing cursive typewritten text , 1990, Pattern Recognit..

[8]  Jianmin Jiang,et al.  Component-based Segmentation of words from handwritten Arabic text , 2009 .

[9]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[10]  Brijesh Verma,et al.  Segmentation Versus Non-Segmentation Based Neural Techniques for Cursive Word Recognition: An Experimental Analysis , 2002, Int. J. Comput. Intell. Appl..

[11]  A SIMPLE AND EFFECTIVE CURSIVE WORD SEGMENTATION METHOD , 2004 .

[12]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Brijesh Verma,et al.  A Segmentation Algorithm used in Conjunction with Artificial Neural Networks for the Recognition of Real-World Postal Addresses , 1997 .

[14]  Talaat S. El-Sheikh,et al.  Computer recognition of arabic cursive scripts , 1988, Pattern Recognit..

[15]  M. Blumenstein,et al.  A modified direction feature for cursive character recognition , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[16]  Venu Govindaraju,et al.  Offline Arabic handwriting recognition: a survey , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Robert M. Haralick,et al.  A segmentation-free approach to text recognition with application to Arabic text , 1996, International Journal on Document Analysis and Recognition.

[18]  Venu Govindaraju,et al.  Segmentation and pre-recognition of Arabic handwriting , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[19]  Ramzi A. Haraty,et al.  Segmenting Handwritten Arabic Text , 2002 .

[20]  Reda A. Ammar,et al.  Arabic character recognition in a multi-processing environment , 1997, Proceedings Second IEEE Symposium on Computer and Communications.

[21]  Berrin A. Yanikoglu,et al.  Segmentation of off-line cursive handwriting using linear programming , 1998, Pattern Recognit..

[22]  Michael Blumenstein,et al.  The neural-based segmentation of cursive words using enhanced heuristics , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[23]  Behrooz Parhami,et al.  Automatic recognition of printed Farsi texts , 1981, Pattern Recognit..