A new approach for off-line handwritten Arabic word recognition using KNN classifier

Recognition of handwritten Arabic text is difficult because many Arabic letters resemble one another and writing styles vary widely. This paper proposes an off-line Arabic handwritten word recognition system with three stages: preprocessing, feature extraction and classification. First, words are segmented from the input script and normalized in size. Second, each segmented word is divided into overlapping blocks, and the absolute mean values computed for the blocks constitute the word's feature vector. Finally, the resulting feature vectors are used to classify the words with a k-nearest neighbour (KNN) classifier. The proposed system has been tested on the IFN/ENIT database, which consists of 32492 Arabic handwritten words written by more than 1000 different writers. Experimental results show a good recognition rate in comparison with other methods.
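The feature extraction and classification stages described above can be illustrated with a minimal sketch. It is not the authors' implementation: the block size, the step between blocks, the value of k, the Euclidean distance and the function names `block_features` and `knn_predict` are all assumptions chosen for illustration, and the word images are assumed to be already segmented and normalized to a common size.

```python
import numpy as np

def block_features(word, block=16, step=8):
    """Mean absolute pixel value of each overlapping block.

    `word` is a 2-D array holding one size-normalized word image.
    `block` and `step` are illustrative values (assumptions); blocks
    overlap whenever step < block.
    """
    h, w = word.shape
    feats = []
    for r in range(0, h - block + 1, step):
        for c in range(0, w - block + 1, step):
            feats.append(np.abs(word[r:r + block, c:c + block]).mean())
    return np.asarray(feats)

def knn_predict(train_x, train_y, query, k=3):
    """Label `query` by majority vote among its k nearest training vectors.

    Euclidean distance is an assumption; the abstract does not name a metric.
    """
    dists = np.linalg.norm(train_x - query, axis=1)
    nearest = np.argsort(dists)[:k]
    labels, counts = np.unique(train_y[nearest], return_counts=True)
    return labels[np.argmax(counts)]
```

In use, `block_features` would be applied to every training word to build the rows of `train_x`, with the word labels in `train_y`, and `knn_predict` would then be called on the feature vector of each test word.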
