Recognition of Handwritten Persian/Arabic Numerals Based on Robust Feature Set and K-NN Classifier

Recognition of handwritten Persian numerals has been an active research area in pattern recognition for the last few decades. The task is difficult because writing styles vary widely between individuals. This paper presents a robust and efficient method for recognizing handwritten Persian/Arabic numerals based on the K Nearest Neighbors (K-NN) classifier. The system first prepares a contour form of each handwritten numeral, then extracts transit, angle and distance features from the character, and finally uses a K-NN classifier for recognition. The angle, transit and distance features of a character are computed from the distribution of points on its bitmap image. In the K-NN method, the Euclidean distance between the test point and each reference point is calculated in order to find the k nearest neighbors. We evaluated our method on 20,000 handwritten samples of Persian numerals. Using 15,000 samples for training, we tested our method on the other 5,000 samples and obtained a 99.82% correct recognition rate. Further, we obtained 89.90% accuracy using four-fold cross-validation on the full 20,000-sample dataset.
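The K-NN step described above can be illustrated with a minimal sketch. It assumes each numeral has already been reduced to a fixed-length feature vector; the two-dimensional toy vectors, the names `knn_predict`, `TRAIN_X` and `TRAIN_Y`, and the choice k = 3 are illustrative assumptions, not the paper's transit, angle and distance features or its reported settings.

```python
import math
from collections import Counter

def knn_predict(train_x, train_y, query, k=3):
    """Label a query vector by majority vote among its k nearest reference vectors."""
    # Euclidean distance from the query to every reference point,
    # paired with the index of that reference point.
    dists = [(math.dist(x, query), i) for i, x in enumerate(train_x)]
    # Keep the k reference points with the smallest distances.
    nearest = sorted(dists)[:k]
    # Majority vote over the labels of those k neighbors.
    votes = Counter(train_y[i] for _, i in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical toy feature vectors (stand-ins for real extracted features).
TRAIN_X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1),
           (5.0, 5.0), (5.1, 4.9), (4.9, 5.2)]
TRAIN_Y = [0, 0, 0, 7, 7, 7]

if __name__ == "__main__":
    print(knn_predict(TRAIN_X, TRAIN_Y, (0.3, 0.3)))  # query near the first cluster
    print(knn_predict(TRAIN_X, TRAIN_Y, (4.8, 5.0)))  # query near the second cluster
```

This brute-force search compares the query against every reference vector, which is simple but costs time linear in the size of the training set for each test sample.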
