Size of Training Set Vis-à-vis Recognition Accuracy of Handwritten Character Recognition System

Support Vector Machines (SVMs) have successfully been used for character recognition. In the present study, we have shown how the recognition accuracy of a SVM classifier varies with variation in the training set size. The training set for this work is taken from samples of offline handwritten Gurmukhi characters. For recognition of a handwritten Gurmukhi character, we have used curvature features extracted from the skeletonized image of each Gurmukhi character. Features of a character have been computed based on statistical measures of distribution of points on the bitmap image of character. To extract these features, the image of each Gurmukhi character is first segmented into few zones and then the curvature shape is computed within each of these zones. Considering all the zones, a feature set is formed for representation of each image pattern and a database of 3500 isolated handwritten Gurmukhi characters has been used for the same. The results of investigation presented in this paper show that the size of training set has a significant effect on the accuracy of offline handwritten Gurmukhi script recognition system. Index Terms—Feature extraction; curve fitting; handwritten character recognition; SVM.

[1]  P. Vanaja Ranjan,et al.  Zone based Feature Extraction Algorithm for Handwritten Numeral Recognition of Kannada Script , 2009, 2009 IEEE International Advance Computing Conference.

[2]  K. Roy,et al.  Word-wise Hand-written Script Separation for Indian Postal automation , 2006 .

[3]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Pengfei Shi,et al.  Handwritten Bangla numeral recognition system and its application to postal automation , 2007, Pattern Recognit..

[5]  M. K. Jindal,et al.  SVM Based Offline Handwritten Gurmukhi Character Recognition , 2011 .

[6]  Manish Kumar,et al.  Degraded Text Recognition of Gurmukhi Script , 2008 .

[7]  Tetsushi Wakabayashi,et al.  A System for Off-Line Oriya Handwritten Character Recognition Using Curvature Feature , 2007, 10th International Conference on Information Technology (ICIT 2007).

[8]  Rajesh Kumar,et al.  Online Handwritten Gurmukhi Character Recognition Using Elastic Matching , 2008, 2008 Congress on Image and Signal Processing.

[9]  Bidyut Baran Chaudhuri,et al.  An OCR system to read two Indian language scripts: Bangla and Devnagari (Hindi) , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[10]  Dinesh Kumar AI Approach to Hand Written Devnagiri Script Recognition , 1991, TENCON '91. Region 10 International Conference on EC3-Energy, Computer, Communication and Control Systems.

[11]  Venu Govindaraju,et al.  Offline Arabic handwriting recognition: a survey , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Bidyut Baran Chaudhuri,et al.  Automatic Recognition of Unconstrained Off-Line Bangla Handwritten Numerals , 2000, ICMI.

[13]  Tetsushi Wakabayashi,et al.  Handwritten Bangla Compound Character Recognition Using Gradient Feature , 2007 .

[14]  Chandan Singh,et al.  A Gurmukhi script recognition system , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[15]  Veena Bansal,et al.  Integrating knowledge sources in Devanagari text recognition system , 2000, IEEE Trans. Syst. Man Cybern. Part A.

[16]  Munish Kumar,et al.  Segmentation of lines and words in handwritten Gurmukhi script documents , 2010, IITM '10.

[17]  C. Chandra Sekhar,et al.  Online Handwritten Character Recognition of Devanagari and Telugu Characters using Support Vector Machines , 2006 .

[18]  Veena Bansal,et al.  Segmentation of touching and fused Devanagari characters , 2002, Pattern Recognit..

[19]  Madasu Hanmandlu,et al.  Input Fuzzy Modeling for the Recognition of Handwritten Hindi Numerals , 2007, Fourth International Conference on Information Technology (ITNG'07).

[20]  B. Chaudhuri,et al.  A procedure for recognition of connected handwritten numerals , 1982 .

[21]  Rajendra Kumar Sharma,et al.  HMM-based online handwritten gurmukhi character recognition , 2010 .