Off-Line Handwritten Character Recognition of Devnagari Script

In this paper we present a system towards the recognition of off-line handwritten characters of Devnagari, the most popular script in India. The features used for recognition purpose are mainly based on directional information obtained from the arc tangent of the gradient. To get the feature, at first, a 2times2 mean filtering is applied 4 times on the gray level image and a non-linear size normalization is done on the image. The normalized image is then segmented to 49times49 blocks and a Roberts filter is applied to obtain gradient image. Next, the arc tangent of the gradient (direction of gradient) is initially quantized into 32 directions and the strength of the gradient is accumulated with each of the quantized direction. Finally, the blocks and the directions are down sampled using Gaussian filter to get 392 dimensional feature vector. A modified quadratic classifier is applied on these features for recognition. We used 36172 handwritten data for testing our system and obtained 94.24% accuracy using 5-fold cross-validation scheme.

[1]  Nobuyuki Otsu,et al.  ATlreshold Selection Method fromGray-Level Histograms , 1979 .

[2]  Fumitaka Kimura,et al.  Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Ujjwal Bhattacharya,et al.  Neural Combination of ANN and HMM for Handwritten Devanagari Numeral Recognition , 2006 .

[5]  Madasu Hanmandlu,et al.  Fuzzy Model Based Recognition of Handwritten Hindi Numerals using Bacterial Foraging , 2007, 6th IEEE/ACIS International Conference on Computer and Information Science (ICIS 2007).

[6]  Santanu Chaudhury,et al.  Devnagari numeral recognition by combining decision of multiple connectionist classifiers , 2002 .

[7]  Tetsushi Wakabayashi,et al.  Increasing the feature size in handwritten numeral recognition to improve accuracy , 1995, Systems and Computers in Japan.

[8]  Veena Bansal,et al.  A complete OCR for printed Hindi text in Devanagari script , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[9]  Atul Negi,et al.  An OCR system for Telugu , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[10]  Ishwar K. Sethi,et al.  Machine recognition of constrained hand printed devanagari , 1977, Pattern Recognit..

[11]  F. Kimura,et al.  A Lexicon Driven Method for Unconstrained Bangla , 2006 .

[12]  Bidyut Baran Chaudhuri,et al.  A complete printed Bangla OCR system , 1998, Pattern Recognit..

[13]  Bidyut Baran Chaudhuri,et al.  Indian script character recognition: a survey , 2004, Pattern Recognit..

[14]  A. G. Ramakrishnan,et al.  A Complete Tamil Optical Character Recognition System , 2002, Document Analysis Systems.

[15]  Fumitaka Kimura,et al.  Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier , 2006, ICVGIP.