Gujarati handwritten numeral optical character reorganization through neural network

This paper deals with an optical character recognition (OCR) system for handwritten Gujarati numbers. One may find so much of work for Indian languages like Hindi, Kannada, Tamil, Bangala, Malayalam, Gurumukhi etc, but Gujarati is a language for which hardly any work is traceable especially for handwritten characters. Here in this work a neural network is proposed for Gujarati handwritten digits identification. A multi layered feed forward neural network is suggested for classification of digits. The features of Gujarati digits are abstracted by four different profiles of digits. Thinning and skew-correction are also done for preprocessing of handwritten numerals before their classification. This work has achieved approximately 82% of success rate for Gujarati handwritten digit identification.

[1]  G. Hemantha Kumar,et al.  Multilingual OCR system for South Indian scripts and English documents: An approach based on Fourier transform and principal component analysis , 2008, Eng. Appl. Artif. Intell..

[2]  Bidyut B. Chaudhuri,et al.  Segmentation of touching characters in printed Devnagari and Bangla scripts using fuzzy multifactorial analysis , 2002 .

[3]  B. Chatterjee,et al.  Design of a Nearest Neighbour Classifier System for Bengali Character Recognition , 1984 .

[4]  M. B. Sukhaswami,et al.  Recognition of telugu characters using neural networks , 1995, Int. J. Neural Syst..

[5]  P. Vanaja Ranjan,et al.  Efficient Zone-based Hybrid Feature Extraction Algorithm for Handwritten Numeral Recognition of Kannada Script , 2009, IICAI.

[6]  Suban G. Krishnamoorthy,et al.  Recognition of handprinted Tamil characters , 1980, Pattern Recognit..

[7]  Gurpreet Singh Lehal,et al.  Segmentation of Horizontally Overlapping Lines in Printed Indian Scripts , 2007 .

[8]  Atul Negi,et al.  On developing high accuracy OCR systems for Telugu and other Indian scripts , 2002, Language Engineering Conference, 2002. Proceedings.

[9]  Santanu Chaudhury,et al.  Bengali alpha-numeric character recognition using curvature features , 1993, Pattern Recognit..

[10]  Bidyut Baran Chaudhuri,et al.  Indian script character recognition: a survey , 2004, Pattern Recognit..

[11]  P. V. S. Rao,et al.  Telugu script recognition-a feature based approach , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[12]  Sameer Antani,et al.  Gujarati character recognition , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[13]  B. L. Deekshatulu,et al.  Recognition of Printed Telugu Characters , 1977 .

[14]  Bidyut Baran Chaudhuri,et al.  A complete printed Bangla OCR system , 1998, Pattern Recognit..

[15]  N. Shanthi,et al.  A novel SVM-based handwritten Tamil character recognition system , 2010, Pattern Analysis and Applications.

[16]  Bidyut Baran Chaudhuri,et al.  Skew Angle Detection of Digitized Indian Script Documents , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Chin Luh Tan,et al.  Digit Recognition Using Neural Networks , 2004 .

[18]  Bidyut Baran Chaudhuri,et al.  Automatic recognition of printed Oriya script , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[19]  Mandar Mitra,et al.  Automatic recognition of printed Oriya script , 2002 .

[20]  P. S. Sastry,et al.  A font and size-independent OCR system for printed Kannada documents using support vector machines , 2002 .

[21]  Atul Negi,et al.  Zone identification in the printed Gujarati text , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[22]  P. S. Satyanarayana,et al.  OCR for Script Identification of Hindi (Devnagari) Numerals using Error Diffusion Halftoning Algorithm with Neural Classifier , 2007 .

[23]  U. Pal,et al.  Multi-script line identification from Indian documents , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..