Adapting Moments for Handwritten Kannada Kagunita Recognition

Handwritten character recognition (HCR) for Indian languages is an important problem on which relatively little work has been done. In this paper, we investigate the use of moment features for recognizing Kannada Kagunita. Kannada characters are curved in nature, and their shapes show a degree of symmetric structure. This information can be best captured by extracting moment features from directional images. To recognize a Kagunita, we need to identify both the vowel and the consonant present in the image. We therefore compute four directional images from the dynamically preprocessed original image using Gabor wavelets. We also analyze the Kagunita set to identify the regions that carry vowel information and consonant information, cut these regions from the preprocessed original image, and form a set of cut images. We then extract moment features from both the directional images and the cut images. These features are used to train and test a multilayer perceptron neural network with backpropagation for both vowel and Kagunita recognition. On separate test data, using moment features from the directional images and cut images, the average recognition rate is 85% for vowels and 59% for consonants.
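The feature-extraction step described above (four Gabor directional images followed by moment features) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the Gabor parameters or which family of moments is used, so the kernel size, `sigma`, `wavelength`, the choice of normalized central moments up to order 3, the FFT-based circular filtering, and all function names below are assumptions made for illustration. The cut-image step and the multilayer perceptron are not shown.

```python
import numpy as np

def gabor_kernel(theta, size=15, sigma=3.0, wavelength=6.0):
    """Real (even) Gabor kernel at orientation theta, in radians.
    size, sigma and wavelength are illustrative values, not from the paper."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1].astype(float)
    xr = x * np.cos(theta) + y * np.sin(theta)  # coordinate along the carrier
    envelope = np.exp(-(x ** 2 + y ** 2) / (2.0 * sigma ** 2))
    kernel = envelope * np.cos(2.0 * np.pi * xr / wavelength)
    return kernel - kernel.mean()  # zero mean: flat regions give no response

def filter_same(image, kernel):
    """Circular convolution via FFT; output has the same shape as image.
    The image must be at least as large as the kernel."""
    padded = np.zeros(image.shape, dtype=float)
    kh, kw = kernel.shape
    padded[:kh, :kw] = kernel
    # Move the kernel centre to index (0, 0) so the output is not shifted.
    padded = np.roll(padded, (-(kh // 2), -(kw // 2)), axis=(0, 1))
    return np.real(np.fft.ifft2(np.fft.fft2(image) * np.fft.fft2(padded)))

def central_moments(image, max_order=3):
    """Normalized central moments of orders 2..max_order of |image|."""
    orders = [(p, q) for p in range(max_order + 1)
              for q in range(max_order + 1 - p) if p + q >= 2]
    img = np.abs(image)
    total = img.sum()
    if total == 0:
        return np.zeros(len(orders))
    h, w = img.shape
    y, x = np.mgrid[0:h, 0:w].astype(float)
    cx = (x * img).sum() / total  # centroid, x
    cy = (y * img).sum() / total  # centroid, y
    feats = []
    for p, q in orders:
        mu = (((x - cx) ** p) * ((y - cy) ** q) * img).sum()
        feats.append(mu / total ** (1.0 + (p + q) / 2.0))  # scale normalization
    return np.array(feats)

def directional_moment_features(image):
    """Concatenate moment features from four directional images
    (0, 45, 90 and 135 degrees)."""
    thetas = [0.0, np.pi / 4, np.pi / 2, 3 * np.pi / 4]
    return np.concatenate(
        [central_moments(filter_same(image, gabor_kernel(t))) for t in thetas]
    )

# Usage on a synthetic 32x32 image containing one vertical stroke.
image = np.zeros((32, 32))
image[6:26, 15:18] = 1.0
features = directional_moment_features(image)
print(features.shape)
```

With these settings each directional image contributes seven moments, so the feature vector has 28 entries. In a full pipeline this vector would be the input to the classifier, alongside moments computed from the cut images.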
