Shape descriptors based handwritten character recognition engine with application to Kannada characters

In this paper, we discuss the implementation of shape based features, namely, Fourier descriptors and chain codes, for performing optical character recognition of binary images with application to Kannada handwritten characters. Invariant Fourier descriptors and normalized chain codes are obtained as features from preprocessed Kannada character binary images. Well known SVM classifier is used for recognition purpose. As an initial step towards recognition of handwritten characters, we have performed experiments on handwritten Kannada character numerals and vowels. The result computation is done using five-fold cross validation. The mean performance of the recognition system with the two shape based features together is 98.45% and 93.92%, for numeral characters and vowels, respectively. Further, the mean recognition rate of 95% is obtained for both vowels and characters taken together.

[1]  Tetsushi Wakabayashi,et al.  Handwritten Bangla Compound Character Recognition Using Gradient Feature , 2007, 10th International Conference on Information Technology (ICIT 2007).

[2]  Tetsushi Wakabayashi,et al.  Handwritten Character Recognition of Popular South Indian Scripts , 2006, SACH.

[3]  Bidyut Baran Chaudhuri,et al.  Indian script character recognition: a survey , 2004, Pattern Recognit..

[4]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[5]  Masaki Nakagawa,et al.  'Online recognition of Chinese characters: the state-of-the-art , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[7]  Malayappan Shridhar,et al.  On Recognition of Handwritten Bangla Characters , 2006, ICVGIP.

[8]  Ashraf S. Mahmoud,et al.  ARABIC CHARACTER RECOGNITION USING MODIFIED FOURIER SPECTRUM (MFS) VS. FOURIER DESCRIPTORS , 2009, Cybern. Syst..

[9]  Anil K. Jain,et al.  Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[10]  Tetsushi Wakabayashi,et al.  Handwritten Numeral Recognition of Six Popular Indian Scripts , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[11]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Ashraf S. Mahmoud,et al.  Arabic Character Recognition using Modified Fourier Spectrum (MFS) , 2006, Geometric Modeling and Imaging--New Trends (GMAI'06).

[13]  Hong Yan,et al.  An adaptive logical method for binarization of degraded document images , 2000, Pattern Recognit..

[14]  Anilkumar N. Holambe,et al.  Printed and Handwritten Character & Number Recognition of Devanagari Script using Gradient Features , 2010 .

[15]  Ralph Roskies,et al.  Fourier Descriptors for Plane Closed Curves , 1972, IEEE Transactions on Computers.

[16]  Fumitaka Kimura,et al.  Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier , 2006, ICVGIP.

[17]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[18]  Keiji Yamada,et al.  Optimal sampling intervals for Gabor features and printed Japanese character recognition , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[19]  Neil W. Bergmann,et al.  An Arabic optical character recognition system using recognition-based segmentation , 2001, Pattern Recognit..

[20]  B. V. Dhandra,et al.  Spatial Features for Handwritten Kannada and English Character Recognition , 2010 .

[21]  Jean-Paul Gauthier,et al.  Generalized Fourier Descriptors with Applications to Objects Recognition in SVM Context , 2007, Journal of Mathematical Imaging and Vision.

[22]  V. Kumar,et al.  FLD Based Unconstrained Handwritten Kannada Character Recognition , 2008, 2008 Second International Conference on Future Generation Communication and Networking Symposia.