PCA-based Offline Handwritten Character Recognition System

Principal component analysis (PCA) has been used widely in pattern recognition to reduce the extent of the data. In this paper, we explore using this technique to recognize offline handwritten Gurmukhi characters, and a system for offline handwritten Gurmukhi character recognition using PCA is proposed. The system first prepares a skeleton of the character so that meaningful feature information about the character can be extracted. For classification, we used k-nearest neighbor, Linear-SVM, polynomial-SVM and RBF-SVM based approaches and combinations of these approaches. In this work, we collected 16,800 samples of isolated offline handwritten Gurmukhi characters. These samples were divided into three categories. In category 1 (5600 samples), each Gurmukhi character was written 100 times by a single writer. In category 2 (5600 samples), each Gurmukhi character was written 10 times by 10 different writers, and in category 3 (5600 samples), each Gurmukhi character was written by 100 different writers. The set of the basic 35 akhars of Gurmukhi has been considered here. A partitioning strategy for selecting the training and testing patterns is also explored in this work. We used zoning, diagonal, directional, transition, intersection and open end point, parabola curve fitting?based and power curve fitting?based feature extraction in order to find the feature set for a given character. The proposed system achieves a recognition accuracy of 99.06% in category 1, 98.73% in category 2 and 78.30% in category 3.

[1]  S. Sundaram,et al.  Two Dimensional Principal Component Analysis for Online Tamil Character Recognition , 2008 .

[2]  Ankush Mittal,et al.  Evaluation of Different Feature Extractors and Classifiers for Offline Handwritten Devnagari Character Recognition , 2011 .

[3]  Sriganesh Madhvanath,et al.  Principal component analysis for online handwritten character recognition , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[4]  Munish Kumar,et al.  Classification of characters and grading writers in offline handwritten Gurmukhi script , 2011, 2011 International Conference on Image Information Processing.

[5]  Rajesh Kumar,et al.  Online Handwritten Gurmukhi Character Recognition Using Elastic Matching , 2008, 2008 Congress on Image and Signal Processing.

[6]  Chandan Singh,et al.  A Gurmukhi script recognition system , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[7]  Subhadip Basu,et al.  A hierarchical approach to recognition of handwritten Bangla characters , 2009, Pattern Recognit..

[8]  Bidyut Baran Chaudhuri,et al.  Indian script character recognition: a survey , 2004, Pattern Recognit..

[9]  Tetsushi Wakabayashi,et al.  Handwritten Bangla Compound Character Recognition Using Gradient Feature , 2007 .

[10]  Anandarup Roy,et al.  SVM-based hierarchical architectures for handwritten Bangla character recognition , 2009, International Journal on Document Analysis and Recognition (IJDAR).

[11]  G. Hemantha Kumar,et al.  Multilingual OCR system for South Indian scripts and English documents: An approach based on Fourier transform and principal component analysis , 2008, Eng. Appl. Artif. Intell..

[12]  Bidyut Baran Chaudhuri,et al.  Offline recognition of handwritten Bangla characters: an efficient two-stage approach , 2012, Pattern Analysis and Applications.

[13]  M. K. Jindal,et al.  SVM Based Offline Handwritten Gurmukhi Character Recognition , 2011 .

[14]  Tetsushi Wakabayashi,et al.  Handwritten Numeral Recognition of Six Popular Indian Scripts , 2007 .

[15]  Jong Sou Park,et al.  A DTN System for Nanosatellite-based Sensor Networks using a New ALOHA Multiple Access with Gateway Priority , 2013, Smart Comput. Rev..

[16]  Daewon Jung,et al.  New Introduction: Search Method for Similar Images from Representative Video Frame Images , 2013, Smart Comput. Rev..

[17]  Jung-Hsien Chiang,et al.  Handwritten word recognition with character and inter-character neural networks , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[18]  Ching Y. Suen,et al.  A fast parallel algorithm for thinning digital patterns , 1984, CACM.

[19]  Tetsushi Wakabayashi,et al.  A System for Off-Line Oriya Handwritten Character Recognition Using Curvature Feature , 2007, 10th International Conference on Information Technology (ICIT 2007).

[20]  Pengfei Shi,et al.  Handwritten Bangla numeral recognition system and its application to postal automation , 2007, Pattern Recognit..