Isolated handwritten character recognition has been the subject of intensive research during last decades because it is useful in wide range of real world problems. It also provides a solution for processing large volumes of data automatically. Work has been done in recognizing handwritten characters in many languages like Chinese, Arabic, Devnagari, Urdu and English. The work presented in this thesis, focuses on the problem of recognition of isolated handwritten characters in Gurmukhi script. The whole process consists of two stages. The first, feature extraction stage analyzes the set of isolated characters and selects a set of features that can be used to uniquely identify characters. The performance of recognition system depends heavily on what features are being used. The selection of stable and representative set of features is the heart of recognition system. The feature extraction method Zoning, is used for extracting features of the character under consideration in this problem. In Zoning method, the frame containing the character is divided into several overlapping or non-overlapping zones and the densities of object pixels in each zone are calculated. Densities are used to form a representation. The final, classification stage is the main decision making stage of the recognition system. It uses features extracted in the feature extraction stage to identify the character. K-Nearest Neighbor and Support Vector Machine are the two classifiers used for identifying the character in the problem. In k-nearest classification method, the Euclidean distance between the test point and all the reference points is calculated in order to find K nearest neighbors, and then the obtained distances are ranked in ascending order and the reference points corresponding to the k smallest Euclidean distances are taken. The Support Vector Machine (SVM) is learning machine with very good generalization ability. SVM implements the Structural Risk Minimization Principal which seeks to minimize an upper bound of the generalization error. An SVM classifier discriminates two classes of feature vectors by generating hyper-surfaces in the feature space, which are "optimal" in a specific sense that is the hyper-surface obtained by the SVM optimization is guaranteed to have the maximum distance to the "nearest” support vectors. SVM operate on kernel evaluations of the feature vectors. An annotated sample image database of isolated handwritten characters in Gurmukhi script has been prepared which has been used for training and testing of the system.
[1]
Yuan Yan Tang,et al.
Handwritten Character Recognition Using Combined Gradient and Wavelet Feature
,
2006,
2006 International Conference on Computational Intelligence and Security.
[2]
Hong Yan,et al.
Structural decomposition and description of printed and handwritten characters
,
1996,
Proceedings of 13th International Conference on Pattern Recognition.
[3]
Tian Ming,et al.
Improving support vector machine classifier by combining it with k nearest neighbor principle based on the best distance measurement
,
2003,
Proceedings of the 2003 IEEE International Conference on Intelligent Transportation Systems.
[4]
Beatrice Lazzerini,et al.
Fuzzy classification of handwritten characters
,
1999,
18th International Conference of the North American Fuzzy Information Processing Society - NAFIPS (Cat. No.99TH8397).
[5]
Nikos Fakotakis,et al.
A new shape transformation approach to handwritten character recognition
,
2002,
Object recognition supported by user interaction for service robots.
[6]
H. Ishigaki,et al.
A Statistical Approach for Handwritten Character Recognition Using Bayesian Filter
,
2008,
2008 3rd International Conference on Innovative Computing Information and Control.
[7]
Yves Lecourtier,et al.
Combining structural and statistical features for the recognition of handwritten characters
,
1996,
Proceedings of 13th International Conference on Pattern Recognition.
[8]
Tetsushi Wakabayashi,et al.
Off-Line Handwritten Character Recognition of Devnagari Script
,
2007,
Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).
[9]
Madasu Hanmandlu,et al.
Fuzzy logic based handwritten character recognition
,
2001,
Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).
[10]
Julius T. Tou,et al.
Recognition of Handwritten Characters by Topological Feature Extraction and Multilevel Categorization
,
1972,
IEEE Transactions on Computers.
[11]
J.C.H. Poon,et al.
A fuzzy-attributed graph approach to handwritten character recognition
,
1993,
[Proceedings 1993] Second IEEE International Conference on Fuzzy Systems.
[12]
L. J. Cao,et al.
Feature extraction in support vector machine: a comparison of PCA, XPCA and ICA
,
2002,
Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..