Increasing the effectiveness of handwritten Manipuri Meetei-Mayek character recognition using multiple-HOG-feature descriptors

Detecting and reading text in natural images is a difficult computer vision task that is essential to a variety of emerging applications. Document character recognition is one such problem; it has been widely studied by machine learning and computer vision researchers and is used in practice for applications such as recognizing handwritten digits. This paper describes a new approach for efficiently recognizing characters from a total of 56 different classes of Handwritten Manipuri Meetei-Mayek (HMMM), the script of Manipuri, an Indian language. Although character recognition algorithms have been researched and developed for other Indian scripts, no work has been reported so far on recognizing all the characters of Manipuri Meetei-Mayek (MMM). The work begins with a thorough analysis of the recognition task using a single-hidden-layer feedforward multilayer perceptron (an artificial neural network) with Histogram of Oriented Gradients (HOG) feature descriptors. After the accuracy and the training time of the network are reviewed, its limitations are addressed experimentally by using HOG descriptors computed over multiple cell-grid sizes. HOG, being a gradient-based descriptor, is very efficient at data discrimination and very stable under illumination variation. For efficient classification of the HOG features of MMM, a linear multiclass support vector machine (SVM) classifier is proposed for the different offline characters because of its simplicity and speed. Classification based on the linear multiclass SVM yielded a very high overall accuracy of 96.928%.
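The idea of combining HOG descriptors computed over several cell-grid sizes can be illustrated with a minimal sketch. The code below is not the implementation used in the paper: the function names `hog_cells` and `multi_hog`, the 32x32 image size, the cell sizes (4, 8, 16), the 9 orientation bins, and the per-cell normalization (used here in place of the overlapping-block normalization of standard HOG) are all assumptions chosen for illustration.

```python
import numpy as np

def hog_cells(image, cell_size, n_bins=9):
    """Simplified HOG: one orientation histogram per cell, L2-normalized per cell.

    Assumption: standard HOG normalizes over overlapping blocks of cells;
    that step is replaced by per-cell normalization to keep the sketch short.
    """
    img = np.asarray(image, dtype=np.float64)
    gy, gx = np.gradient(img)                      # gradients along rows, columns
    magnitude = np.hypot(gx, gy)
    angle = np.rad2deg(np.arctan2(gy, gx)) % 180.0  # unsigned orientation
    # Map each pixel to an orientation bin; clamp guards against rounding to 180.
    bin_idx = np.minimum((angle / (180.0 / n_bins)).astype(int), n_bins - 1)

    rows, cols = img.shape[0] // cell_size, img.shape[1] // cell_size
    feats = np.zeros((rows, cols, n_bins))
    for r in range(rows):
        for c in range(cols):
            window = (slice(r * cell_size, (r + 1) * cell_size),
                      slice(c * cell_size, (c + 1) * cell_size))
            # Magnitude-weighted vote of every pixel in the cell.
            feats[r, c] = np.bincount(bin_idx[window].ravel(),
                                      weights=magnitude[window].ravel(),
                                      minlength=n_bins)
    norms = np.linalg.norm(feats, axis=2, keepdims=True)
    return (feats / (norms + 1e-6)).ravel()

def multi_hog(image, cell_sizes=(4, 8, 16), n_bins=9):
    """Concatenate simplified HOG vectors computed at several cell sizes."""
    return np.concatenate([hog_cells(image, s, n_bins) for s in cell_sizes])

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    sample = rng.random((32, 32))   # stand-in for a normalized character image
    print(multi_hog(sample).shape)
```

Small cells capture fine stroke detail while large cells summarize the overall shape, so the concatenated vector describes a character at several scales. Such a vector could then be passed to any linear multiclass SVM, for example scikit-learn's `LinearSVC`; that choice of library is likewise an assumption, not something the abstract states.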
