Bangla Handwritten City Name Recognition Using Gradient-Based Feature

In recent times, holistic word recognition has achieved enormous attention from the researchers due to its segmentation-free approach. In the present work, a holistic word recognition method is presented for the recognition of handwritten city names in Bangla script. At first, each word image is hypothetically segmented into equal number of grids. Then gradient-based features, inspired by Histogram of Oriented Gradients (HOG) feature descriptor, are extracted from each of the grids. For the selection of suitable classifier, five well-known classifiers are compared in terms of their recognition accuracies and finally the classifier Sequential Minimal Optimization (SMO) is chosen. The system has achieved 90.65% accuracy on 10,000 samples comprising of 20 most popular city names of West Bengal, a state of India.

[1]  Bidyut Baran Chaudhuri,et al.  A complete printed Bangla OCR system , 1998, Pattern Recognit..

[2]  Mita Nasipuri,et al.  A holistic word recognition technique for handwritten Bangla words , 2015, Int. J. Appl. Pattern Recognit..

[3]  Subhadip Basu,et al.  An improved offline handwritten character segmentation algorithm for Bangla script , 2011, IICAI.

[4]  R. Manmatha,et al.  Holistic word recognition for handwritten historical documents , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[5]  Mita Nasipuri,et al.  Handwritten Bangla Word Recognition Using Elliptical Features , 2014, 2014 International Conference on Computational Intelligence and Communication Networks.

[6]  Simon M. Lucas,et al.  Top-Down Likelihood Word Image Generation Model for Holistic Word Recognition , 2002, Document Analysis Systems.

[7]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[9]  Subhadip Basu,et al.  A hierarchical approach to recognition of handwritten Bangla characters , 2009, Pattern Recognit..

[10]  Malayappan Shridhar,et al.  Offline Handwritten Devanagari Word Recognition: A Holistic Approach Based on Directional Chain Code Feature and HMM , 2008, 2008 International Conference on Information Technology.

[11]  Mita Nasipuri,et al.  Handwritten Bangla Word Recognition Using HOG Descriptor , 2014, 2014 Fourth International Conference of Emerging Applications of Information Technology.

[12]  Subhadip Basu,et al.  CMATERdb1: a database of unconstrained handwritten Bangla and Bangla–English mixed script document image , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[13]  Debashis Ghosh,et al.  Handwritten Devanagari Word Recognition: A Curvelet Transform Based Approach , 2011 .

[14]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[15]  Karim Faez,et al.  Handwritten Farsi (Arabic) word recognition: a holistic approach using discrete HMM , 2001, Pattern Recognit..