Bangla Handwritten Character Recognition: an overview of the state of the art classification algorithm with new dataset

Recognition of handwritten characters from Bangla handwritten texts is of immense importance considering the complexity of the task. Researchers have explored the task of recognizing Bangla handwritten digits, but a few numbers of published works are available for Bangla Handwritten Character Recognition (BHCR). In our paper, we present a comparative overview of classification algorithms for BHCR, which may help the researcher to decide an appropriate classification algorithm for their work. We have created a new dataset of Bangla handwritten characters from 150 volunteers at different levels. We extracted around 2500 samples of Bangla characters, which consist of Bangla Vowels only. Histogram adjustment and other image preprocessing techniques are applied in handwritten characters before their classification. We compare the performance of seven commonly used classification algorithms for BHCR in terms of Sensitivity, Miss Rate, Specificity, Precision, Fall-out, F-score, and Overall Accuracy. This result shows that among the seven algorithms, ANN (Artificial Neural Network) performed best. LR (Logistic Regression) performed well compared to others in terms of the standard measures like sensitivity, specificity and error rate. This comparative overview will help scientists, especially the new researchers to give a quick start with Bangla handwritten character recognition.

[1]  Jonathan Cheung-Wai Chan,et al.  Multiple Criteria for Evaluating Machine Learning Algorithms for Land Cover Classification from Satellite Data , 2000 .

[2]  Mahantapas Kundu,et al.  A statistical-topological feature combination for recognition of handwritten numerals , 2012, Appl. Soft Comput..

[3]  Mahantapas Kundu,et al.  A genetic algorithm based region sampling for selection of local features in handwritten digit recognition application , 2012, Appl. Soft Comput..

[4]  Md. Al-Amin,et al.  Sentiment analysis of Bengali comments with Word2Vec and sentiment information of words , 2017, 2017 International Conference on Electrical, Computer and Communication Engineering (ECCE).

[5]  Mahantapas Kundu,et al.  Handwritten Bangla Digit Recognition Using Classifier Combination Through DS Technique , 2005, PReMI.

[6]  Ching Y. Suen,et al.  A new benchmark on the recognition of handwritten Bangla and Farsi numeral characters , 2009, Pattern Recognit..

[7]  Pooja Kamavisdar,et al.  A Survey on Image Classification Approaches and Techniques , 2013 .

[8]  Md Shopon,et al.  Bangla handwritten digit recognition using autoencoder and deep convolutional neural network , 2016, 2016 International Workshop on Computational Intelligence (IWCI).

[9]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Subhadip Basu,et al.  Recognition of Handwritten Bangla Basic Characters and Digits using Convex Hull based Feature Set , 2014, ArXiv.

[11]  Vijayan K. Asari,et al.  Handwritten Bangla Digit Recognition Using Deep Learning , 2017, ArXiv.

[12]  Vijayan K. Asari,et al.  Handwritten Bangla Character Recognition Using the State-of-the-Art Deep Convolutional Neural Networks , 2017, Comput. Intell. Neurosci..

[13]  P. Atkinson,et al.  Introduction Neural networks in remote sensing , 1997 .

[14]  Md. Mostofa Akbar,et al.  A Comparative Overview of Classification Algorithm for Bangla Handwritten Digit Recognition , 2018, IJCCI.

[15]  Subhadip Basu,et al.  Recognition of Numeric Postal Codes from Multi-script Postal Address Blocks , 2009, PReMI.

[16]  Lambert Schomaker,et al.  Recognition of handwritten characters using local gradient feature descriptors , 2015, Eng. Appl. Artif. Intell..

[17]  Sargur N. Srihari,et al.  Recognition of handwritten and machine-printed text for postal address interpretation , 1993, Pattern Recognit. Lett..

[18]  Mohammad Badrul Alam Miah,et al.  Handwritten Digit Recognition Using Machine Learning Algorithms , 2018 .

[19]  Nilanjan Dey,et al.  A survey of image classification methods and techniques , 2014, 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT).

[20]  Shyla Afroge,et al.  Bangla optical character recognition through segmentation using curvature distance and multilayer perceptron algorithm , 2017, 2017 International Conference on Electrical, Computer and Communication Engineering (ECCE).