Sankhya: An Unbiased Benchmark for Bangla Handwritten Digits Recognition

The rise of artificial intelligence technology along with machine and deep learning are opening up almost limitless possibilities. In recent years, application-based researchers in machine learning and deep learning have started developing solutions for many practical problems. Handwriting recognition is one such area of interest. Bangla, being the seventh most spoken language in the world, is not an exception. However, unlike English, there have not been concerted formal attempts in building a benchmark in comparing the different approaches reported in the literature, mainly because of the lack of openly and freely available datasets and diversity of the approaches without formal comparative studies. In this research paper, we seek to rectify this gap. We have focused on benchmarking five robust algorithms: K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Random Forest (RF), Multi-layer Perceptron (MLP), Convolutional Neural Network (CNN), on all publicly available Bangla handwriting digits datasets, including Ekush, NumtaDB, CMARTdb, and BDRW. NumtaDB itself is a collection of five handwriting datasets. We have worked on fine-tuning these algorithms by finding the best possible hyper-parameters of these algorithms. It is our hope that Sankhya will work as a beginning point of an open and verifiable benchmarking process that we plan to repeat every two years for now on and set a standard for testing and validating newer and novel algorithms that will be reported in this area in the future. In addition, we have extensively compared our research with other states of the art research and our versions of these algorithms are now outperforming every reported result on these datasets.All the datasets we used are open-sourced. In addition, we are making the.csv version of these datasets available in public GitHub. Of all the models we tested, the Sankhya CNN model performed the best for all these datasets, which we fine-tuned specifically for Bangla character recognition. We are making this CNN model available in public GitHub.

[1]  Syed Akhter Hossain,et al.  Ekush: A Multipurpose and Multitype Comprehensive Database for Online Off-Line Bangla Handwritten Characters , 2018, RTIP2R.

[2]  Rakhal Das Banerji,et al.  The Origin of the Bengali Script , 2003 .

[3]  Sheikh Abujar,et al.  Bangla Handwritten Digit Recognition Using Convolutional Neural Network , 2018, Advances in Intelligent Systems and Computing.

[4]  Mamunur Rahaman Mamun,et al.  Bangla Handwritten Digit Recognition Approach with an Ensemble of Deep Residual Networks , 2018, 2018 International Conference on Bangla Speech and Language Processing (ICBSLP).

[5]  Laurent Wendling,et al.  Character recognition based on non-linear multi-projection profiles measure , 2015, Frontiers of Computer Science.

[6]  Hsuan-Tien Lin A Study on Sigmoid Kernels for SVM and the Training of non-PSD Kernels by SMO-type Methods , 2005 .

[7]  Samiul Alam,et al.  NumtaDB - Assembled Bengali Handwritten Digits , 2018, ArXiv.

[8]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[9]  Nabeel Mohammed,et al.  Unconventional Wisdom: A New Transfer Learning Approach Applied to Bengali Numeral Classification , 2018, 2018 International Conference on Bangla Speech and Language Processing (ICBSLP).

[10]  Rafiqul Islam,et al.  BanglaLekha-Isolated: A multi-purpose comprehensive dataset of Handwritten Bangla Isolated characters , 2017, Data in brief.

[11]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[12]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[13]  Nafees Mansoor,et al.  A hybrid deep model with HOG features for Bangla handwritten numeral classification , 2016, 2016 9th International Conference on Electrical and Computer Engineering (ICECE).

[14]  Mita Nasipuri,et al.  A multi-objective approach towards cost effective isolated handwritten Bangla character and digit recognition , 2016, Pattern Recognit..

[15]  Syed Akhter Hossain,et al.  ShonkhaNet: A Dynamic Routing for Bangla Handwritten Digits Recognition Using Capsule Network , 2018, 2018 International Conference on Bangla Speech and Language Processing (ICBSLP).

[16]  Subhadip Basu,et al.  CMATERdb1: a database of unconstrained handwritten Bangla and Bangla–English mixed script document image , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[17]  B. Schölkopf,et al.  1 A primer on kernel methods , 2004 .

[18]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[19]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[20]  Tasnuva Hassan,et al.  Handwritten Bangla numeral recognition using Local Binary Pattern , 2015, 2015 International Conference on Electrical Engineering and Information Communication Technology (ICEEICT).

[21]  Bidyut Baran Chaudhuri,et al.  Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Md Mahmudul Hasan,et al.  Recognition of Bengali Handwritten Digits Using Convolutional Neural Network Architectures , 2018, 2018 International Conference on Bangla Speech and Language Processing (ICBSLP).

[23]  Firoz Mahmud,et al.  Bangla Handwritten Digit Recognition Using Deep CNN for Large and Unbiased Dataset , 2018, 2018 International Conference on Bangla Speech and Language Processing (ICBSLP).