Bangla handwritten character recognition using deep belief network

Recognition of Bangla handwritten characters is a difficult but important task for various emerging applications. For better recognition performance, good feature representation of the character images is a primary requirement. In this study, we investigate a recently proposed machine learning approach called deep learning [1] for Bangla hand written character recognition, with a focus on automatic learning of good representations. This approach differs from the traditional methods of preprocessing the characters for constructing the handcrafted features such as loops and strokes. Among different deep learning structures, we employ the deep belief network (DBN) that takes the raw character images as input and learning proceeds in two steps - an unsupervised feature learning followed by a supervised fine tuning of the network parameters. Unlike traditional neural networks, the DBN is a probabilistic generative model, i.e., we can generate samples from the model and it can fit both the semi-supervised and supervised learning settings. We demonstrate the advantages of unsupervised feature learning through the experimental studies carried on the Bangla basic characters and numerals dataset collected from the Indian Statistical Institute.

[1]  Tara N. Sainath,et al.  Deep Belief Networks using discriminative features for phone recognition , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[3]  Andrew Y. Ng,et al.  Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning , 2011, 2011 International Conference on Document Analysis and Recognition.

[4]  Po Yang,et al.  Handwritten English Word Recognition Based on Convolutional Neural Networks , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[5]  Khalid Saeed,et al.  A view-based approach for recognition of Bengali printed characters , 2009, 2009 World Congress on Nature & Biologically Inspired Computing (NaBIC).

[6]  Peter Kulchyski and , 2015 .

[7]  Geoffrey E. Hinton,et al.  Exponential Family Harmoniums with an Application to Information Retrieval , 2004, NIPS.

[8]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[9]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[10]  Fuad Rahman,et al.  Recognition of handwritten Bengali characters: a novel multistage approach , 2002, Pattern Recognit..

[11]  Geoffrey E. Hinton,et al.  Using fast weights to improve persistent contrastive divergence , 2009, ICML '09.

[12]  Fahim Irfan Alam,et al.  Offline isolated bangla handwritten character recognition using spatial relationships , 2013, 2013 International Conference on Informatics, Electronics and Vision (ICIEV).

[13]  G. G. Rajput,et al.  Handwritten Script Recognition Using DCT, Gabor Filter and Wavelet Features at Line Level , 2012, SOCO 2012.

[14]  Yann LeCun,et al.  Convolutional networks and applications in vision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[15]  Lawrence D. Jackel,et al.  Reading handwritten digits: a ZIP code recognition system , 1992, Computer.

[16]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[17]  Anandarup Roy,et al.  SVM-based hierarchical architectures for handwritten Bangla character recognition , 2009, International Journal on Document Analysis and Recognition (IJDAR).

[18]  Rajat Raina,et al.  Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.