Improved BLSTM Neural Networks for Recognition of On-Line Bangla Complex Words

While bi-directional long short-term BLSTM neural network have been demonstrated to perform very well for English or Arabic, the huge number of different output classes characters encountered in many Asian fonts, poses a severe challenge. In this work we investigate different encoding schemes of Bangla compound characters and compare the recognition accuracies. We propose to model complex characters not as unique symbols, which are represented by individual nodes in the output layer. Instead, we exploit the property of long-distance-dependent classification in BLSTM neural networks. We classify only basic strokes and use special nodes which react to semantic changes in the writing, i.e., distinguishing inter-character spaces from intra-character spaces. We show that our approach outperforms the common approaches to BLSTM neural network-based handwriting recognition considerably.

[1]  Haikal El Abed,et al.  Guide to OCR for Arabic Scripts , 2012, Springer London.

[2]  Masaki Nakagawa,et al.  Recent Results of Online Japanese Handwriting Recognition and Its Applications , 2006, SACH.

[3]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  S. K. Parui,et al.  An Analytic Scheme for Online Handwritten Bangla Cursive Word Recognition , 2008 .

[5]  Seiichi Uchida,et al.  A new HMM for on-line character recognition using pen-direction and pen-coordinate features , 2008, 2008 19th International Conference on Pattern Recognition.

[6]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[7]  Fumitaka Kimura,et al.  A System for Bangla Online Handwritten Text , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[8]  Stefan Jäger,et al.  Arabic and Chinese Handwriting Recognition - SACH 2006 Summit College Park, MD, USA, September 27-28, 2006 Selected Papers , 2008, SACH.

[9]  Alex Graves,et al.  Connectionist Temporal Classification , 2012 .

[10]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Ujjwal Bhattacharya,et al.  Direction Code Based Features for Recognition of Online Handwritten Characters of Bangla , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[12]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[13]  Sebastiano Impedovo,et al.  Frontiers in Handwriting Recognition , 1994 .

[14]  Umapada Pal,et al.  Design of Unsupervised Feature Extraction System for On-line Bangla Handwriting Recognition , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[15]  Bidyut Baran Chaudhuri,et al.  Online Bangla Word Recognition Using Sub-Stroke Level Features and Hidden Markov Models , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[16]  Ujjwal Bhattacharya,et al.  On-line Handwriting Recognition of Indian Scripts - The First Benchmark , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[17]  U. Pal,et al.  Online Bangla Handwriting Recognition System , 2006 .

[18]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[19]  Jürgen Schmidhuber,et al.  Recurrent nets that time and count , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[20]  Jürgen Schmidhuber,et al.  Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[21]  J. Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM networks , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[22]  Bidyut Baran Chaudhuri,et al.  Online handwritten Bangla character recognition using HMM , 2008, 2008 19th International Conference on Pattern Recognition.

[23]  Bidyut Baran Chaudhuri,et al.  Online handwritten Indian script recognition: a human motor function based framework , 2002, Object recognition supported by user interaction for service robots.

[24]  Volkmar Frinken,et al.  Improved Handwriting Recognition by Combining Two Forms of Hidden Markov Models and a Recurrent Neural Network , 2009, CAIP.