A BLSTM Network for Printed Bengali OCR System with High Accuracy

This paper presents a printed Bengali and English text OCR system built on a BLSTM-CTC architecture with a single hidden layer of 128 units. We used neither peephole connections nor dropout in the BLSTM, which helped us obtain better accuracy. The architecture was trained on 47,720 text lines, which also include English words. When tested on 20 different Bengali fonts, it produced a character-level accuracy of 99.32% and a word-level accuracy of 96.65%. Google has also developed a good Indic multi-script OCR system. It sometimes recognizes a Bengali character as the same character in a non-Bengali script, especially Assamese, which is indistinguishable from Bengali except for a few characters. For example, the Bengali character for 'RA' is sometimes recognized as its Assamese counterpart, mainly in conjunct consonant forms. Our OCR is free from such errors. This OCR system is available online at this https URL
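The character-level and word-level accuracy figures quoted above are the kind of metric usually derived from the edit distance between a reference transcription and the OCR output. The abstract does not state the exact formula used, so the following is only a minimal sketch under stated assumptions: accuracy is taken as 100 × (1 − edit distance / reference length), a "character" is a Unicode code point, and a "word" is a whitespace-separated token. The function names `edit_distance` and `accuracy` are hypothetical and chosen for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of tokens)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete a reference symbol
                            curr[j - 1] + 1,      # insert a hypothesis symbol
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def accuracy(refs, hyps, unit="char"):
    """Percentage accuracy over paired text lines.

    Assumed definition (not taken from the paper):
    100 * (1 - total edit distance / total reference length).
    unit="char" compares Unicode code points; unit="word" compares
    whitespace-separated tokens.
    """
    total = 0
    errors = 0
    for ref, hyp in zip(refs, hyps):
        if unit == "word":
            ref, hyp = ref.split(), hyp.split()
        total += len(ref)
        errors += edit_distance(ref, hyp)
    return 100.0 * (1 - errors / total) if total else 0.0


# The script confusion described above counts as one substitution:
# Bengali RA is U+09B0 and Assamese RA is U+09F0.
bengali_ra, assamese_ra = "\u09b0", "\u09f0"
ra_errors = edit_distance(bengali_ra, assamese_ra)
```

Under this definition a single wrong character also makes the whole word wrong, which is one reason word-level accuracy is typically lower than character-level accuracy on the same test set.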
