Indic Handwritten Script Identification using Offline-Online Multimodal Deep Network

In this paper, we propose a novel approach of word-level Indic script identification using only character-level data in training stage. The advantages of using character level data for training have been outlined in section I. Our method uses a multimodal deep network which takes both offline and online modality of the data as input in order to explore the information from both the modalities jointly for script identification task. We take handwritten data in either modality as input and the opposite modality is generated through intermodality conversion. Thereafter, we feed this offline-online modality pair to our network. Hence, along with the advantage of utilizing information from both the modalities, it can work as a single framework for both offline and online script identification simultaneously which alleviates the need for designing two separate script identification modules for individual modality. One more major contribution is that we propose a novel conditional multimodal fusion scheme to combine the information from offline and online modality which takes into account the real origin of the data being fed to our network and thus it combines adaptively. An exhaustive experiment has been done on a data set consisting of English and six Indic scripts. Our proposed framework clearly outperforms different frameworks based on traditional classifiers along with handcrafted features and deep learning based methods with a clear margin. Extensive experiments show that using only character level training data can achieve state-of-art performance similar to that obtained with traditional training using word level data in our framework.

[1]  Prasenjit Dey,et al.  HMM-based Indic handwritten word recognition using zone segmentation , 2016, Pattern Recognit..

[2]  Makoto Yasuhara,et al.  Recovery of Drawing Order from Single-Stroke Handwriting Images , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Mohamed Cheriet,et al.  Feature Set Evaluation for Offline Handwriting Recognition Systems: Application to the Recurrent Neural Network Model , 2016, IEEE Transactions on Cybernetics.

[4]  Jiwen Lu,et al.  MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[5]  Ikram Moalla,et al.  Extraction of Arabic text from multilingual documents , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[6]  Mita Nasipuri,et al.  Handwritten Mixed-Script Recognition System: A Comprehensive Approach , 2016, FICTA.

[7]  Giuseppe Pirlo,et al.  Script Identification of Multi-Script Documents: A Survey , 2017, IEEE Access.

[8]  P. S. Hiremath,et al.  Script identification in a handwritten document image using texture features , 2010, 2010 IEEE 2nd International Advance Computing Conference (IACC).

[9]  Bastian Leibe,et al.  Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[10]  Junwei Han,et al.  CNNs-Based RGB-D Saliency Detection via Cross-View Transfer and Multiview Fusion. , 2018, IEEE transactions on cybernetics.

[11]  Umapada Pal,et al.  Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[12]  Tetsushi Wakabayashi,et al.  Handwritten Numeral Recognition of Six Popular Indian Scripts , 2007 .

[13]  Mita Nasipuri,et al.  Offline Script Identification from multilingual Indic-script documents: A state-of-the-art , 2015, Comput. Sci. Rev..

[14]  Mohammed Bennamoun,et al.  A Multi-Modal, Discriminative and Spatially Invariant CNN for RGB-D Object Labeling , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Nibaran Das,et al.  Deep learning for word-level handwritten Indic script identification , 2018, RTIP2R.

[16]  Nibaran Das,et al.  Automatic Indic script identification from handwritten documents: page, block, line and word-level approach , 2019, Int. J. Mach. Learn. Cybern..

[17]  Umapada Pal,et al.  Word-Wise Thai and Roman Script Identification , 2009, TALIP.

[18]  Anil K. Jain,et al.  Online handwritten script recognition , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Alexander H. Waibel,et al.  Online handwriting recognition: the NPen++ recognizer , 2001, International Journal on Document Analysis and Recognition.

[20]  Umapada Pal,et al.  Handwriting Recognition in Indian Regional Scripts: A Survey of Offline Techniques , 2012, TALIP.

[21]  Umapada Pal,et al.  HMM-based writer identification in music score documents without staff-line removal , 2017, Expert Syst. Appl..

[22]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[23]  Subhransu Maji,et al.  Bilinear CNN Models for Fine-Grained Visual Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[24]  Jürgen Schmidhuber,et al.  Unconstrained On-line Handwriting Recognition with Recurrent Neural Networks , 2007, NIPS.

[25]  Dimosthenis Karatzas,et al.  Improving patch-based scene text script identification with ensembles of conjoined networks , 2016, Pattern Recognit..

[26]  Nibaran Das,et al.  Handwritten Indic Script Identification in Multi-Script Document Images: A Survey , 2018, Int. J. Pattern Recognit. Artif. Intell..

[27]  Hao Chen,et al.  CNNs-Based RGB-D Saliency Detection via Cross-View Transfer and Multiview Fusion , 2017 .

[28]  Marcus Liwicki,et al.  A sequence learning approach for multiple script identification , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[29]  B. V. Dhandra,et al.  Offline Handwritten Script Identification in Document Images , 2010 .

[30]  Ben M. Herbst,et al.  Estimating the pen trajectories of static signatures using hidden Markov models , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Nibaran Das,et al.  Improved word-level handwritten Indic script identification by integrating small convolutional neural networks , 2019, Neural Computing and Applications.

[32]  Fei-Fei Li,et al.  Deep visual-semantic alignments for generating image descriptions , 2015, CVPR.

[33]  Yang Gao,et al.  Compact Bilinear Pooling , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Jiebo Luo,et al.  Multi-modal deep feature learning for RGB-D object detection , 2017, Pattern Recognit..

[35]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[36]  Miguel Angel Ferrer-Ballester,et al.  Multiple Training - One Test Methodology for Handwritten Word-Script Identification , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[37]  Shijian Lu,et al.  Discriminative Multi-modal Feature Fusion for RGBD Indoor Scene Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Umapada Pal,et al.  Local Binary Pattern for Word Spotting in Handwritten Historical Document , 2016, S+SSPR.

[39]  Richard Bowden,et al.  Local binary patterns for multi-view facial expression recognition , 2011 .

[40]  Nibaran Das,et al.  Extreme learning machine for handwritten Indic script identification in multiscript documents , 2018, J. Electronic Imaging.

[41]  Jiashi Feng,et al.  Multimodal Learning and Reasoning for Visual Question Answering , 2017, NIPS.

[42]  Xiang Bai,et al.  An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  G. G. Rajput,et al.  Handwritten Script Recognition using DCT and Wavelet Features at Block Level , 2010 .

[44]  Angelo Chianese,et al.  Recovering dynamic information from static handwriting , 1993, Pattern Recognit..

[45]  Xiang Bai,et al.  Script identification in the wild via discriminative convolutional neural network , 2016, Pattern Recognit..

[46]  Parul Sahare,et al.  Script identification algorithms: a survey , 2017, International Journal of Multimedia Information Retrieval.

[47]  Adel M. Alimi,et al.  Temporal Order Recovery of the Scanned Handwriting , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[48]  Mita Nasipuri,et al.  Word-level script identification for handwritten Indic scripts , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[49]  Adel M. Alimi,et al.  2009 10th International Conference on Document Analysis and Recognition Combining Multiple HMMs Using On-line and Off-line Features for Off-line Arabic Handwriting Recognition , 2022 .

[50]  Umapada Pal,et al.  Cross-language Framework for Word Recognition and Spotting of Indic Scripts , 2017, Pattern Recognit..

[51]  Bidyut Baran Chaudhuri,et al.  A system for Indian postal automation , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[52]  C. V. Jawahar,et al.  Recognition of printed Devanagari text using BLSTM Neural Network , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[53]  Nibaran Das,et al.  PHDIndic_11: page-level handwritten document image dataset of 11 official Indic scripts for script identification , 2017, Multimedia Tools and Applications.

[54]  Nitish Srivastava,et al.  Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..

[55]  Xiang Bai,et al.  Scene text script identification with Convolutional Recurrent Neural Networks , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[56]  Azriel Rosenfeld,et al.  Recovery of temporal information from static images of handwriting , 2005, International Journal of Computer Vision.

[57]  Nibaran Das,et al.  Numeral Script Identification from Handwritten Document Images , 2015 .

[58]  U. Pal,et al.  Neural network based word-wise handwritten script identification system for Indian postal automation , 2005, Proceedings of 2005 International Conference on Intelligent Sensing and Information Processing, 2005..

[59]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[60]  Nibaran Das,et al.  AUTOMATIC LINE-LEVEL SCRIPT IDENTIFICATION FROM HANDWRITTEN DOCUMENT IMAGES - A REGION-WISE CLASSIFICATION FRAMEWORK FOR INDIAN SUBCONTINENT , 2018 .

[61]  Debashis Ghosh,et al.  Script Recognition—A Review , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.