Two streams deep neural network for handwriting word recognition

Handwritten word recognition is an active topic in automatic handwritten text recognition and has received considerable attention in recent years. Unlike character recognition, word recognition must cope with large variations in word shape and writing style. This paper proposes a novel deep model for language-independent handwritten word recognition. The proposed structure has two parallel streams that jointly learn character-level and word-level information. In the character-level stream, a weak character segmentation is performed, and a series of long short-term memory (LSTM) layers is then applied to obtain a character-level representation. The word-level stream employs a series of convolutional layers to represent the shape and structure of the whole word. The two representations are concatenated and passed to a series of fully connected layers, which jointly learn the word-level and character-level information. Because character segmentation is language-dependent and error-prone, the proposed structure applies only a weak separation scheme and does not rely on any character segmentation algorithm. It can therefore exploit the character-level representation without being bound to any language model. We also introduce two new data augmentation strategies, based on a psychological assumption, to improve the generalization performance of the model. Experimental results on five public datasets covering Arabic, English, and German demonstrate that the proposed model outperforms state-of-the-art methods.
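The abstract does not spell out how the weak separation is carried out. As a minimal sketch, assuming it amounts to cutting the word image into a fixed number of near-equal vertical strips (with no attempt to locate true character boundaries), the inputs to the two streams could be prepared as follows. The names `weak_segments` and `two_stream_inputs`, and the `num_segments` and `overlap` parameters, are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# A grayscale image stored as rows of pixel intensities (height x width).
Image = List[List[float]]


def weak_segments(image: Image, num_segments: int, overlap: int = 0) -> List[Image]:
    """Cut a word image into `num_segments` vertical strips of near-equal width.

    Assumption: "weak" segmentation means uniform slicing, so no
    language-specific character boundary detection is involved.
    `overlap` widens each strip by that many columns on both sides.
    """
    if not image or not image[0]:
        raise ValueError("image must be non-empty")
    width = len(image[0])
    if not 1 <= num_segments <= width:
        raise ValueError("num_segments must be between 1 and the image width")
    if overlap < 0:
        raise ValueError("overlap must be non-negative")

    strips = []
    for i in range(num_segments):
        # Integer boundaries spread any remainder columns over the strips.
        start = max(0, i * width // num_segments - overlap)
        end = min(width, (i + 1) * width // num_segments + overlap)
        strips.append([row[start:end] for row in image])
    return strips


def two_stream_inputs(image: Image, num_segments: int) -> Tuple[List[Image], Image]:
    """Return (strip sequence for the recurrent stream, full image for the
    convolutional stream). The networks themselves are out of scope here."""
    return weak_segments(image, num_segments), image


if __name__ == "__main__":
    # Toy 2 x 10 "image" whose pixel values equal their column index.
    toy = [list(range(10)), list(range(10))]
    strips, whole = two_stream_inputs(toy, num_segments=3)
    print([len(s[0]) for s in strips])
```

In this sketch the strip sequence would feed the LSTM layers and the untouched image would feed the convolutional layers; the feature vectors produced by the two streams would then be concatenated before the fully connected layers, as the abstract describes.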
