From Textline to Paragraph: A Promising Practice for Chinese Text Recognition

Although handwritten Chinese text recognition (HCTR) has achieved tremendous progress in the past decades, the traditional document analysis system suffers from two main problems: (1) The annotation of position and transcript at line level is costly to obtain; (2) The framework consists of several separately trained modules, and it’s difficult for the complex system to get satisfying results. Therefore, handwritten paragraph recognition attempts to incorporate the textline segmentation and recognition into a complete network. However, large character set and great insufficient training samples make it troublesome for handwritten Chinese paragraph recognition (HCPR). In this paper, a novel framework is proposed for HCPR. To make the training process faster and more stable, we put forward the Multi-Dimensional LSTM Convolutional Attention (MLCA) recognition framework. A new writing-style-aware image synthesis method is utilized as well to overcome the problem of data insufficiency. We conduct several experiments on the ICDAR-2013 competition dataset and the corresponding corrupted dataset. From the compelling results, we can draw an encouraging conclusion that it would be a promising trend to move from HCTR to HCPR for Chinese document analysis system.

[1]  Jihad El-Sana,et al.  Simplifying the reading of historical manuscripts , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[2]  David S. Doermann,et al.  Voronoi++: A Dynamic Page Segmentation Approach Based on Voronoi and Docstrum Features , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[3]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[4]  Jérôme Louradour,et al.  Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention , 2016, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[5]  Christian Wolf,et al.  Recognition : Learning Where to Start and When to Stop , 2017 .

[6]  Jérôme Louradour,et al.  Segmentation-free handwritten Chinese text recognition with LSTM-RNN , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[7]  Jürgen Schmidhuber,et al.  Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[8]  William A. Barrett,et al.  Start, Follow, Read: End-to-End Full-Page Handwriting Recognition , 2018, ECCV.

[9]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[10]  Tianwen Zhang,et al.  Off-line recognition of realistic Chinese handwriting using segmentation-free strategy , 2009, Pattern Recognit..

[11]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[12]  Abdel Belaïd,et al.  Multi-oriented Text Line Extraction from Handwritten Arabic Documents , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[13]  Lin Li,et al.  Enhancing RNN Based OCR by Transductive Transfer Learning From Text to Images , 2018, AAAI.

[14]  Christopher Kermorvant,et al.  Curriculum Learning for Handwritten Text Line Recognition , 2013, 2014 11th IAPR International Workshop on Document Analysis Systems.

[15]  Yi-Chao Wu,et al.  Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models , 2017, Pattern Recognit..

[16]  Fei Yin,et al.  Handwritten Chinese Text Recognition by Integrating Multiple Contexts , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Christopher Kermorvant,et al.  The A2iA Multi-lingual Text Recognition System at the Second Maurdor Evaluation , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[18]  Yi-Chao Wu,et al.  Handwritten Chinese Text Recognition Using Separable Multi-Dimensional Recurrent Neural Network , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[19]  Christopher Kermorvant,et al.  The A2iA Arabic Handwritten Text Recognition System at the Open HaRT2013 Evaluation , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[20]  Yann Dauphin,et al.  Convolutional Sequence to Sequence Learning , 2017, ICML.

[21]  Fei Yin,et al.  CASIA Online and Offline Chinese Handwriting Databases , 2011, 2011 International Conference on Document Analysis and Recognition.

[22]  Théodore Bluche,et al.  Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition , 2016, NIPS.