Online trajectory recovery from offline handwritten Japanese kanji characters of multiple strokes

In general, it is straightforward to render an offline handwriting image from an online handwriting pattern. However, it is challenging to reconstruct an online handwriting pattern given an offline handwriting image, especially for multiple-stroke character as Japanese kanji. The multiple-stroke character requires not only point coordinates but also stroke orders whose difficulty is exponential growth by the number of strokes. Besides, several crossed and touch points might increase the difficulty of the recovered task. We propose a deep neural network-based method to solve the recovered task using a large online handwriting database. Our proposed model has two main components: Convolutional Neural Network-based encoder and Long Short-Term Memory Network-based decoder with an attention layer. The encoder focuses on feature extraction while the decoder refers to the extracted features and generates the time-sequences of coordinates. We also demonstrate the effect of the attention layer to guide the decoder during the reconstruction. We evaluate the performance of the proposed method by both visual verification and handwritten character recognition. Although the visual verification reveals some problems, the recognition experiments demonstrate the effect of trajectory recovery in improving the accuracy of offline handwritten character recognition when online recognition for the recovered trajectories are combined.

[1]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[2]  Yu Qiao,et al.  Recover Writing Trajectory from Multiple Stroked Image Using Bidirectional Dynamic Search , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[3]  Zhong-sheng Cao,et al.  A Model for Recovering Writing Sequence from Offline Handwritten Chinese Character Image , 2008, 2008 Congress on Image and Signal Processing.

[4]  Tianqi Chen,et al.  Empirical Evaluation of Rectified Activations in Convolutional Network , 2015, ArXiv.

[5]  Ben M. Herbst,et al.  Verification of dynamic curves extracted from static handwritten scripts , 2008, Pattern Recognit..

[6]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[7]  Hung Tuan Nguyen,et al.  Online Japanese Handwriting Recognizers using Recurrent Neural Networks , 2018, 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[8]  Louis Vuurpijl,et al.  Automatic trajectory extraction and validation of scanned handwritten characters , 2006 .

[9]  Michael Blumenstein,et al.  Techniques for static handwriting trajectory recovery: a survey , 2010, DAS '10.

[10]  Masaki Nakagawa,et al.  Collection of on-line handwritten Japanese character pattern databases and their analyses , 2004, Document Analysis and Recognition.

[11]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[12]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[13]  Samy Bengio,et al.  Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  David H. Douglas,et al.  ALGORITHMS FOR THE REDUCTION OF THE NUMBER OF POINTS REQUIRED TO REPRESENT A DIGITIZED LINE OR ITS CARICATURE , 1973 .

[15]  Cuong Tuan Nguyen,et al.  An Attention-Based End-to-End Model for Multiple Text Lines Recognition in Japanese Historical Documents , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[16]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[17]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[18]  Stefan Jäger Recovering dynamic information from static, handwritten word images: bridging the gap between on-line and off-line handwriting recognition , 1998 .

[19]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[20]  Thomas Deselaers,et al.  A Scalable Handwritten Text Recognition System , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[21]  Volker Sorge,et al.  Trajectory recovery and stroke reconstruction of handwritten mathematical symbols , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[22]  Makoto Yasuhara,et al.  Recovery of Drawing Order from Single-Stroke Handwriting Images , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[24]  Fei Yin,et al.  CASIA Online and Offline Chinese Handwriting Databases , 2011, 2011 International Conference on Document Analysis and Recognition.

[25]  Shiliang Zhang,et al.  Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition , 2017, Pattern Recognit..

[26]  Douglas Eck,et al.  A Neural Representation of Sketch Drawings , 2017, ICLR.

[27]  Umapada Pal,et al.  Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).