TRACE: A Differentiable Approach to Line-level Stroke Recovery for Offline Handwritten Text

Stroke order and velocity are helpful features in the fields of signature verification, handwriting recognition, and handwriting synthesis. Recovering these features from offline handwritten text is a challenging and well-studied problem. We propose a new model called TRACE (Trajectory Recovery by an Adaptively-trained Convolutional Encoder). TRACE is a differentiable approach that uses a convolutional recurrent neural network (CRNN) to infer temporal stroke information from long lines of offline handwritten text with many characters and dynamic time warping (DTW) to align predictions and ground truth points. TRACE is perhaps the first system to be trained end-to-end on entire lines of text of arbitrary width and does not require the use of dynamic exemplars. Moreover, the system does not require images to undergo any pre-processing, nor do the predictions require any post-processing. Consequently, the recovered trajectory is differentiable and can be used as a loss function for other tasks, including synthesizing offline handwritten text. We demonstrate that temporal stroke information recovered by TRACE from offline data can be used for handwriting synthesis and establish the first benchmarks for a stroke trajectory recovery system trained on the IAM online handwriting dataset.

[1]  Ullrich Köthe,et al.  Analyzing Inverse Problems with Invertible Neural Networks , 2018, ICLR.

[2]  Samy Bengio,et al.  Density estimation using Real NVP , 2016, ICLR.

[3]  Hung Tuan Nguyen,et al.  Online trajectory recovery from offline handwritten Japanese kanji characters of multiple strokes , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[4]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[5]  Brian L. Price,et al.  Text and Style Conditioned GAN for the Generation of Offline-Handwriting Lines , 2020, BMVC.

[6]  Guohua Ren,et al.  Recognition of Online Handwriting with Variability on Smart Devices , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[9]  Victor Carbune,et al.  Multi-Language Online Handwriting Recognition , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Claudio M. Privitera,et al.  The segmentation of cursive handwriting: an approach based on off-line recovery of the motor-temporal information , 1999, IEEE Trans. Image Process..

[11]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[12]  Michael Blumenstein,et al.  Techniques for static handwriting trajectory recovery: a survey , 2010, DAS '10.

[13]  Alexei A. Efros,et al.  The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[14]  Umapada Pal,et al.  Indic Handwritten Script Identification using Offline-Online Multimodal Deep Network , 2018, Inf. Fusion.

[15]  Seiichi Uchida,et al.  Modality Conversion of Handwritten Patterns by Cross Variational Autoencoders , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[16]  Umapada Pal,et al.  Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[17]  Sarel Cohen,et al.  ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Victor Carbune,et al.  Fast multi-language LSTM-based online handwriting recognition , 2020, International Journal on Document Analysis and Recognition (IJDAR).

[19]  William A. Barrett,et al.  Data Augmentation for Recognition of Handwritten Words and Lines Using a CNN-LSTM Network , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[20]  Jianhua Tao,et al.  Pen Tip Motion Prediction for Handwriting Drawing Order Recovery using Deep Neural Network , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[21]  Ben M. Herbst,et al.  Estimating the pen trajectories of static signatures using hidden Markov models , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Alex Graves,et al.  Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[23]  Alex Graves,et al.  Stochastic Backpropagation through Mixture Density Distributions , 2016, ArXiv.

[24]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[25]  Marcus Liwicki,et al.  IAM-OnDB - an on-line English sentence database acquired from handwritten text on a whiteboard , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).