End-to-End Online Writer Identification With Recurrent Neural Network

Writer identification is an important topic for pattern recognition and artificial intelligence. Traditional methods rely heavily on sophisticated hand-crafted features to represent the characteristics of different writers. In this paper, we propose an end-to-end framework for online text-independent writer identification by using a recurrent neural network (RNN). Specifically, the handwriting data of a particular writer are represented by a set of random hybrid strokes (RHSs). Each RHS is a randomly sampled short sequence representing pen tip movements ( $xy$-coordinates) and pen-down or pen-up states. RHS is independent of the content and language involved in handwriting; therefore, writer identification at the RHS level is more general and convenient than the character level or the word level, which also requires character/word segmentation. The RNN model with bidirectional long short-term memory is used to encode each RHS into a fixed-length vector for final classification. All the RHSs of a writer are classified independently, and then, the posterior probabilities are averaged to make the final decision. The proposed framework is end-to-end and does not require any domain knowledge for handwriting data analysis. Experiments on both English (133 writers) and Chinese (186 writers) databases verify the advantages of our method compared with other state-of-the-art approaches.

[1]  Jürgen Schmidhuber,et al.  Learning Precise Timing with LSTM Recurrent Networks , 2003, J. Mach. Learn. Res..

[2]  Pietro Perona,et al.  Visual Identification by Signature Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[4]  Nasir D. Memon,et al.  Online Signature Verification on Mobile Devices , 2014, IEEE Transactions on Information Forensics and Security.

[5]  Lianwen Jin,et al.  DeepWriterID: An End-to-End Online Text-Independent Writer Identification System , 2015, IEEE Intelligent Systems.

[6]  Marcus Liwicki,et al.  A writer identification system for on-line whiteboard data , 2008, Pattern Recognit..

[7]  Lianwen Jin,et al.  Chinese character-level writer identification using path signature feature, DropStroke and deep CNN , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[8]  Samy Bengio,et al.  Writer Identification for Smart Meeting Room Systems , 2006, Document Analysis Systems.

[9]  Jürgen Schmidhuber,et al.  Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.

[10]  Wojciech Zaremba,et al.  An Empirical Exploration of Recurrent Network Architectures , 2015, ICML.

[11]  Marcus Liwicki,et al.  Automatic gender detection using on-line and off-line information , 2011, Pattern Analysis and Applications.

[12]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[13]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[14]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Giuseppe Pirlo,et al.  Automatic Signature Verification: The State of the Art , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[16]  Tieniu Tan,et al.  Hierarchical Shape Primitive Features for Online Text-independent Writer Identification , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[17]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[18]  Marcos Faúndez-Zanuy,et al.  Biometric recognition using online uppercase handwritten text , 2012, Pattern Recognit..

[19]  Alex Graves,et al.  Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[20]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[21]  Yoshua Bengio,et al.  Describing Multimedia Content Using Attention-Based Encoder-Decoder Networks , 2015, IEEE Transactions on Multimedia.

[22]  Lambert Schomaker,et al.  Text-Independent Writer Identification and Verification Using Textural and Allographic Features , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[24]  Arun Ross,et al.  An introduction to biometrics , 2008, ICPR 2008.

[25]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[26]  Tieniu Tan,et al.  Personal identification based on handwriting , 2000, Pattern Recognit..

[27]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[28]  Anil K. Jain,et al.  Online handwritten script recognition , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[30]  Atsuto Maki,et al.  Factors of Transferability for a Generic ConvNet Representation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[32]  Razvan Pascanu,et al.  Theano: new features and speed improvements , 2012, ArXiv.

[33]  H. N. Prakash,et al.  Online Signature Verification and Recognition: An Approach Based on Symbolic Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Marcus Liwicki,et al.  A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks , 2007 .

[35]  Nidal S. Kamel,et al.  Glove-Based Approach to Online Signature Verification , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Lianwen Jin,et al.  Character-level Chinese Writer Identification using Path Signature Feature, DropStroke and Deep CNN , 2015, ArXiv.