Writer Identification and Retrieval Using a Convolutional Neural Network

In this paper a novel method for writer identification and retrieval is presented. Writer identification is the process of finding the author of a specific document by comparing it to documents in a database where writers are known, whereas retrieval is the task of finding similar handwritings or all documents of a specific writer. The method presented is using Convolutional Neural Networks CNN to generate a feature vector for each writer, which is then compared with the precalculated feature vectors stored in the database. For the generation of this vector the CNN is trained on a database with known writers and after training the classification layer is cut off and the output of the second last fully connected layer is used as feature vector. For the identification a nearest neighbor classification is used. The evaluation is performed on the ICDAR2013 Competition on Writer Identification, ICDAR 2011 Writer Identification Contest, and the CVL-Database datasets. Experiments show, that this novel approach achieves better results to previously presented writer identification approaches.

[1]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[2]  David S. Doermann,et al.  Offline Writer Identification Using K-Adjacent Segments , 2011, 2011 International Conference on Document Analysis and Recognition.

[3]  Robert Sablatnig,et al.  CVL-DataBase: An Off-Line Database for Writer Retrieval, Writer Identification and Word Spotting , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[4]  Robert Sablatnig,et al.  Writer Retrieval and Writer Identification Using Local Features , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[5]  Robert Sablatnig,et al.  End-to-End Text Recognition Using Local Ternary Patterns, MSER and Deep Convolutional Nets , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[6]  Tara N. Sainath,et al.  Deep convolutional neural networks for LVCSR , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[8]  Basilios Gatos,et al.  ICDAR 2011 Writer Identification Contest , 2011, 2011 International Conference on Document Analysis and Recognition.

[9]  Horst Bunke,et al.  Writer identification using text line based features , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[10]  Robert Sablatnig,et al.  Text Line Detection for Heterogeneous Documents , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[11]  A. Papandreou,et al.  ICDAR 2013 Competition on Writer Identification , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[12]  Xin Li,et al.  Writer Identification of Chinese Handwriting Using Grid Microstructure Feature , 2009, ICB.

[13]  Robert Sablatnig,et al.  Writer Identification and Writer Retrieval Using the Fisher Vector on Visual Vocabularies , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[14]  Elli Angelopoulou,et al.  Writer identification and verification using GMM supervectors , 2014, IEEE Winter Conference on Applications of Computer Vision.

[15]  S. Shivashankar,et al.  Writer identification in a handwritten document image using texture features , 2010, 2010 International Conference on Signal and Image Processing.

[16]  David Doermann,et al.  Combining Local Features for Offline Writer Identification , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[17]  Louis Vuurpijl,et al.  Writer identification using edge-based directional features , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18]  Tao Wang,et al.  End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[19]  David S. Doermann,et al.  Writer Identification Using an Alphabet of Contour Gradient Descriptors , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[20]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.