A deep learning framework for text-independent writer identification

Abstract Handwriting Writer Identification (HWI) refers to the process of handwriting text image analysis to identify the authorship of the documents. It has yielded promising results in various applications, including digital forensics, criminal purposes, exploring the writer of historical documents, etc. The complexity of the text image, especially in images with various handwriting makes the writer identification difficult. In this work, we propose an end-to-end system that relies on a straightforward yet well-designed deep network and very efficient feature extraction, emphasizing feature engineering. Our system is an extended version of ResNet by conjugating deep residual networks and a new traditional yet high-quality handwriting descriptor towards handwriting analysis. Our descriptor analyzes the handwriting thickness as a preliminary and essential feature for human handwriting characteristics. Our approach can also provide text-independent writer identification that we do not need to have the same handwriting content for learning our model. The proposed approach is evaluated and achieved consistent results on four public and well-known datasets of IAM, Firemaker, CVL, and CERUG-EN. We empirically demonstrate that our conjugated network outperforms the original ResNet, and it can work well for real-world applications in which patches with few letters exist.

[1]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Fouad Khelifi,et al.  Robust off-line text independent writer identification using bagged discrete cosine transform features , 2017, Expert Syst. Appl..

[3]  Karl Ni,et al.  Writer Identification in Noisy Handwritten Documents , 2017, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[4]  Louis Vuurpijl,et al.  Forensic writer identification: a benchmark data set and a comparison of two systems , 2000 .

[5]  Yassine Ruichek,et al.  Cross multi-scale locally encoded gradient patterns for off-line text-independent writer identification , 2020, Eng. Appl. Artif. Intell..

[6]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[7]  Lambert Schomaker,et al.  Deep Adaptive Learning for Writer Identification based on Single Handwritten Word Images , 2018, Pattern Recognit..

[8]  Reza Safabakhsh,et al.  Offline text-independent writer identification using codebook and efficient code extraction methods , 2013, Image Vis. Comput..

[9]  Jian Sun,et al.  Convolutional neural networks at constrained time cost , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Yu-Jie Xiong,et al.  Off-line Text-Independent Writer Recognition: A Survey , 2017, Int. J. Pattern Recognit. Artif. Intell..

[11]  Lambert Schomaker,et al.  Writer identification using directional ink-trace width measurements , 2012, Pattern Recognit..

[12]  Bipin Indurkhya,et al.  Text-independent writer identification using convolutional neural network , 2017, Pattern Recognit. Lett..

[13]  Youbao Tang,et al.  Offline Text-Independent Writer Identification Based on Scale Invariant Feature Transform , 2014, IEEE Transactions on Information Forensics and Security.

[14]  Lambert Schomaker,et al.  Writer identification using curvature-free features , 2017, Pattern Recognit..

[15]  Lambert Schomaker,et al.  Junction detection in handwritten documents and its application to writer identification , 2015, Pattern Recognit..

[16]  Linjie Xing,et al.  DeepWriter: A Multi-stream Deep CNN for Text-Independent Writer Identification , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[17]  Yassine Ruichek,et al.  An effective and conceptually simple feature representation for off-line text-independent writer identification , 2019, Expert Syst. Appl..

[18]  Claudio De Stefano,et al.  Reliable writer identification in medieval manuscripts through page layout features: The "Avila" Bible case , 2018, Eng. Appl. Artif. Intell..

[19]  Youbao Tang,et al.  Text-Independent Writer Identification via CNN Features and Joint Bayesian , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[20]  Lambert Schomaker,et al.  FragNet: Writer Identification Using Deep Fragment Networks , 2020, IEEE Transactions on Information Forensics and Security.

[21]  Mahdi Jampour,et al.  Chaos game theory and its application for offline signature identification , 2019, IET Biom..

[22]  Imran Siddiqi,et al.  Writer identification using texture descriptors of handwritten fragments , 2016, Expert Syst. Appl..

[23]  A. Ibañez-Molina,et al.  Spatial Analysis of Handwritten Texts as a Marker of Cognitive Control , 2018, Journal of motor behavior.

[24]  Robert Sablatnig,et al.  Writer Identification and Retrieval Using a Convolutional Neural Network , 2015, CAIP.

[25]  Fouad Khelifi,et al.  Dissimilarity Gaussian Mixture Models for Efficient Offline Handwritten Text-Independent Identification Using SIFT and RootSIFT Descriptors , 2019, IEEE Transactions on Information Forensics and Security.

[26]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.