An online writer identification system using regression-based feature normalization and codebook descriptors

Adaptation of the VLAD framework to online writer identification.Addressal of a potential drawback with the VLAD framework.Proposal of a novel descriptor that alleviates the drawback of the VLAD.A feature normalization method that enhances the writer identification performance.Proposal tested on IAM and IBM UB 1 Online Handwriting Databases. This paper describes a strategy to identify the authorship of online handwritten documents. We regard our research framework to that of a retrieval problem and adapt the so called codebook based Vector of Local Aggregate descriptor (VLAD) that has been promising for the object retrieval application in image processing. The codebook comprises a set of code vectors with associated Voronoi cells computed from a clustering algorithm on a set of feature vectors along the online trace. However, we show that the VLAD formulation at times, cannot effectively discriminate between writers, when their respective feature vectors are not linearly separable in the Voronoi cell of the code vectors. To overcome this problem, we propose a novel descriptor that improves upon the VLAD formulation. Secondly, we explore a normalization for the feature vectors prior to the generation of the VLAD. Our method is different to the minmax and z-score in that it takes care in ensuring that the codevectors are not influenced by the presence of outliers in the data. The performance of our proposed descriptor with the new feature normalization are evaluated on two publicly available Online Handwriting Databases the IAM and IBM-UB1. The results show a marked improvement over the VLAD.

[1]  Thomas Mensink,et al.  Image Classification with the Fisher Vector: Theory and Practice , 2013, International Journal of Computer Vision.

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  Venu Govindaraju,et al.  A hierarchical Bayesian approach to online writer identification , 2013, IET Biom..

[4]  Robinson Piramuthu,et al.  Geometric VLAD for Large Scale Image Search , 2014, ArXiv.

[5]  Christian Viard-Gaudin,et al.  Automatic writer identification framework for online handwritten documents using character prototypes , 2009, Pattern Recognit..

[6]  Anoop M. Namboodiri,et al.  Text Independent Writer Identification from Online Handwriting , 2006 .

[7]  Marcus Liwicki,et al.  A writer identification system for on-line whiteboard data , 2008, Pattern Recognit..

[8]  Christian Viard-Gaudin,et al.  Online writer identification using character prototypes distributions , 2008, Electronic Imaging.

[9]  Venu Govindaraju,et al.  Modeling Writing Styles for Online Writer Identification: A Hierarchical Bayesian Approach , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[10]  Marcus Liwicki,et al.  IAM-OnDB - an on-line English sentence database acquired from handwritten text on a whiteboard , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[11]  Lianwen Jin,et al.  DeepWriterID: An End-to-End Online Text-Independent Writer Identification System , 2015, IEEE Intelligent Systems.

[12]  Andrew Zisserman,et al.  All About VLAD , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Christian Viard-Gaudin,et al.  Individuality of alphabet knowledge in online writer identification , 2009, International Journal on Document Analysis and Recognition (IJDAR).

[14]  Thierry Paquet,et al.  A writer identification and verification system , 2005, Pattern Recognit. Lett..

[15]  Slim Kanoun,et al.  Text-Independent Writer Identification on Online Arabic Handwriting , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[16]  Adel M. Alimi,et al.  Multi-fractal Modeling for On-line Text-Independent Writer Identification , 2011, 2011 International Conference on Document Analysis and Recognition.

[17]  Douglas A. Reynolds,et al.  Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..

[18]  Arun Ross,et al.  Score normalization in multimodal biometric systems , 2005, Pattern Recognit..

[19]  Luiz Eduardo Soares de Oliveira,et al.  Texture-based descriptors for writer identification and verification , 2013, Expert Syst. Appl..

[20]  Suresh Sundaram,et al.  A subtractive clustering scheme for text-independent online writer identification , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[21]  Cordelia Schmid,et al.  Aggregating Local Image Descriptors into Compact Codes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Christian Viard-Gaudin,et al.  Online Writer Identification Using Fuzzy C-means Clustering of Character Prototypes , 2008, ICFHR 2008.

[23]  H. Bunke,et al.  Fusing Asynchronous Feature Streams for On-line Writer Identification , 2007 .

[24]  Tieniu Tan,et al.  Hierarchical Shape Primitive Features for Online Text-independent Writer Identification , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[25]  Bin Fang,et al.  Fragmented edge structure coding for Chinese writer identification , 2012, Neurocomputing.

[26]  Andreas K. Maier,et al.  Writer Identification Using GMM Supervectors and Exemplar-SVMs , 2017, Pattern Recognit..

[27]  Tieniu Tan,et al.  Online Text-Independent Writer Identification Based on Stroke's Probability Distribution Function , 2007, ICB.

[28]  Venu Govindaraju,et al.  IBM_UB_1: A Dual Mode Unconstrained English Handwriting Dataset , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[29]  Adel M. Alimi,et al.  Deep neural network for online writer identification using Beta-elliptic model , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[30]  Samy Bengio,et al.  Writer Identification for Smart Meeting Room Systems , 2006, Document Analysis Systems.

[31]  Venu Govindaraju,et al.  Data Sufficiency for Online Writer Identification: A Comparative Study of Writer-Style Space vs. Feature Space Models , 2014, 2014 22nd International Conference on Pattern Recognition.

[32]  Arun Ross,et al.  An introduction to biometric recognition , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Marcus Liwicki,et al.  Sparse radial sampling LBP for writer identification , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[34]  Nicole Vincent,et al.  Text independent writer recognition using redundant writing patterns with contour-based orientation and curvature features , 2010, Pattern Recognit..

[35]  Adel M. Alimi,et al.  Online Arabic writer identification based on Beta-elliptic model , 2015, 2015 15th International Conference on Intelligent Systems Design and Applications (ISDA).