A subtractive clustering scheme for text-independent online writer identification

This paper proposes a text-independent writer identification framework for online handwritten text. The method utilizes an unsupervised learning scheme termed `subtractive clustering' to discover the unique writing styles of a given author. Subtractive clustering has been adopted in the literature for the problems of image segmentation and speaker identification. To the best of our knowledge, its applicability in the domain of writer identification is yet to be explored. Unlike traditional clustering techniques such as k-means and fuzzy c-means, the subtractive clustering algorithm does not rely on the initial choice of seed points. Instead, it locates the high density regions in the feature space, and this make this scheme an interesting exploration to capture the writing styles of an author (referred to as `prototypes'). The discovered prototypes from the clustering algorithm are subsequently employed to score the authorship of an unknown handwritten text. In addition, inspired from the t f-idf approach used in document retrieval, we propose a modified scoring scheme for identifying the writer. The efficacy of the algorithms are evaluated on the paragraphs from the IAM-Online Handwritten Database.

[1]  Douglas A. Reynolds,et al.  Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..

[2]  JaeYeol Rheem,et al.  Speaker Identification Based on Subtractive Clustering Algorithm with Estimating Number of Clusters , 2005, TSD.

[3]  Arun Ross,et al.  An introduction to biometric recognition , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Venu Govindaraju,et al.  Modeling Writing Styles for Online Writer Identification: A Hierarchical Bayesian Approach , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[5]  Marcus Liwicki,et al.  A writer identification system for on-line whiteboard data , 2008, Pattern Recognit..

[6]  Gökhan Bilgin,et al.  Segmentation of Hyperspectral Images via Subtractive Clustering and Cluster Validation Using One-Class Support Vector Machines , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Slim Kanoun,et al.  Text-Independent Writer Identification on Online Arabic Handwriting , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[8]  Christian Viard-Gaudin,et al.  Individuality of alphabet knowledge in online writer identification , 2009, International Journal on Document Analysis and Recognition (IJDAR).

[9]  Tieniu Tan,et al.  Online Text-Independent Writer Identification Based on Stroke's Probability Distribution Function , 2007, ICB.

[10]  Samy Bengio,et al.  Writer Identification for Smart Meeting Room Systems , 2006, Document Analysis Systems.

[11]  Anil K. Jain,et al.  On-line signature verification, , 2002, Pattern Recognit..

[12]  Christian Viard-Gaudin,et al.  Automatic writer identification framework for online handwritten documents using character prototypes , 2009, Pattern Recognit..

[13]  Anoop M. Namboodiri,et al.  Text Independent Writer Identification from Online Handwriting , 2006 .

[14]  Tieniu Tan,et al.  Hierarchical Shape Primitive Features for Online Text-independent Writer Identification , 2009, 2009 10th International Conference on Document Analysis and Recognition.