Dissimilarity Gaussian Mixture Models for Efficient Offline Handwritten Text-Independent Identification Using SIFT and RootSIFT Descriptors

Handwriting biometrics is the science of identifying the behavioral aspects of an individual's writing style and exploiting them to develop automated writer identification and verification systems. This paper presents an efficient handwriting identification system which combines scale-invariant feature transform (SIFT) and RootSIFT descriptors in a set of Gaussian mixture models (GMMs). In particular, a new concept of similarity and dissimilarity Gaussian mixture models (SGMM and DGMM) is introduced. An SGMM is constructed for every writer to describe the intra-class similarity exhibited between handwritten texts of the same writer, whereas a DGMM represents the contrast, or dissimilarity, between that writer's style and other handwriting styles. Furthermore, because a handwritten text is described by a number of key point descriptors, each of which generates an SGMM/DGMM score, a new weighted histogram method is proposed to derive the intermediate prediction score for each writer's GMM. The weighted histogram exploits the fact that handwriting samples from the same writer should exhibit more similar textual patterns than dissimilar ones; hence, penalizing the bad scores with a cost function can significantly enhance the identification rate. The proposed system has been extensively assessed on six public datasets (three English, two Arabic, and one hybrid-language), and the results show that it outperforms state-of-the-art techniques.
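As an illustration of the descriptor side of the pipeline, the following is a minimal sketch of the standard RootSIFT transform: each SIFT descriptor is L1-normalized and then square-rooted element-wise. The function name `root_sift` and the `eps` guard are assumptions made for this example, and the sketch does not reproduce the paper's own implementation or its SGMM/DGMM scoring.

```python
import numpy as np


def root_sift(descriptors, eps=1e-7):
    """Convert SIFT descriptors to RootSIFT (hypothetical helper).

    descriptors: array of shape (n_keypoints, 128) with non-negative entries,
    as produced by a typical SIFT extractor.
    eps: small constant (an assumption here) that avoids division by zero
    for all-zero descriptors.
    """
    d = np.asarray(descriptors, dtype=np.float64)
    # L1-normalize each descriptor (row).
    d = d / (np.abs(d).sum(axis=1, keepdims=True) + eps)
    # Element-wise square root; each non-zero row then has unit L2 norm
    # up to the effect of eps.
    return np.sqrt(d)


# Usage with synthetic stand-in descriptors (not real handwriting data).
rng = np.random.default_rng(0)
sift = rng.random((5, 128)) * 255.0
rsift = root_sift(sift)
```

Comparing RootSIFT vectors with Euclidean distance corresponds to comparing the original SIFT vectors with the Hellinger kernel, which is the usual motivation for using the two descriptor types side by side.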
