Extraction and Analysis of Document Examiner Features from Vector Skeletons of Grapheme 'th'

This paper presents a study of 25 structural features extracted from samples of the grapheme 'th' that correspond to features commonly used by forensic document examiners. Most of the features are extracted from vector skeletons produced by a specially developed skeletonisation algorithm. The feature extraction methods are presented along with the results. The usefulness of the features was analysed, and three categories were identified: indispensable, partially relevant and irrelevant for determining the authorship of genuine unconstrained handwriting. The categories were assigned by searching for optimal feature sets with the wrapper method, using a constructive neural network as the classifier and a genetic algorithm to drive the search. It is shown that structural micro features similar to those used in forensic document analysis do possess discriminative power. The results are also compared with those obtained in our preceding study, and it is shown that vector skeletonisation allows both the extraction of more structural features and an improvement in feature extraction accuracy from 87% to 94%.
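The wrapper search described in the abstract can be illustrated with a minimal sketch. It is not the implementation used in the study: a nearest-centroid classifier stands in for the constructive neural network, the data are synthetic, and the names `make_data`, `centroid_accuracy`, `fitness`, `genetic_search` and `categorise`, the subset-size penalty, and the rule that maps repeated search results to the three categories are all assumptions made for illustration.

```python
import random

def make_data(rng, n_per_class=30, n_classes=3, n_features=8, informative=(0, 1, 2)):
    """Synthetic stand-in for feature vectors: only `informative` columns depend on the writer."""
    X, y = [], []
    for c in range(n_classes):
        for _ in range(n_per_class):
            X.append([rng.gauss(3.0 * c if j in informative else 0.0, 1.0)
                      for j in range(n_features)])
            y.append(c)
    return X, y

def centroid_accuracy(X_train, y_train, X_test, y_test, mask):
    """Accuracy of a nearest-centroid classifier using only features where mask is 1."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature set cannot classify anything
    centroids = {}
    for c in set(y_train):
        rows = [x for x, label in zip(X_train, y_train) if label == c]
        centroids[c] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    correct = 0
    for x, label in zip(X_test, y_test):
        pred = min(centroids,
                   key=lambda c: sum((x[j] - m) ** 2 for j, m in zip(cols, centroids[c])))
        correct += pred == label
    return correct / len(y_test)

def fitness(mask, data, penalty=0.01):
    """Wrapper criterion: classifier accuracy minus an assumed small cost per selected feature."""
    return centroid_accuracy(*data, mask) - penalty * sum(mask)

def genetic_search(data, n_features, rng, pop_size=20, generations=30, p_mut=0.05):
    """Simple genetic algorithm over bit masks (1 = feature selected)."""
    pop = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=lambda m: fitness(m, data), reverse=True)
        parents = ranked[: pop_size // 2]
        children = [ranked[0][:]]  # elitism: keep the best mask unchanged
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = [rng.choice(pair) for pair in zip(a, b)]         # uniform crossover
            child = [bit ^ (rng.random() < p_mut) for bit in child]  # bit-flip mutation
            children.append(child)
        pop = children
    return max(pop, key=lambda m: fitness(m, data))

def categorise(masks):
    """Assumed rule: selected in every run, in some runs, or in no run."""
    counts = [sum(col) for col in zip(*masks)]
    return ["indispensable" if k == len(masks)
            else "irrelevant" if k == 0
            else "partially relevant" for k in counts]

# Repeat the search with different seeds and compare the selected subsets.
data_rng = random.Random(0)
data = make_data(data_rng) + make_data(data_rng)  # (X_train, y_train, X_test, y_test)
best_masks = [genetic_search(data, 8, random.Random(seed)) for seed in range(5)]
for index, category in enumerate(categorise(best_masks)):
    print(f"feature {index}: {category}")
```

The classifier is treated as a black box inside `fitness`, which is what distinguishes a wrapper from a filter approach: swapping in a different classifier changes only `centroid_accuracy`.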
