New mathematical and algorithmic schemes for pattern classification with application to the identification of writers of important ancient documents

Abstract: In this paper, a novel approach is introduced for classifying curves into proper families, according to their similarity. First, a mathematical quantity we call plane curvature is introduced and a number of propositions are stated and proved. Proper similarity measures of two curves are introduced and a subsequent statistical analysis is applied. First, the efficiency of the curve fitting process has been tested on 2 shapes datasets of reference. Next, the methodology has been applied to the very important problem of classifying 23 Byzantine codices and 46 Ancient inscriptions to their writers, thus achieving correct dating of their content. The inscriptions have been attributed to ten individual hands and the Byzantine codices to four writers.

[1]  Lior Wolf,et al.  Identifying Join Candidates in the Cairo Genizah , 2011, International Journal of Computer Vision.

[2]  Guillermo Sapiro,et al.  A Continuum Mechanical Approach to Geodesics in Shape Space , 2011, International Journal of Computer Vision.

[3]  Michael I. Miller,et al.  Large Deformation Diffeomorphic Metric Curve Mapping , 2008, International Journal of Computer Vision.

[4]  D. Mumford,et al.  A Metric on Shape Space with Explicit Geodesics , 2007, 0706.4299.

[5]  Mihalis Exarhos,et al.  Contour-shape based reconstruction of fragmented, 1600 BC wall paintings , 2002, IEEE Trans. Signal Process..

[6]  Zhuowen Tu,et al.  Learning Context-Sensitive Shape Similarity by Graph Transduction , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Mihalis Exarhos,et al.  Image and Pattern Analysis of 1650 B.C. Wall Paintings and Reconstruction , 2008, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[8]  Nikos Paragios,et al.  Shape registration in implicit spaces using information theory and free form deformations , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Lambert Schomaker,et al.  Text-Independent Writer Identification and Verification Using Textural and Allographic Features , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  J. B. Alonso,et al.  Writer identification based on graphology techniques , 2008, 2008 42nd Annual IEEE International Carnahan Conference on Security Technology.

[11]  Pitak Thumwarin,et al.  On-line writer recognition for Thai based on velocity of barycenter of pen-point movement , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[12]  Tieniu Tan,et al.  Personal identification based on handwriting , 2000, Pattern Recognit..

[13]  Xiuwen Liu,et al.  Shape of Elastic Strings in Euclidean Space , 2008, International Journal of Computer Vision.

[14]  Yuan Yan Tang,et al.  A novel method for offline handwriting-based writer identification , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[15]  Lambert Schomaker,et al.  A Reevaluation and Benchmark of Hidden Markov Models , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[16]  Tieniu Tan,et al.  Biometric personal identification based on handwriting , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[17]  Graham Leedham,et al.  Writer identification using innovative binarised features of handwritten numerals , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18]  Nicholas R. Howe,et al.  Style-based retrieval for ancient Syriac manuscripts , 2011, HIP '11.

[19]  Lambert Schomaker,et al.  Automatic writer identification using connected-component contours and edge-based features of uppercase Western script , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Daniel Cremers,et al.  Integral Invariants for Shape Matching , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Ronen Basri,et al.  Shape representation and classification using the Poisson equation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[22]  Horst Bunke,et al.  Off-line handwriting identification using HMM based recognizers , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[23]  Horst Bunke,et al.  A writer identification and verification system using HMM based recognizers , 2006, Pattern Analysis and Applications.

[24]  Anuj Srivastava,et al.  Analysis of planar shapes using geodesic paths on shape spaces , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Horst Bunke,et al.  Writer identification using text line based features , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[26]  Louis Vuurpijl,et al.  Writer identification through information retrieval: the allograph weight vector , 2008, ICFHR 2008.

[27]  Pascal Frossard,et al.  Minimum Distance between Pattern Transformation Manifolds: Algorithm and Applications , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  G. Hommel A comparison of two modified Bonferroni procedures , 1989 .

[29]  Manuel Contreras Seitz,et al.  HACIA LA CONSTITUCIÓN DE UN CORPUS DIACRÓNICO DEL ESPAÑOL DE CHILE , 2009 .

[30]  Aleix M. Martínez,et al.  Rotation Invariant Kernels and Their Application to Shape Analysis , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Joulia Chapran,et al.  Biometric Writer Identification: Feature Analysis and Classification , 2006, Int. J. Pattern Recognit. Artif. Intell..

[32]  Vassilis Anastassopoulos,et al.  Morphological waveform coding for writer identification , 2000, Pattern Recognit..

[33]  Irccyn,et al.  Tenth international workshop on frontiers in handwriting recognition , 2006 .

[34]  Christian Viard-Gaudin,et al.  Online writer identification using character prototypes distributions , 2008, Electronic Imaging.

[35]  Thierry Paquet,et al.  A writer identification and verification system , 2005, Pattern Recognit. Lett..

[36]  Sung-Hyuk Cha,et al.  MULTIPLE FEATURE INTEGRATION FOR WRITER VERIFICATION , 2004 .

[37]  Sargur N. Srihari,et al.  Individuality of handwritten characters , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[38]  Christian Viard-Gaudin,et al.  Automatic writer identification framework for online handwritten documents using character prototypes , 2009, Pattern Recognit..

[39]  Zhenyu He,et al.  Writer identification of Chinese handwriting documents using hidden Markov tree model , 2008, Pattern Recognit..

[40]  Chang-Dong Wang,et al.  A Stroke Shape and Structure Based Approach for Off-line Chinese Handwriting Identification , 2011 .

[41]  Haibin Ling,et al.  Shape Classification Using the Inner-Distance , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Thierry Paquet,et al.  Handwriting analysis for writer verification , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[43]  Bhabatosh Chanda,et al.  Writer Identification for Handwritten Telugu Documents Using Directional Morphological Features , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[44]  Lambert Schomaker,et al.  Using codebooks of fragmented connected-component contours in forensic and historic writer identification , 2007, Pattern Recognit. Lett..

[45]  Graham Leedham,et al.  Extraction and analysis of forensic document examiner features used for writer identification , 2007, Pattern Recognit..

[46]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.