U-statistics for estimating performance metrics in forensic handwriting analysis

A class of computationally efficient approximations to a set of natural U-statistics and related U-processes that arise in forensic and biometric comparisons have been developed. This paper details the asymptotic characterization of the natural U-statistics, the development of computationally efficient approximations and expected error bounds of said approximations. The developed statistical methods are presented in the context of forensic handwriting comparisons and are used to estimate the rate at which the random match probability (RMP) decreases as a function of the amount of writing available in the comparison samples. Although presented in a forensic handwriting comparison context, similar problems arise in machine learning or pattern recognition where the task is to classify a batch of exchangeable objects that arise from the same class. Similarly, the developed methods can be used to estimate the rate at which the error rate decreases as the number of objects to be classified increases.

[1]  Christopher P. Saunders,et al.  Building a unified statistical framework for the forensic identification of source problems , 2018 .

[2]  Steven P. Lund,et al.  Likelihood Ratio as Weight of Forensic Evidence: A Closer Look. , 2017, Journal of research of the National Institute of Standards and Technology.

[3]  Bryan Found,et al.  The Modular Forensic Handwriting Method - 2016 Version , 2016 .

[4]  Agnieszka Martyna,et al.  Statistical Analysis in Forensic Science: Evidential Value of Multivariate Physicochemical Data , 2014 .

[5]  Amanda B. Hepler,et al.  Score-based likelihood ratios for handwriting evidence. , 2012, Forensic science international.

[6]  Amanda B. Hepler,et al.  Using subsampling to estimate the strength of handwriting evidence via score-based likelihood ratios. , 2012, Forensic science international.

[7]  JoAnn Buscaglia,et al.  Using Automated Comparisons to Quantify Handwriting Individuality *,†,‡ , 2011, Journal of forensic sciences.

[8]  Michael E. Schuckers,et al.  Computational Methods in Biometric Authentication , 2010 .

[9]  Donald T. Gantz,et al.  Construction and evaluation of classifiers for forensic document analysis , 2010, 1004.0678.

[10]  National Research Council Strengthening Forensic Science in the United States: A Path Forward , 2009, CrimRxiv.

[11]  Lambert Schomaker,et al.  Text-Independent Writer Identification and Verification Using Textural and Allographic Features , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Shaun Ladham,et al.  The risk of body packing: a case of a fatal cocaine overdose. , 2005, Forensic science international.

[13]  C. Aitken,et al.  Statistics and the Evaluation of Evidence for Forensic Scientists: Aitken/Statistics and the Evaluation of Evidence for Forensic Scientists , 2005 .

[14]  D. Balding Weight-of-Evidence for Forensic DNA Profiles , 2005 .

[15]  Franco Taroni,et al.  Statistics and the Evaluation of Evidence for Forensic Scientists , 2004 .

[16]  Sharath Pankanti,et al.  Guide to Biometrics , 2003, Springer Professional Computing.

[17]  Alan Agresti,et al.  Categorical Data Analysis , 2003 .

[18]  Roy Huber,et al.  Handwriting Identification: Facts and Fundamentals , 1999 .

[19]  Michael J. Saks,et al.  Science and Nonscience in the Courts: Daubert Meets Handwriting Identification Expertise , 1996 .

[20]  D. Stoney,et al.  Evaluation of Associative Evidence: Choosing the Relevant Question , 1984 .

[21]  R. Serfling Approximation Theorems of Mathematical Statistics , 1980 .

[22]  M. D. McLean I. The Setting , 1959 .

[23]  Davide Mazzella Questioned Documents , 2015 .

[24]  Jonathan J. Koehler,et al.  The Individualization Fallacy in Forensic Science Evidence , 2008 .

[25]  Brandon L. Garrett,et al.  Committee on Identifying the Needs of the Forensic Sciences Community , 2007 .

[26]  Arun Ross,et al.  Handbook of Multibiometrics , 2006, The Kluwer international series on biometrics.

[27]  S. Selvin,et al.  The probability of non-discrimination or likelihood of guilt of an accused: criminal identification. , 1983, Journal - Forensic Science Society.

[28]  Ing Rj Ser Approximation Theorems of Mathematical Statistics , 1980 .