Score-based likelihood ratios for handwriting evidence.

Score-based approaches for computing forensic likelihood ratios are becoming more prevalent in the forensic literature. When two items of evidential value are entangled via a scorefunction, several nuances arise when attempting to model the score behavior under the competing source-level propositions. Specific assumptions must be made in order to appropriately model the numerator and denominator probability distributions. This process is fairly straightforward for the numerator of the score-based likelihood ratio, entailing the generation of a database of scores obtained by pairing items of evidence from the same source. However, this process presents ambiguities for the denominator database generation - in particular, how best to generate a database of scores between two items of different sources. Many alternatives have appeared in the literature, three of which we will consider in detail. They differ in their approach to generating denominator databases, by pairing (1) the item of known source with randomly selected items from a relevant database; (2) the item of unknown source with randomly generated items from a relevant database; or (3) two randomly generated items. When the two items differ in type, perhaps one having higher information content, these three alternatives can produce very different denominator databases. While each of these alternatives has appeared in the literature, the decision of how to generate the denominator database is often made without calling attention to the subjective nature of this process. In this paper, we compare each of the three methods (and the resulting score-based likelihood ratios), which can be thought of as three distinct interpretations of the denominator proposition. Our goal in performing these comparisons is to illustrate the effect that subtle modifications of these propositions can have on inferences drawn from the evidence evaluation procedure. The study was performed using a data set composed of cursive writing samples from over 400 writers. We found that, when provided with the same two items of evidence, the three methods often would lead to differing conclusions (with rates of disagreement ranging from 0.005 to 0.48). Rates of misleading evidence and Tippet plots are both used to characterize the range of behavior for the methods over varying sized questioned documents. The appendix shows that the three score-based likelihood ratios are theoretically very different not only from each other, but also from the likelihood ratio, and as a consequence each display drastically different behavior.

[1]  Matthieu Schmittbuhl,et al.  Probabilistic evaluation of handwriting evidence: likelihood ratio for authorship , 2008 .

[2]  Donald T. Gantz,et al.  Construction and evaluation of classifiers for forensic document analysis , 2010, 1004.0678.

[3]  Didier Meuwly,et al.  The inference of identity in forensic speaker recognition , 2000, Speech Commun..

[4]  James M. Curran,et al.  The Statistical Interpretation of Forensic Glass Evidence , 2003 .

[5]  C Champod,et al.  Establishing the most appropriate databases for addressing source level propositions. , 2004, Science & justice : journal of the Forensic Science Society.

[6]  Franco Taroni,et al.  Statistics and the Evaluation of Evidence for Forensic Scientists , 2004 .

[7]  Anders Nordgaard,et al.  Assessment of Approximate Likelihood Ratios from Continuous Distributions: A Case Study of Digital Camera Identification * , 2011, Journal of forensic sciences.

[8]  Colin Aitken,et al.  Evaluation of trace evidence in the form of multivariate data , 2004 .

[9]  Didier Meuwly,et al.  Computation of Likelihood Ratios in Fingerprint Identification for Configurations of Three Minutiæ , 2006, Journal of forensic sciences.

[10]  Arun Ross,et al.  Handbook of Multibiometrics , 2006, The Kluwer international series on biometrics.

[11]  I. Evett,et al.  Earmarks as evidence: a critical review. , 2001, Journal of forensic sciences.

[12]  Cedric Neumann,et al.  New perspectives in the use of ink evidence in forensic science: Part III: Operational applications and evaluation. , 2009, Forensic science international.

[13]  Daniel Ramos,et al.  Forensic Automatic Speaker Classification in the "Coming Paradigm Shift" , 2007, Speaker Classification.

[14]  David Lindley,et al.  A problem in forensic science , 1977 .

[15]  David J. Spiegelhalter,et al.  Statistical methods for healthcare regulation: rating, screening and surveillance , 2012 .

[16]  Christophe Champod,et al.  Evidence evaluation in fingerprint comparison and automated fingerprint identification systems--modelling within finger variability. , 2007, Forensic science international.

[17]  Cedric Neumann,et al.  New perspectives in the use of ink evidence in forensic science Part II. Development and testing of mathematical algorithms for the automatic comparison of ink samples analysed by HPTLC. , 2009, Forensic science international.

[18]  Leaverton Le,et al.  The use of statistics. , 1970 .

[19]  Claude Roux,et al.  Statistics and the Evaluation of Evidence for Forensic Scientists, by Colin G. G. Aitken and Franco Taroni 2nd edition. John Wiley and Sons, 2004. , 2006 .

[20]  Amanda B. Hepler,et al.  Using subsampling to estimate the strength of handwriting evidence via score-based likelihood ratios. , 2012, Forensic science international.

[21]  Cedric Neumann,et al.  Quantifying the weight of evidence from a forensic fingerprint comparison: a new paradigm , 2012 .

[22]  Colin Aitken,et al.  The use of statistics in forensic science , 1991 .

[23]  D Meuwly,et al.  Forensic individualisation from biometric data. , 2006, Science & justice : journal of the Forensic Science Society.

[24]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[25]  R. Koons,et al.  Evaluation of Match Criteria Used for the Comparison of Refractive Index of Glass Fragments *,†,‡,§ , 2011, Journal of forensic sciences.

[26]  Cedric Neumann,et al.  Interpretation of complex DNA profiles using Tippett plots , 2008 .

[27]  Ian W. Evett,et al.  A Bayesian approach to interpreting footwear marks in forensic casework , 1998 .