Understanding Y haplotype matching probability.

The Y haplotype population-genetic terrain is better explored from a fresh perspective rather than by analogy with the more familiar autosomal ideas. For haplotype matching probabilities, versus for autosomal matching probabilities, explicit attention to modelling - such as how evolution got us where we are - is much more important while consideration of population frequency is much less so. This paper explores, extends, and explains some of the concepts of "Fundamental problem of forensic mathematics - the evidential strength of a rare haplotype match". That earlier paper presented and validated a "kappa method" formula for the evidential strength when a suspect matches a previously unseen haplotype (such as a Y-haplotype) at the crime scene. Mathematical implications of the kappa method are intuitive and reasonable. Suspicions to the contrary raised in rest on elementary errors. Critical to deriving the kappa method or any sensible evidential calculation is understanding that thinking about haplotype population frequency is a red herring; the pivotal question is one of matching probability. But confusion between the two is unfortunately institutionalized in much of the forensic world. Examples make clear why (matching) probability is not (population) frequency and why uncertainty intervals on matching probabilities are merely confused thinking. Forensic matching calculations should be based on a model, on stipulated premises. The model inevitably only approximates reality, and any error in the results comes only from error in the model, the inexactness of the approximation. Sampling variation does not measure that inexactness and hence is not helpful in explaining evidence and is in fact an impediment. Alternative haplotype matching probability approaches that various authors have considered are reviewed. Some are based on no model and cannot be taken seriously. For the others, some evaluation of the models is discussed. Recent evidence supports the adequacy of the simple exchangability model on which the kappa method rests. However, to make progress toward forensic calculation of Y haplotype mixture evidence a different tack is needed. The "Laplace distribution" model of Andersen et al. [3] which estimates haplotype frequencies by identifying haplotype clusters in population data looks useful.

[1]  Suling Zhu,et al.  The genetic legacy of the Mongols. , 2003, American journal of human genetics.

[2]  W. Ewens The sampling theory of selectively neutral alleles. , 1972, Theoretical population biology.

[3]  Charles H Brenner,et al.  Fundamental problem of forensic mathematics--the evidential value of a rare haplotype. , 2010, Forensic science international. Genetics.

[4]  William G. Hill,et al.  The Evaluation of Forensic DNA Evidence. By Committee on DNA Forensic Science: an Update, National Research Council. National Academy Press, 1996. 254 pages. Price £30.95, hard cover. ISBN 0 309 05395 1. , 1997 .

[5]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[6]  Weibo Liang,et al.  Population genetics for 17 Y-STR loci(AmpFISTR®Y-filerTM) in Luzhou Han ethnic group. , 2013, Forensic science international. Genetics.

[7]  Mikkel Meyer Andersen,et al.  Y-STR Frequency Surveying Method: A critical reappraisal. , 2011, Forensic science international. Genetics.

[8]  J S Buckleton,et al.  The interpretation of lineage markers in forensic DNA testing. , 2011, Forensic science international. Genetics.

[9]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[10]  M. Krawczak Forensic evaluation of Y-STR haplotype matches: a comment. , 2001, Forensic science international.

[11]  Bradley Efron,et al.  Modern science and the Bayesian-frequentist controversy , 2005 .

[12]  P. S. Eriksen,et al.  The discrete Laplace exponential family and estimation of Y-STR haplotype frequencies. , 2013, Journal of theoretical biology.

[13]  V. Pascali,et al.  A new method for the evaluation of matches in non-recombining genomes: application to Y-chromosomal short tandem repeat (STR) haplotypes in European males. , 2000, Forensic science international.

[14]  C. Cockerham,et al.  VARIANCE OF GENE FREQUENCIES , 1969, Evolution; international journal of organic evolution.

[15]  J. Mill A system of logic, ratiocinative and inductive : being a connected view of the principles of evidence, and the methods of scientific investigation. , 2011 .

[16]  M Slatkin,et al.  An exact test for neutrality based on the Ewens sampling distribution. , 1994, Genetical research.

[17]  M. Krawczak,et al.  Estimating trace-suspect match probabilities for singleton Y-STR haplotypes using coalescent theory. , 2013, Forensic science international. Genetics.

[18]  M Slatkin,et al.  A correction to the exact test based on the Ewens sampling distribution. , 1996, Genetical research.

[19]  H. Robbins Estimating the Total Probability of the Unobserved Outcomes of an Experiment , 1968 .

[20]  A. Veldman Evidential strength of Y-STR haplotype matches , 2007 .

[21]  GianCarlo Ghirardi,et al.  Quantum dynamical reduction and reality: Replacing probability densities with densities in real space , 1996, Erkenntnis.

[22]  “Mises redux” — Redux: Fifteen arguments against finite frequentism , 1996, Erkenntnis.