Programming Scripts for Simple and Complex Paternity Testing based on Open-Source Programming Language from the R Project

Programming scripts were written for the statistical analysis of genetic data from simple cases and complex cases of undetermined paternity. The methods presented here involve algorithms constructed with R, an open-source and increasingly popular programming language used for calculations and statistics; these methods also involve conditional probability analysis, Bayes’ Theorem, and pedigree analysis. Previous computer programs for assessing probable paternity in complex cases of undetermined paternity have been written; however, only minimal or generalized formulas are described in the papers presenting these programs. Therefore, these previously published programs are difficult to understand for most forensic researchers. Here, we present the details of the calculations used to evaluate probabilities of paternity and the details of the R scripts used execute these calculations. These scripts were constructed not only for standard trio case where DNA typing of the mother, child, and the alleged father are available, but also for more complex cases where DNA typing of the alleged father is absent. In these more complex cases, the putative genotype of the alleged father is determined from the genotypes of his parents, his siblings, his wife, children known to be his biological children, or some combination of these people. This report provides concrete and orderly descriptions of the calculations and the R scripts so that each facet of this method is easily understood. Furthermore, access to these scripts will enable individual researchers to develop calculation systems of their own.

[1]  Wing K Fung User-friendly programs for easy calculations in paternity testing and kinship determinations. , 2003, Forensic science international.

[2]  Paul Teetor,et al.  R Cookbook , 2011 .

[3]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[4]  Niels Morling,et al.  ISFG: Recommendations on biostatistics in paternity testing. , 2007, Forensic science international. Genetics.

[5]  C. Brenner Symbolic kinship program. , 1997, Genetics.

[6]  R R Gomes,et al.  PedExpert: a computer program for the application of Bayesian networks to human paternity testing. , 2009, Genetics and molecular research : GMR.

[7]  H Shiono,et al.  [Investigation of algorithm for the calculation of probability of paternity likelihood using personal computer program, including the application to parentage testing in the decreased party]. , 1992, Nihon hoigaku zasshi = The Japanese journal of legal medicine.

[8]  J Drábek,et al.  Validation of software for calculating the likelihood ratio for parentage and kinship. , 2009, Forensic science international. Genetics.

[9]  Erik Essen-möller,et al.  Zur Theorie des Vaterschaftsnachweises auf Grund von Ähnlichkeitsbefunden , 1939, Deutsche Zeitschrift für die gesamte gerichtliche Medizin.

[10]  M Krawczak,et al.  A genetic factor model for the statistical analysis of multilocus DNA fingerprints , 1992, Electrophoresis.

[11]  H. Gurtler Principles of blood-group statistical evaluation of paternity cases at the University Institute of Forensic Medicine, Copenhagen. , 1956, Acta medicinae legalis et socialis.

[12]  T Egeland,et al.  A computerised method for calculating the probability of pedigrees from genetic data. , 1997, Science & justice : journal of the Forensic Science Society.

[13]  Jarosław Berent DNAStat, version 2.1--a computer program for processing genetic profile databases and biostatistical calculations. , 2010, Archiwum medycyny sadowej i kryminologii.

[14]  Jessica McDonald,et al.  Forensic DNA Analysis , 2012, American Society for Clinical Laboratory Science.

[15]  T Egeland,et al.  Beyond traditional paternity and identification cases. Selecting the most probable pedigree. , 2000, Forensic science international.

[16]  Dw Van Boxel,et al.  Probabilistic Expert Systems for Forensic Inference from Genetic Markers , 2002 .

[17]  Hon Keung Tony Ng,et al.  Statistics: An Introduction Using R , 2006, Technometrics.

[18]  K Hummel On the theory and practice of Essen-Möller's W value and Gürtler's paternity index (PI). , 1984, Forensic science international.

[19]  Thore Egeland,et al.  FamLink--a user friendly software for linkage calculations in family genetics. , 2012, Forensic science international. Genetics.

[20]  J. Riancho,et al.  A Windows-based software for common paternity and sibling analyses. , 2003, Forensic science international.

[21]  M Nata,et al.  [Spreadsheets of a conventional application software for calculation of plausibility of paternity: application to parentage testing with highly polymorphic markers in deceased party]. , 1997, Nihon hoigaku zasshi = The Japanese journal of legal medicine.