NOCIt: a computational method to infer the number of contributors to DNA samples analyzed by STR genotyping.

Repetitive sequences in the human genome called short tandem repeats (STRs) are used in human identification for forensic purposes. Interpreting DNA profiles generated using STRs is often difficult because the number of contributors to the sample is uncertain. Existing methods for estimating the number of contributors rely on the number of peaks observed and/or allele frequencies. We have developed a computational method called NOCIt that calculates the a posteriori probability (APP) of the number of contributors. NOCIt uses single-source calibration data consisting of known genotypes to compute the APP for an unknown sample. The method takes into account signal peak heights, population allele frequencies, allele dropout, and stutter, a commonly occurring PCR artifact. We tested the performance of NOCIt using 278 experimental and 40 simulated DNA mixtures consisting of one to five contributors with total DNA mass from 0.016 to 0.25 ng. NOCIt correctly identified the number of contributors in 83% of the experimental samples and in 85% of the simulated mixtures, whereas the best pre-existing method achieved an accuracy of 72% for the experimental samples and 73% for the simulated mixtures. Moreover, NOCIt calculated the APP for the true number of contributors to be at least 1% in 95% of the experimental samples and in all the simulated mixtures.
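To make the idea of an APP over the number of contributors concrete, the following is a minimal sketch of the final normalization step only, not NOCIt's implementation. It assumes that some model has already produced a log-likelihood of the observed profile for each candidate number of contributors; how NOCIt computes those likelihoods from peak heights, allele frequencies, dropout, and stutter is not described here. The function name `posterior_over_contributors`, the uniform default prior, and the example log-likelihood values are all hypothetical.

```python
import math


def posterior_over_contributors(log_likelihoods, prior=None):
    """Turn per-candidate log-likelihoods into posterior probabilities.

    log_likelihoods: dict mapping a candidate number of contributors n
        to log Pr(evidence | n). These values are assumed to come from
        some external model (hypothetical here).
    prior: optional dict mapping n to a strictly positive prior
        probability; defaults to a uniform prior (an assumption).
    """
    candidates = sorted(log_likelihoods)
    if prior is None:
        prior = {n: 1.0 / len(candidates) for n in candidates}

    # Bayes' rule in log space: log Pr(E | n) + log Pr(n).
    log_joint = {n: log_likelihoods[n] + math.log(prior[n]) for n in candidates}

    # Subtract the maximum before exponentiating to avoid underflow.
    peak = max(log_joint.values())
    weights = {n: math.exp(value - peak) for n, value in log_joint.items()}
    total = sum(weights.values())
    return {n: weight / total for n, weight in weights.items()}


# Made-up log-likelihoods for one to five contributors (illustration only).
example_log_likelihoods = {1: -120.0, 2: -95.0, 3: -93.5, 4: -97.0, 5: -104.0}
app = posterior_over_contributors(example_log_likelihoods)
best = max(app, key=app.get)  # candidate with the highest posterior
```

Reporting the whole distribution `app`, and not only `best`, matches the way the results above are stated: accuracy is measured on the most probable number of contributors, while the "at least 1%" figure refers to the probability assigned to the true number.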
