Massively parallel sequencing of short tandem repeats: Population data and mixture analysis results for the PowerSeq™ system

Current forensic DNA analysis predominantly involves identification of human donors by analysis of short tandem repeats (STRs) using capillary electrophoresis (CE). Recent developments in massively parallel sequencing (MPS) technologies offer new possibilities for STR analysis, since they might overcome some of the limitations of CE. In this study, 17 STRs and Amelogenin were sequenced at high coverage using a prototype version of the Promega PowerSeq™ system for 297 population samples from the Netherlands, Nepal, Bhutan and Central African Pygmies. In addition, 45 two-person mixtures with minor contributions down to 1% were analysed to investigate the performance of this system for mixed samples. Regarding fragment length, complete concordance between the MPS and CE-based data was found, supporting the reliability of the PowerSeq™ MPS system. As expected, MPS revealed a broader allele range and yielded a higher power of discrimination and exclusion rate. The high-coverage sequencing data were used to determine stutter characteristics for all loci, and stutter ratios were compared to CE data. Separating alleles that have the same length but exhibit different stutter ratios lowers the overall variation in stutter ratio and helps to differentiate stutters from genuine alleles in mixed samples. All alleles of the minor contributors were detected in the sequence reads, even for the 1% contributions, but analysis of mixtures below 5% without prior information on the mixture ratio is complicated by PCR and sequencing artefacts.
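As an illustration of the stutter ratio mentioned above, the following minimal sketch computes the ratio of reads for an n-1 (back) stutter product to reads for its parent allele. It assumes a simple repeat structure and uses a hypothetical function name (`stutter_ratio`) and made-up read counts; it is not the analysis pipeline used in the study.

```python
def stutter_ratio(read_counts, motif, parent_repeats):
    """Return reads(n-1 stutter) / reads(parent) for a simple-repeat allele.

    read_counts    -- dict mapping a sequence string to its read count
    motif          -- repeat unit, e.g. "AGAT" (assumed simple repeat)
    parent_repeats -- number of repeat units in the parent allele
    """
    parent = motif * parent_repeats
    stutter = motif * (parent_repeats - 1)  # one repeat unit shorter
    parent_reads = read_counts.get(parent, 0)
    if parent_reads == 0:
        raise ValueError("parent allele has no reads")
    return read_counts.get(stutter, 0) / parent_reads


# Hypothetical read counts for illustration only (not data from the study).
reads = {
    "AGAT" * 12: 2000,  # parent allele with 12 repeats
    "AGAT" * 11: 180,   # candidate n-1 stutter product
}
ratio = stutter_ratio(reads, "AGAT", 12)
print(f"stutter ratio: {ratio:.3f}")
```

Because the counts are keyed by sequence and not by length, two alleles of equal length but different sequence would each get their own ratio, which is the separation the abstract describes.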
