Massively parallel sequencing of forensic STRs: Considerations of the DNA commission of the International Society for Forensic Genetics (ISFG) on minimal nomenclature requirements.

The DNA Commission of the International Society for Forensic Genetics (ISFG) is reviewing factors that need to be considered ahead of the adoption by the forensic community of short tandem repeat (STR) genotyping by massively parallel sequencing (MPS) technologies. MPS produces sequence data that provide a precise description of the repeat allele structure of a STR marker and variants that may reside in the flanking areas of the repeat region. When a STR contains a complex arrangement of repeat motifs, the level of genetic polymorphism revealed by the sequence data can increase substantially. As repeat structures can be complex and include substitutions, insertions, deletions, variable tandem repeat arrangements of multiple nucleotide motifs, and flanking region SNPs, established capillary electrophoresis (CE) allele descriptions must be supplemented by a new system of STR allele nomenclature, which retains backward compatibility with the CE data that currently populate national DNA databases and that will continue to be produced for the coming years. Thus, there is a pressing need to produce a standardized framework for describing complex sequences that enable comparison with currently used repeat allele nomenclature derived from conventional CE systems. It is important to discern three levels of information in hierarchical order (i) the sequence, (ii) the alignment, and (iii) the nomenclature of STR sequence data. We propose a sequence (text) string format the minimal requirement of data storage that laboratories should follow when adopting MPS of STRs. We further discuss the variant annotation and sequence comparison framework necessary to maintain compatibility among established and future data. This system must be easy to use and interpret by the DNA specialist, based on a universally accessible genome assembly, and in place before the uptake of MPS by the general forensic community starts to generate sequence data on a large scale. While the established nomenclature for CE-based STR analysis will remain unchanged in the future, the nomenclature of sequence-based STR genotypes will need to follow updated rules and be generated by expert systems that translate MPS sequences to match CE conventions in order to guarantee compatibility between the different generations of STR data.

[1]  R. Chakraborty,et al.  Automated analysis of sequence polymorphism in STR alleles by PCR and direct electrospray ionization mass spectrometry. , 2012, Forensic science international. Genetics.

[2]  B Brinkmann,et al.  Report of the European DNA profiling group (EDNAP)--towards standardisation of short tandem repeat (STR) loci. , 1994, Forensic science international.

[3]  Niels Morling,et al.  Second generation sequencing of three STRs D3S1358, D12S391 and D21S11 in Danes and a new nomenclature for sequenced STR alleles. , 2014, Forensic science international. Genetics.

[4]  Walther Parson,et al.  EMPOP--a forensic mtDNA database. , 2007, Forensic science international. Genetics.

[5]  Herbert Oberacher,et al.  Increasing the discrimination power of forensic STR testing by employing high-performance mass spectrometry, as illustrated in indigenous South African and Central Asian populations , 2010, International Journal of Legal Medicine.

[6]  B Brinkmann,et al.  DNA commission of the international society for forensic genetics: guidelines for mitochondrial DNA typing. , 2000, Forensic science international.

[7]  P. Knijff,et al.  Forensic nomenclature for short tandem repeats updated for sequencing , 2015 .

[8]  Niels Morling,et al.  Next generation sequencing and its applications in forensic genetics. , 2015, Forensic science international. Genetics.

[9]  W R Mayr,et al.  DNA Commission of the International Society for Forensic Genetics: revised and extended guidelines for mitochondrial DNA typing. , 2014, Forensic science international. Genetics.

[10]  Yali Xue,et al.  Variation of 52 new Y-STR loci in the Y Chromosome Consortium worldwide panel of 76 diverse individuals , 2006, International Journal of Legal Medicine.

[11]  J. Butler,et al.  Genetics and Genomics of Core Short Tandem Repeat Loci Used in Human Identity Testing , 2006, Journal of forensic sciences.

[12]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[13]  P. Lincoln DNA recommendations--further report of the DNA Commission of the ISFH regarding the use of short tandem repeat systems. , 1997, Forensic science international.

[14]  A. Carracedo,et al.  DNA commission of the international society for forensic genetics: guidelines for mitochondrial DNA typing. , 2000, Forensic science international.

[15]  Rebecca Just,et al.  Short tandem repeat typing on the 454 platform: strategies and considerations for targeted sequencing of common forensic markers. , 2014, Forensic science international. Genetics.

[16]  P. Gill,et al.  The recombination landscape around forensic STRs: Accurate measurement of genetic distances between syntenic STR pairs using HapMap high density SNP data. , 2012, Forensic science international. Genetics.

[17]  N. Morling,et al.  Second generation sequencing of three STRs D 3 S 1358 , D 12 S 391 and D 21 S 11 in Danes and a new nomenclature for sequenced STR alleles , 2017 .

[18]  B Budowle,et al.  Population data on the thirteen CODIS core short tandem repeat loci in African Americans, U.S. Caucasians, Hispanics, Bahamians, Jamaicans, and Trinidadians. , 1999, Journal of forensic sciences.

[19]  C. H. Becker,et al.  Reliable genotyping of short tandem repeat loci without an allelic ladder using time-of-flight mass spectrometry , 1998, International Journal of Legal Medicine.

[20]  Walther Parson,et al.  SAM: String-based sequence search algorithm for mitochondrial DNA database queries , 2011, Forensic science international. Genetics.

[21]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[22]  Bruce Budowle,et al.  STRait Razor: a length-based forensic STR allele-calling tool for use with second generation sequencing data. , 2013, Forensic science international. Genetics.

[23]  D. Turnbull,et al.  Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA , 1999, Nature Genetics.

[24]  D. Deforce,et al.  Forensic STR analysis using massive parallel sequencing. , 2012, Forensic science international. Genetics.

[25]  Lincoln Pj,et al.  DNA RECOMMENDATIONS : FURTHER REPORT OF THE DNA COMMISSION OF THE ISFH REGARDING THE USE OF SHORT TANDEM REPEAT SYSTEMS , 1997 .

[26]  Hans-Jürgen Bandelt,et al.  Harvesting the fruit of the human mtDNA tree. , 2006, Trends in genetics : TIG.

[27]  J. Butler,et al.  A 26plex Autosomal STR Assay to Aid Human Identity Testing * † , 2009, Journal of forensic sciences.

[28]  N. Morling,et al.  Characterization of mutations and sequence variants in the D21S11 locus by next generation sequencing. , 2014, Forensic science international. Genetics.

[29]  Douglas R Hares,et al.  Selection and implementation of expanded CODIS core loci in the United States. , 2015, Forensic science international. Genetics.

[30]  F. Sanger,et al.  Sequence and organization of the human mitochondrial genome , 1981, Nature.

[31]  Niels Morling,et al.  High-throughput sequencing of core STR loci for forensic genetic investigations using the Roche Genome Sequencer FLX platform. , 2011, BioTechniques.

[32]  L. Gusmão,et al.  New sequence variants detected at DXS10148, DXS10074 and DXS10134 loci. , 2016, Forensic science international. Genetics.

[33]  W. Parson,et al.  The next generation of DNA profiling – STR typing by multiplexed PCR – ion‐pair RP LC–ESI time‐of‐flight MS , 2008, Electrophoresis.

[34]  A. Carracedo,et al.  DNA recommendations , 1997, International Journal of Legal Medicine.

[35]  T. M. Clayton,et al.  The validation of short tandem repeat (STR) loci for use in forensic casework , 2005, International Journal of Legal Medicine.

[36]  M. Holland,et al.  A cautionary note on switching mitochondrial DNA reference sequences in forensic genetics. , 2012, Forensic science international. Genetics.

[37]  Peter M Vallone,et al.  Sequence variation of 22 autosomal STR loci detected by next generation sequencing. , 2016, Forensic science international. Genetics.

[38]  L. Jin,et al.  Identification of repeat sequence heterogeneity at the polymorphic short tandem repeat locus HUMTH01[AATG]n and reassignment of alleles in population analysis by using a locus-specific allelic ladder. , 1993, American journal of human genetics.

[39]  Peter M Schneider,et al.  Scientific standards for studies in forensic genetics. , 2007, Forensic science international.

[40]  P Gill,et al.  Development of guidelines to designate alleles using an STR multiplex system. , 1997, Forensic science international.

[41]  W Parson,et al.  Inter-laboratory evaluation of SNP-based forensic identification by massively parallel sequencing using the Ion PGM™. , 2015, Forensic science international. Genetics.

[42]  W Parson,et al.  European Network of Forensic Science Institutes (ENFSI): Evaluation of new commercial STR multiplexes that include the European Standard Set (ESS) of markers. , 2012, Forensic science international. Genetics.

[43]  Saharon Rosset,et al.  A "Copernican" reassessment of the human mitochondrial DNA tree from its root. , 2012, American journal of human genetics.

[44]  Hans-Jürgen Bandelt,et al.  The case for the continuing use of the revised Cambridge Reference Sequence (rCRS) and the standardization of notation in human mitochondrial DNA studies , 2013, Journal of Human Genetics.

[45]  J. Butler,et al.  Concordance Study Between the AmpFℓSTR® MiniFilerTM PCR Amplification Kit and Conventional STR Typing Kits * , 2007, Journal of forensic sciences.

[46]  C. Watkins,et al.  Tetranucleotide repeat polymorphism at the human beta-actin related pseudogene 2 (ACTBP2) detected using the polymerase chain reaction. , 1991, Nucleic acids research.

[47]  P. Gill,et al.  Report of the European DNA Profiling Group (EDNAP)--an investigation of the hypervariable STR loci ACTBP2, APOAI1 and D11S554 and the compound loci D12S391 and D1S1656. , 1998, Forensic science international.

[48]  B Budowle,et al.  Concordance study on population database samples using the PowerPlex 16 kit and AmpFlSTR Profiler Plus kit and AmpFlSTR COfiler kit. , 2001, Journal of forensic sciences.

[49]  Peter M Vallone,et al.  STR allele sequence variation: Current knowledge and future issues. , 2015, Forensic science international. Genetics.

[50]  W. Parson,et al.  Massively parallel sequencing of complete mitochondrial genomes from hair shaft samples. , 2015, Forensic science international. Genetics.