STRSeq: A catalog of sequence diversity at human identification Short Tandem Repeat loci.

The STR Sequencing Project (STRSeq) was initiated to facilitate the description of sequence-based alleles at the Short Tandem Repeat (STR) loci targeted in human identification assays. This international collaborative effort, which has been endorsed by the ISFG DNA Commission, provides a framework for communication among laboratories. The initial data used to populate the project are the aggregate alleles observed in targeted sequencing studies across four laboratories: National Institute of Standards and Technology (N=1786), Kings College London (N=1043), University of North Texas Health Sciences Center (N=839), and University of Santiago de Compostela (N=944), for a total of 4612 individuals. STRSeq data are maintained as GenBank records at the U.S. National Center for Biotechnology Information (NCBI), which participates in a daily data exchange with the DNA DataBank of Japan (DDBJ) and the European Nucleotide Archive (ENA). Each GenBank record contains the observed sequence of a STR region, annotation ("bracketing") of the repeat region and flanking region polymorphisms, information regarding the sequencing assay and data quality, and backward compatible length-based allele designation. STRSeq GenBank records are organized within a BioProject at NCBI (https://www.ncbi.nlm.nih.gov/bioproject/380127), which is sub-divided into: commonly used autosomal STRs, alternate autosomal STRs, Y-chromosomal STRs, and X-chromosomal STRs. Each of these categories is further divided into locus-specific BioProjects. The BioProject hierarchy facilitates access to the GenBank records by browsing, BLAST searching, or ftp download. Future plans include user interface tools at strseq.nist.gov, a pathway for submission of additional allele records by laboratories performing population sample sequencing and interaction with the STRidER web portal for quality control (http://strider.online).

[1]  Bruce Budowle,et al.  Massively parallel sequencing of forensic STRs: Considerations of the DNA commission of the International Society for Forensic Genetics (ISFG) on minimal nomenclature requirements. , 2016, Forensic science international. Genetics.

[2]  John M. Butler,et al.  STRBase: a short tandem repeat DNA database for the human identity testing community , 2001, Nucleic Acids Res..

[3]  Bruce Budowle,et al.  Characterization of genetic sequence variation of 58 STR loci in four major population groups. , 2016, Forensic science international. Genetics.

[4]  Michael D Coble,et al.  Characterization of 26 MiniSTR Loci for Improved Analysis of Degraded DNA Samples , 2007, Journal of forensic sciences.

[5]  D. Deforce,et al.  Forensic Loci Allele Database (FLAD): Automatically generated, permanent identifiers for sequenced forensic alleles. , 2016, Forensic science international. Genetics.

[6]  Yaniv Erlich,et al.  Genome-wide profiling of heritable and de novo STR variations , 2016, Nature Methods.

[7]  Yaniv Erlich,et al.  Genome-wide profiling of heritable and de novo STR variations , 2016 .

[8]  Bruce Budowle,et al.  Fast STR allele identification with STRait Razor 3.0. , 2017, Forensic science international. Genetics.

[9]  Niels Morling,et al.  Recommendations of the DNA Commission of the International Society for Forensic Genetics (ISFG) on quality control of autosomal Short Tandem Repeat allele frequency databasing (STRidER). , 2016, Forensic science international. Genetics.

[10]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[11]  Peter M Vallone,et al.  STR allele sequence variation: Current knowledge and future issues. , 2015, Forensic science international. Genetics.

[12]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[13]  Bruce Budowle,et al.  Flanking region variation of ForenSeq™ DNA Signature Prep Kit STR and SNP loci in Yavapai Native Americans. , 2017, Forensic science international. Genetics.

[14]  N. Morling,et al.  Second generation sequencing of three STRs D 3 S 1358 , D 12 S 391 and D 21 S 11 in Danes and a new nomenclature for sequenced STR alleles , 2017 .

[15]  Niels Morling,et al.  Second generation sequencing of three STRs D3S1358, D12S391 and D21S11 in Danes and a new nomenclature for sequenced STR alleles. , 2014, Forensic science international. Genetics.

[16]  John M. Butler,et al.  Advanced Topics in Forensic DNA Typing: Methodology , 2011 .

[17]  Peter M Vallone,et al.  Sequence variation of 22 autosomal STR loci detected by next generation sequencing. , 2016, Forensic science international. Genetics.

[18]  Bruce Budowle,et al.  European survey on forensic applications of massively parallel sequencing. , 2017, Forensic science international. Genetics.

[19]  Douglas R Storts,et al.  Massively parallel sequencing of short tandem repeats-Population data and mixture analysis results for the PowerSeq™ system. , 2016, Forensic science international. Genetics.

[20]  Jocelyne Bruand,et al.  Developmental validation of the MiSeq FGx Forensic Genomics System for Targeted Next Generation Sequencing in Forensic DNA Casework and Database Laboratories. , 2017, Forensic science international. Genetics.

[21]  David L Duewer,et al.  U.S. population data for 29 autosomal STR loci. , 2013, Forensic science international. Genetics.

[22]  C Phillips,et al.  A genomic audit of newly-adopted autosomal STRs for forensic identification. , 2017, Forensic science international. Genetics.