toaSTR: A web application for forensic STR genotyping by massively parallel sequencing.

Massively parallel sequencing (MPS) is emerging within the forensic community as a promising technique for high-resolution short tandem repeat (STR) genotyping, discovering both length and sequence polymorphisms. Conversely, the application of MPS to routine casework poses new challenges to the DNA analyst in view of the complex sequence data that is generated with this technology. We developed the web application toaSTR to help forensic experts work with MPS data simply and efficiently. An intuitive graphical user interface guides through the STR genotyping workflow. This versatile software handles data from various popular MPS platforms and supports both commercial and in-house multiplex PCR kits. Users can define locus-specific stutter thresholds and create custom sets of STR markers to be analyzed. toaSTR's innovative sequence-based stutter model predicts and identifies common stutter variants. The algorithm automatically differentiates biological (iso-)alleles from stutter and other artefacts to assist the interpretation of mixed samples. toaSTR features a comprehensive data visualization with interactive diagrams and a dynamic tabular overview of sequence observations. The software provides an interface to biostatistics tools and enables PDF result export in compliance with the sequence notation recommended by the International Society for Forensic Genetics (ISFG). An initial compatibility and concordance study confirmed the software's independent functionality and precise allele calling with data of different MPS platforms, STR amplification kits, and library preparation chemistries. Discussion of genotyping results for single source and mixed samples demonstrates toaSTR's advantages and includes suggestions for future MPS software development. The beta version of toaSTR is freely accessible at www.toastr.online.

[1]  P. Walsh,et al.  Sequence analysis and characterization of stutter products at the tetranucleotide repeat locus vWA. , 1996, Nucleic acids research.

[2]  Yao-Yuan Liu,et al.  A review of bioinformatic methods for forensic DNA analyses. , 2018, Forensic science international. Genetics.

[3]  Feng Liu,et al.  Evaluation of the Early Access STR Kit v1 on the Ion Torrent PGM™ platform. , 2016, Forensic science international. Genetics.

[4]  S. Rosset,et al.  lobSTR: A short tandem repeat profiler for personal genomes , 2012, RECOMB.

[5]  Jo-Anne Bright,et al.  Characterising the STR locus D6S1043 and examination of its effect on stutter rates. , 2014, Forensic science international. Genetics.

[6]  Titia Sijen,et al.  FDSTools: A software package for analysis of massively parallel sequencing data with the ability to recognise and correct STR stutter and other PCR or sequencing noise. , 2017, Forensic science international. Genetics.

[7]  Bruce Budowle,et al.  Fast STR allele identification with STRait Razor 3.0. , 2017, Forensic science international. Genetics.

[8]  Brian Bushnell,et al.  BBMerge – Accurate paired shotgun read merging via overlap , 2017, PloS one.

[9]  Steven Salzberg,et al.  BIOINFORMATICS ORIGINAL PAPER , 2004 .

[10]  Bruce Budowle,et al.  Investigation of the STR loci noise distributions of PowerSeq™ Auto System , 2017, Croatian medical journal.

[11]  Bruce Budowle,et al.  Massively parallel sequencing of forensic STRs: Considerations of the DNA commission of the International Society for Forensic Genetics (ISFG) on minimal nomenclature requirements. , 2016, Forensic science international. Genetics.

[12]  Jing Liu,et al.  Massively parallel sequencing of 32 forensic markers using the Precision ID GlobalFiler™ NGS STR Panel and the Ion PGM™ System. , 2017, Forensic science international. Genetics.

[13]  Niels Morling,et al.  Next generation sequencing and its applications in forensic genetics. , 2015, Forensic science international. Genetics.

[14]  Bruce Budowle,et al.  European survey on forensic applications of massively parallel sequencing. , 2017, Forensic science international. Genetics.

[15]  Peter M Vallone,et al.  STR allele sequence variation: Current knowledge and future issues. , 2015, Forensic science international. Genetics.

[16]  Bruce Budowle,et al.  Evaluation of the Illumina(®) Beta Version ForenSeq™ DNA Signature Prep Kit for use in genetic profiling. , 2016, Forensic science international. Genetics.

[17]  Bruce Budowle,et al.  Characterization of genetic sequence variation of 58 STR loci in four major population groups. , 2016, Forensic science international. Genetics.

[18]  Jo-Anne Bright,et al.  Characterising stutter in forensic STR multiplexes. , 2012, Forensic science international. Genetics.

[19]  Peter Wiegand,et al.  Polymerase slippage in relation to the uniformity of tetrameric repeat stretches. , 2003, Forensic science international.

[20]  Bruce Budowle,et al.  A technique for setting analytical thresholds in massively parallel sequencing-based forensic DNA analysis , 2017, PloS one.

[21]  Bruce Budowle,et al.  An evaluation of the PowerSeq™ Auto System: A multiplex short tandem repeat marker kit compatible with massively parallel sequencing. , 2015, Forensic science international. Genetics.

[22]  C. Quince,et al.  Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform , 2015, Nucleic acids research.

[23]  Douglas R Storts,et al.  Massively parallel sequencing of short tandem repeats-Population data and mixture analysis results for the PowerSeq™ system. , 2016, Forensic science international. Genetics.

[24]  Peter M Vallone,et al.  Sequence variation of 22 autosomal STR loci detected by next generation sequencing. , 2016, Forensic science international. Genetics.

[25]  Philip Hugenholtz,et al.  Shining a Light on Dark Sequencing: Characterising Errors in Ion Torrent PGM Data , 2013, PLoS Comput. Biol..