Development and assessment of an optimized next-generation DNA sequencing approach for the mtgenome using the Illumina MiSeq.

The development of molecular tools to detect and report mitochondrial DNA (mtDNA) heteroplasmy will increase the discrimination potential of the testing method when applied to forensic cases. The inherent limitations of the current state-of-the-art, Sanger-based sequencing, including constrictions in speed, throughput, and resolution, have hindered progress in this area. With the advent of next-generation sequencing (NGS) approaches, it is now possible to clearly identify heteroplasmic variants, and at a much lower level than previously possible. However, in order to bring these approaches into forensic laboratories and subsequently as accepted scientific information in a court of law, validated methods will be required to produce and analyze NGS data. We report here on the development of an optimized approach to NGS analysis for the mtDNA genome (mtgenome) using the Illumina MiSeq instrument. This optimized protocol allows for the production of more than 5 gigabases of mtDNA sequence per run, sufficient for detection and reliable reporting of minor heteroplasmic variants down to approximately 0.5-1.0% when multiplexing twelve samples. Depending on sample throughput needs, sequence coverage rates can be set at various levels, but were optimized here for at least 5000 reads. In addition, analysis parameters are provided for a commercially available software package that identify the highest quality sequencing reads and effectively filter out sequencing-based noise. With this method it will be possible to measure the rates of low-level heteroplasmy across the mtgenome, evaluate the transmission of heteroplasmy between the generations of maternal lineages, and assess the drift of variant sequences between different tissue types within an individual.

[1]  Walther Parson,et al.  Evaluation of next generation mtGenome sequencing using the Ion Torrent Personal Genome Machine (PGM)☆ , 2013, Forensic science international. Genetics.

[2]  J. Wakeley Substitution rate variation among sites in hypervariable region 1 of human mitochondrial DNA , 1993, Journal of Molecular Evolution.

[3]  E. Lander,et al.  Genomic mapping by fingerprinting random clones: a mathematical analysis. , 1988, Genomics.

[4]  Timothy B. Stockwell,et al.  Evaluation of next generation sequencing platforms for population targeted sequencing studies , 2009, Genome Biology.

[5]  Á. Carracedo,et al.  Heteroplasmy in mtDNA and the weight of evidence in forensic mtDNA analysis: a case report , 2001, International Journal of Legal Medicine.

[6]  K. Hawkes,et al.  African populations and the evolution of human mitochondrial DNA. , 1991, Science.

[7]  Niels Morling,et al.  High-throughput sequencing of core STR loci for forensic genetic investigations using the Roche Genome Sequencer FLX platform. , 2011, BioTechniques.

[8]  David C. Samuels,et al.  Universal heteroplasmy of human mitochondrial DNA , 2012, Human molecular genetics.

[9]  P. Ivanov,et al.  Mitochondrial DNA sequence heteroplasmy in the Grand Duke of Russia Georgij Romanov establishes the authenticity of the remains of Tsar Nicholas II , 1996, Nature Genetics.

[10]  D. Dressman,et al.  Heteroplasmic mitochondrial DNA mutations in normal and tumor cells , 2010, Nature.

[11]  M. Holland,et al.  Second generation sequencing allows for mtDNA mixture deconvolution and high resolution detection of heteroplasmy , 2011, Croatian medical journal.

[12]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[13]  Mark Stoneking,et al.  Detecting heteroplasmy from high-throughput sequencing of complete human mitochondrial DNA genomes. , 2010, American journal of human genetics.

[14]  Martin Kircher,et al.  Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform , 2011, Nucleic acids research.

[15]  Dennis C. Friedrich,et al.  A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries , 2011, Genome Biology.

[16]  Jesse J. Salk,et al.  Detection of ultra-rare mutations by next-generation sequencing , 2012, Proceedings of the National Academy of Sciences.

[17]  M. Holland,et al.  Forensic Mitochondrial DNA Analysis: Current Practice and Future Potential. , 2012, Forensic science review.

[18]  D. Turnbull,et al.  Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA , 1999, Nature Genetics.

[19]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[20]  C. Klein,et al.  Sequence error storms and the landscape of mutations in cancer , 2012, Proceedings of the National Academy of Sciences.

[21]  Juliane C. Dohm,et al.  Substantial biases in ultra-short read data sets from high-throughput DNA sequencing , 2008, Nucleic acids research.

[22]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[23]  M. Nei,et al.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. , 1993, Molecular biology and evolution.

[24]  M. Eisen,et al.  Impact of Chromatin Structures on DNA Processing for Genomic Analyses , 2009, PloS one.

[25]  Ralf Bundschuh,et al.  Short-read, high-throughput sequencing technology for STR genotyping. , 2012, BioTechniques. Rapid dispatches.

[26]  F. Sanger,et al.  Sequence and organization of the human mitochondrial genome , 1981, Nature.

[27]  A. Beckett,et al.  AKUFO AND IBARAPA. , 1965, Lancet.

[28]  H. Swerdlow,et al.  A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers , 2012, BMC Genomics.

[29]  B. Freeman,et al.  DNA from Buccal Swabs Recruited by Mail: Evaluation of Storage Effects on Long-Term Stability and Suitability for Multiplex Polymerase Chain Reaction Genotyping , 2003, Behavior genetics.

[30]  D. Altman,et al.  Measuring agreement in method comparison studies , 1999, Statistical methods in medical research.

[31]  T. Dallman,et al.  Performance comparison of benchtop high-throughput sequencing platforms , 2012, Nature Biotechnology.

[32]  Eloisa Arbustini,et al.  Mitochondrial DNA Variant Discovery and Evaluation in Human Cardiomyopathies through Next-Generation Sequencing , 2010, PloS one.

[33]  T. Parsons,et al.  The Use of Mitochondrial DNA Single Nucleotide Polymorphisms to Assist in the Resolution of Three Challenging Forensic Cases , 2009, Journal of forensic sciences.

[34]  Juliane C. Dohm,et al.  Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and Genome Analyzer systems , 2011, Genome Biology.

[35]  T. Parsons,et al.  Investigation of Heteroplasmy in the Human Mitochondrial DNA Control Region: A Synthesis of Observations from More Than 5000 Global Population Samples , 2009, Journal of Molecular Evolution.

[36]  Li Chieh Chen,et al.  Transition to Next Generation Analysis of the Whole Mitochondrial Genome: A Summary of Molecular Defects , 2013, Human mutation.

[37]  Elizabeth M. Ryan,et al.  A scalable, fully automated process for construction of sequence-ready barcoded libraries for 454 , 2010, Genome Biology.

[38]  W. Parson,et al.  Sequencing strategy for the whole mitochondrial genome resulting in high quality sequences , 2009, BMC Genomics.

[39]  S. Morishita,et al.  Efficient frequency-based de novo short-read clustering for error trimming in next-generation sequencing. , 2009, Genome research.

[40]  T. Ozawa,et al.  Automated sequencing of mitochondrial DNA. , 1996, Methods in enzymology.

[41]  Ann-Christine Syvänen,et al.  Next-generation sequencing technologies and applications for human genetic history and forensics , 2011, Investigative Genetics.

[42]  Arne Röhl,et al.  Evaluating length heteroplasmy in the human mitochondrial DNA control region , 2010, International Journal of Legal Medicine.

[43]  Anton Nekrutenko,et al.  Dynamics of mitochondrial heteroplasmy in three families investigated via a repeatable re-sequencing study , 2011, Genome Biology.

[44]  William A. Walters,et al.  Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms , 2012, The ISME Journal.

[45]  D. Altman,et al.  STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT , 1986, The Lancet.