Analyzing Low-Level mtDNA Heteroplasmy—Pitfalls and Challenges from Bench to Benchmarking

Massive parallel sequencing technologies are promising a highly sensitive detection of low-level mutations, especially in mitochondrial DNA (mtDNA) studies. However, processes from DNA extraction and library construction to bioinformatic analysis include several varying tasks. Further, there is no validated recommendation for the comprehensive procedure. In this study, we examined potential pitfalls on the sequencing results based on two-person mtDNA mixtures. Therefore, we compared three DNA polymerases, six different variant callers in five mixtures between 50% and 0.5% variant allele frequencies generated with two different amplification protocols. In total, 48 samples were sequenced on Illumina MiSeq. Low-level variant calling at the 1% variant level and below was performed by comparing trimming and PCR duplicate removal as well as six different variant callers. The results indicate that sensitivity, specificity, and precision highly depend on the investigated polymerase but also vary based on the analysis tools. Our data highlight the advantage of prior standardization and validation of the individual laboratory setup with a DNA mixture model. Finally, we provide an artificial heteroplasmy benchmark dataset that can help improve somatic variant callers or pipelines, which may be of great interest for research related to cancer and aging.

[1]  F. Kronenberg,et al.  Profiling of Mitochondrial DNA Heteroplasmy in a Prospective Oral Squamous Cell Carcinoma Study , 2020, Cancers.

[2]  F. Kronenberg,et al.  OXPHOS remodeling in high-grade prostate cancer involves mtDNA mutations and increased succinate oxidation , 2020, Nature Communications.

[3]  M. C. Arias,et al.  Fidelity of DNA polymerases in the detection of intraindividual variation of mitochondrial DNA , 2019, Mitochondrial DNA Part B: Resources.

[4]  Arslan A. Zaidi,et al.  Bottleneck and selection in the germline and maternal age influence transmission of mitochondrial DNA in human pedigrees , 2019, Proceedings of the National Academy of Sciences.

[5]  Stacey Hume,et al.  CCMG practice guideline: laboratory guidelines for next-generation sequencing , 2019, Journal of Medical Genetics.

[6]  C. Frezza,et al.  Mitochondrial DNA: the overlooked oncogenome? , 2019, BMC Biology.

[7]  T. Godfrey,et al.  Impact of Polymerase Fidelity on Background Error Rates in Next-Generation Sequencing with Unique Molecular Identifiers/Barcodes , 2019, Scientific Reports.

[8]  J. Lee,et al.  Detection of Innate and Artificial Mitochondrial DNA Heteroplasmy by Massively Parallel Sequencing: Considerations for Analysis , 2018, Journal of Korean medical science.

[9]  Bin Zhu,et al.  Comparing the performance of selected variant callers using synthetic data and genome segmentation , 2018, BMC Bioinformatics.

[10]  M. Nagy,et al.  Validation of haplotype-specific extraction for separating a mitochondrial DNA model mixture and application to simulated casework. , 2018, Forensic science international. Genetics.

[11]  Jesse J. Salk,et al.  Enhancing the accuracy of next-generation sequencing for detecting rare and subclonal mutations , 2018, Nature Reviews Genetics.

[12]  Jia Gu,et al.  fastp: an ultra-fast all-in-one FASTQ preprocessor , 2018, bioRxiv.

[13]  J. Lee,et al.  Assessment of mitochondrial DNA heteroplasmy detected on commercial panel using MPS system with artificial mixture samples , 2017, International Journal of Legal Medicine.

[14]  Jennifer D. Churchill,et al.  Parsing apart the contributors of mitochondrial DNA mixtures with massively parallel sequencing data , 2017 .

[15]  Mauricio O. Carneiro,et al.  Scaling accurate genetic variant discovery to tens of thousands of samples , 2017, bioRxiv.

[16]  Nuno A. Fonseca,et al.  Comprehensive molecular characterization of mitochondrial genomes in human cancers , 2017, bioRxiv.

[17]  Christoph Endrullat,et al.  Standardization and quality management in next-generation sequencing , 2016, Applied & translational genomics.

[18]  Günther Specht,et al.  mtDNA-Server: next-generation sequencing data analysis of human mitochondrial DNA in the cloud , 2016, Nucleic Acids Res..

[19]  Hans-Jürgen Bandelt,et al.  HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing , 2016, Nucleic Acids Res..

[20]  Umer Zeeshan Ijaz,et al.  Illumina error profiles: resolving fine-scale variation in metagenomic sequencing data , 2016, BMC Bioinformatics.

[21]  M. Stoneking,et al.  Age-Related and Heteroplasmy-Related Variation in Human mtDNA Copy Number , 2016, bioRxiv.

[22]  O. Hofmann,et al.  VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research , 2016, Nucleic acids research.

[23]  Mannis van Oven,et al.  PhyloTree Build 17: Growing the human mitochondrial DNA tree , 2015 .

[24]  Rebecca S. Just,et al.  Mitochondrial DNA heteroplasmy in the emerging field of massively parallel sequencing , 2015, Forensic science international. Genetics.

[25]  Lukas Forer,et al.  Validation of Next-Generation Sequencing of Entire Mitochondrial Genomes and the Diversity of Mitochondrial DNA Mutations in Oral Squamous Cell Carcinoma , 2015, PloS one.

[26]  T. Kivisild,et al.  Maternal ancestry and population history from whole mitochondrial genomes , 2015, Investigative Genetics.

[27]  J. Leonard,et al.  Effect of the enzyme and PCR conditions on the quality of high-throughput DNA sequencing results , 2015, Scientific Reports.

[28]  Mitchell M Holland,et al.  Development and assessment of an optimized next-generation DNA sequencing approach for the mtgenome using the Illumina MiSeq. , 2014, Forensic science international. Genetics.

[29]  Walther Parson,et al.  Questioning the prevalence and reliability of human mitochondrial DNA heteroplasmy from massively parallel sequencing data , 2014, Proceedings of the National Academy of Sciences.

[30]  Jian Lu,et al.  Reply to Just et al.: Mitochondrial DNA heteroplasmy could be reliably detected with massively parallel sequencing technologies , 2014, Proceedings of the National Academy of Sciences.

[31]  Yunfei Guo,et al.  Long-range PCR in next-generation sequencing: comparison of six enzymes and evaluation on the MiSeq sequencer , 2014, Scientific Reports.

[32]  Jian Lu,et al.  Extensive pathogenicity of mitochondrial heteroplasmy in healthy human individuals , 2014, Proceedings of the National Academy of Sciences.

[33]  Q. Cai,et al.  Very low-level heteroplasmy mtDNA variations are inherited in humans. , 2013, Journal of genetics and genomics = Yi chuan xue bao.

[34]  Philip Quirke,et al.  Accurately Identifying Low‐Allelic Fraction Variants in Single Samples with Next‐Generation Sequencing: Applications in Tumor Subclone Resolution , 2013, Human mutation.

[35]  P. Puigserver,et al.  Mitochondrial biogenesis through activation of nuclear signaling proteins. , 2013, Cold Spring Harbor perspectives in biology.

[36]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[37]  David C. Samuels,et al.  Universal heteroplasmy of human mitochondrial DNA , 2012, Human molecular genetics.

[38]  A. Wilm,et al.  LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets , 2012, Nucleic acids research.

[39]  Ramesh Hariharan,et al.  Next-Generation Sequencing of Human Mitochondrial Reference Genomes Uncovers High Heteroplasmy Frequency , 2012, PLoS Comput. Biol..

[40]  Gabor T. Marth,et al.  Haplotype-based variant detection from short-read sequencing , 2012, 1207.3907.

[41]  Christopher A. Miller,et al.  VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. , 2012, Genome research.

[42]  D. Samuels,et al.  Somatic mitochondrial DNA mutations in cancer escape purifying selection and high pathogenicity mutations lead to the oncocytic phenotype: pathogenicity analysis of reported somatic mtDNA mutations in tumors , 2012, BMC Cancer.

[43]  M. Holland,et al.  Second generation sequencing allows for mtDNA mixture deconvolution and high resolution detection of heteroplasmy , 2011, Croatian medical journal.

[44]  Heng Li,et al.  Improving SNP discovery by base alignment quality , 2011, Bioinform..

[45]  F. Kronenberg,et al.  Somatic mutations throughout the entire mitochondrial genome are associated with elevated PSA levels in prostate cancer patients. , 2010, American journal of human genetics.

[46]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[47]  Mark Stoneking,et al.  Detecting heteroplasmy from high-throughput sequencing of complete human mitochondrial DNA genomes. , 2010, American journal of human genetics.

[48]  Sha Tang,et al.  Characterization of mitochondrial DNA heteroplasmy using a parallel sequencing system. , 2010, BioTechniques.

[49]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[50]  T. Parsons,et al.  Investigation of Heteroplasmy in the Human Mitochondrial DNA Control Region: A Synthesis of Observations from More Than 5000 Global Population Samples , 2009, Journal of Molecular Evolution.

[51]  W. Parson,et al.  Sequencing strategy for the whole mitochondrial genome resulting in high quality sequences , 2009, BMC Genomics.

[52]  Hans-Jürgen Bandelt,et al.  Phantom mutation hotspots in human mitochondrial DNA , 2005, Electrophoresis.

[53]  W. Parson,et al.  Mitochondrial DNA heteroplasmy or artefacts—a matter of the amplification strategy? , 2003, International Journal of Legal Medicine.

[54]  D. Turnbull,et al.  Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA , 1999, Nature Genetics.

[55]  P. Ivanov,et al.  Mitochondrial DNA sequence heteroplasmy in the Grand Duke of Russia Georgij Romanov establishes the authenticity of the remains of Tsar Nicholas II , 1996, Nature Genetics.