Examining Sources of Error in PCR by Single-Molecule Sequencing

Next-generation sequencing technology has enabled the detection of rare genetic or somatic mutations and contributed to our understanding of disease progression and evolution. However, many next-generation sequencing technologies first rely on DNA amplification, via the Polymerase Chain Reaction (PCR), as part of sample preparation workflows. Mistakes made during PCR appear in sequencing data and contribute to false mutations that can ultimately confound genetic analysis. In this report, a single-molecule sequencing assay was used to comprehensively catalog the different types of errors introduced during PCR, including polymerase misincorporation, structure-induced template-switching, PCR-mediated recombination and DNA damage. In addition to well-characterized polymerase base substitution errors, other sources of error were found to be equally prevalent. PCR-mediated recombination by Taq polymerase was observed at the single-molecule level, and surprisingly found to occur as frequently as polymerase base substitution errors, suggesting it may be an underappreciated source of error for multiplex amplification reactions. Inverted repeat structural elements in lacZ caused polymerase template-switching between the top and bottom strands during replication and the frequency of these events were measured for different polymerases. For very accurate polymerases, DNA damage introduced during temperature cycling, and not polymerase base substitution errors, appeared to be the major contributor toward mutations occurring in amplification products. In total, we analyzed PCR products at the single-molecule level and present here a more complete picture of the types of mistakes that occur during DNA amplification.

[1]  P. Laird Principles and challenges of genome-wide DNA methylation analysis , 2010, Nature Reviews Genetics.

[2]  M. Fogg,et al.  Recognition of the pro-mutagenic base uracil by family B DNA polymerases from archaea. , 2004, Journal of molecular biology.

[3]  Masood Z. Hadi,et al.  Error Rate Comparison during Polymerase Chain Reaction by DNA Polymerase , 2014, Molecular biology international.

[4]  L. Loeb,et al.  8-Hydroxyguanine, an abundant form of oxidative DNA damage, causes G----T and A----C substitutions. , 1992, The Journal of biological chemistry.

[5]  Haixu Tang,et al.  Rate and molecular spectrum of spontaneous mutations in the bacterium Escherichia coli as determined by whole-genome sequencing , 2012, Proceedings of the National Academy of Sciences.

[6]  J. Braman,et al.  PCR fidelity of pfu DNA polymerase and other thermostable DNA polymerases. , 1996, Nucleic acids research.

[7]  Andrew D. Ellington,et al.  Synthetic evolutionary origin of a proofreading reverse transcriptase , 2016, Science.

[8]  Laurence H. Pearl,et al.  Structural basis for uracil recognition by archaeal family B DNA polymerases , 2002, Nature Structural Biology.

[9]  K. Kinzler,et al.  Detection and quantification of rare mutations with massively parallel sequencing , 2011, Proceedings of the National Academy of Sciences.

[10]  Daniel J. G. Lahr,et al.  Reducing the impact of PCR-mediated recombination in molecular evolution and environmental studies using a new-generation high-fidelity DNA polymerase. , 2009, BioTechniques.

[11]  G. Wang,et al.  Frequency of formation of chimeric molecules as a consequence of PCR coamplification of 16S rRNA genes from mixed bacterial genomes , 1997, Applied and environmental microbiology.

[12]  Andrew J. Grimm,et al.  Reducing chimera formation during PCR amplification to ensure accurate genotyping. , 2010, Gene.

[13]  P. Campbell,et al.  Somatic mutation in cancer and normal cells , 2015, Science.

[14]  S. Shafikhani Factors affecting PCR-mediated recombination. , 2002, Environmental Microbiology.

[15]  T. Skopek,et al.  Fidelity of Thermococcus litoralis DNA polymerase (Vent) in PCR determined by denaturing gradient gel electrophoresis. , 1991, Nucleic acids research.

[16]  S. Odelberg,et al.  Template-switching during DNA synthesis by Thermus aquaticus DNA polymerase I. , 1995, Nucleic acids research.

[17]  W. M. Barnes The fidelity of Taq polymerase catalyzing PCR is improved by an N-terminal deletion. , 1992, Gene.

[18]  Anthony M. Zador,et al.  Sources of PCR-induced distortions in high-throughput sequencing data sets , 2014, bioRxiv.

[19]  Jesse J. Salk,et al.  Detection of ultra-rare mutations by next-generation sequencing , 2012, Proceedings of the National Academy of Sciences.

[20]  H A Erlich,et al.  Next-generation sequencing can reveal in vitro-generated PCR crossover products: some artifactual sequences correspond to HLA alleles in the IMGT/HLA database. , 2014, Tissue antigens.

[21]  M. Chamberlin,et al.  Analysis and suppression of DNA polymerase pauses associated with a trinucleotide consensus. , 1996, Nucleic acids research.

[22]  Patrick D. Schloss,et al.  Reducing the Effects of PCR Amplification and Sequencing Artifacts on 16S rRNA-Based Studies , 2011, PloS one.

[23]  T. Kunkel,et al.  DNA polymerase fidelity and the polymerase chain reaction. , 1991, PCR methods and applications.

[24]  J. Loparo,et al.  Mapping DNA polymerase errors by single-molecule sequencing , 2016, Nucleic acids research.

[25]  T. Kunkel,et al.  The fidelity of DNA synthesis catalyzed by derivatives of Escherichia coli DNA polymerase I. , 1990, The Journal of biological chemistry.

[26]  W. Thilly,et al.  Optimization of the polymerase chain reaction with regard to fidelity: modified T7, Taq, and vent DNA polymerases. , 1991, PCR methods and applications.

[27]  J. Vermeesch,et al.  Polymerase specific error rates and profiles identified by single molecule sequencing. , 2016, Mutation research.

[28]  J. Drake A constant rate of spontaneous mutation in DNA-based microbes. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[29]  High-specificity detection of rare alleles with Paired-End Low Error Sequencing (PELE-Seq) , 2016, BMC Genomics.

[30]  Arthur P. Grollman,et al.  Genome-wide quantification of rare somatic mutations in normal human tissues using massively parallel sequencing , 2016, Proceedings of the National Academy of Sciences.

[31]  K. A. Eckert,et al.  High fidelity DNA synthesis by the Thermus aquaticus DNA polymerase , 1990, Nucleic Acids Res..

[32]  A. Meyerhans,et al.  DNA recombination during PCR. , 1990, Nucleic acids research.

[33]  M. Coble,et al.  An optimized protocol for forensic application of the PreCR™ Repair Mix to multiplex STR amplification of UV-damaged DNA. , 2012, Forensic science international. Genetics.

[34]  Juan F Medrano,et al.  Real-time PCR for mRNA quantitation. , 2005, BioTechniques.

[35]  Hairong Duan,et al.  Benefits and Challenges with Applying Unique Molecular Identifiers in Next Generation Sequencing to Detect Low Frequency Mutations , 2016, PloS one.