A primer-independent DNA polymerase-based method for competent whole-genome amplification of intermediate to high GC sequences

Multiple displacement amplification (MDA) has proven to be a useful technique for obtaining large amounts of DNA from tiny samples in genomics and metagenomics. However, MDA has limitations, such as amplification artifacts and biases that can interfere with subsequent quantitative analysis. To overcome these challenges, alternative methods and engineered DNA polymerase variants have been developed. Here, we present new MDA protocols based on the primer-independent DNA polymerase (piPolB), a replicative-like DNA polymerase endowed with DNA priming and proofreading capacities. These new methods were tested on a genomes mixture containing diverse sequences with high-GC content, followed by deep sequencing. Protocols relying on piPolB as a single enzyme cannot achieve competent amplification due to its limited processivity and the presence of ab initio DNA synthesis. However, an alternative method called piMDA, which combines piPolB with Φ29 DNA polymerases, allows proficient and faithful amplification of the genomes. In addition, the prior denaturation step commonly performed in MDA protocols is dispensable, resulting in a more straightforward protocol. In summary, piMDA outperforms commercial methods in the amplification of metagenomes containing high GC sequences and exhibits similar profiling, error rate, and variant determination as the non-amplified samples. Graphical abstract Schematic representation of methods based on multiple displacement amplification (MDA) for whole genome amplification. The diagrams above represent protocols initiated by random primers (RP-MDA) or a DNA primase-generated short DNA primers (PrimPol-MDA) and continued by Φ29DNAP, whereas the schematics below show piPolB-mediated MDA (left) and the piMDA protocol (right), in which piPolB synthesizes DNA strands that are further extended by Φ29DNAP.

[1]  Yu Lin,et al.  Accurate Binning of Metagenomic Contigs Using Composition, Coverage, and Assembly Graphs , 2022, J. Comput. Biol..

[2]  Edoardo Pasolli,et al.  Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4 , 2022, bioRxiv.

[3]  Yan Deng,et al.  Recent advances and application of whole genome amplification in molecular diagnosis and medicine , 2022, MedComm.

[4]  Jun Yu,et al.  Metagenomic Sequencing for Microbial DNA in Human Samples: Emerging Technological Advances , 2022, International journal of molecular sciences.

[5]  A. Goesmann,et al.  Bakta: rapid and standardized annotation of bacterial genomes via alignment-free sequence identification , 2021, Microbial genomics.

[6]  M. Frohme,et al.  Isothermal amplifications – a comprehensive review on current methods , 2021, Critical reviews in biochemistry and molecular biology.

[7]  Xiang Li,et al.  Template length, concentration and guanidine and cytosine content influence on multiple displacement amplification efficiency. , 2021, Journal of microbiological methods.

[8]  A. Nekrutenko,et al.  Sequencing error profiles of Illumina sequencing instruments , 2021, NAR genomics and bioinformatics.

[9]  Richard Molenkamp,et al.  Recommendations for the introduction of metagenomic high-throughput sequencing in clinical virology, part I: Wet lab procedure. , 2020, Journal of clinical virology : the official publication of the Pan American Society for Clinical Virology.

[10]  M. Salas,et al.  Engineered viral DNA polymerase with enhanced DNA amplification capacity: a proof-of-concept of isothermal amplification of damaged DNA , 2020, Scientific Reports.

[11]  D. Oepkes,et al.  TRIDENT-2: National Implementation of Genome-Wide Non-Invasive Prenatal Testing as a First-Tier Screening Test in the Netherlands. , 2019, American journal of human genetics.

[12]  Tetsuya Hayashi,et al.  Comparison of the sequencing bias of currently available library preparation kits for Illumina sequencing of bacterial genomes and metagenomes , 2019, DNA research : an international journal for rapid publication of reports on genes and genomes.

[13]  Marcos Parras-Moltó,et al.  Evaluation of bias induced by viral enrichment and random amplification protocols in metagenomic surveys of saliva DNA viruses , 2018, Microbiome.

[14]  Jia Gu,et al.  fastp: an ultra-fast all-in-one FASTQ preprocessor , 2018, bioRxiv.

[15]  P. Forterre,et al.  Primer-Independent DNA Synthesis by a Family B DNA Polymerase from Self-Replicating Mobile Genetic Elements , 2017, Cell reports.

[16]  Luis Pedro Coelho,et al.  Towards standards for human fecal sample processing in metagenomic studies , 2017, Nature Biotechnology.

[17]  D. Jablons,et al.  Systematic comparison of two whole-genome amplification methods for targeted next-generation sequencing using frozen and FFPE normal and cancer tissues , 2017, Scientific Reports.

[18]  P. Pevzner,et al.  metaSPAdes: a new versatile metagenomic assembler. , 2017, Genome research.

[19]  Christopher E Mason,et al.  Assessment of REPLI-g Multiple Displacement Whole Genome Amplification (WGA) Techniques for Metagenomic Applications. , 2017, Journal of biomolecular techniques : JBT.

[20]  Bettina Budeus,et al.  TruePrime is a novel method for whole-genome amplification from single cells based on TthPrimPol , 2016, Nature Communications.

[21]  Yan Li,et al.  SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation , 2016, PloS one.

[22]  R. Skirgaila,et al.  In vitro evolution of phi29 DNA polymerase using isothermal compartmentalized self replication technique. , 2016, Protein engineering, design & selection : PEDS.

[23]  Daniel Mapleson,et al.  KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies , 2016, bioRxiv.

[24]  F. Aarestrup,et al.  Impact of Sample Type and DNA Isolation Procedure on Genomic Inference of Microbiome Composition , 2016, mSystems.

[25]  Måns Magnusson,et al.  MultiQC: summarize analysis results for multiple tools and samples in a single report , 2016, Bioinform..

[26]  P. Ehrenfreund,et al.  Systematic evaluation of bias in microbial community profiles induced by whole genome amplification. , 2014, Environmental microbiology.

[27]  J. Nelson Random‐Primed, Phi29 DNA Polymerase‐Based Whole Genome Amplification , 2014, Current protocols in molecular biology.

[28]  Tsunglin Liu,et al.  Effects of GC Bias in Next-Generation-Sequencing Data on De Novo Genome Assembly , 2013, PloS one.

[29]  R. I. Artyukh,et al.  The effect of single-stranded DNA binding proteins on template/primer-independent DNA synthesis in the presence of nicking endonuclease Nt.BspD6I , 2012, Russian Journal of Bioorganic Chemistry.

[30]  Y. Benjamini,et al.  Summarizing and correcting the GC content bias in high-throughput sequencing , 2012, Nucleic acids research.

[31]  David R. Kelley,et al.  Quake: quality-aware detection and correction of sequencing errors , 2010, Genome Biology.

[32]  L. Blanco,et al.  Improvement of φ29 DNA polymerase amplification performance by fusion of DNA binding motifs , 2010, Proceedings of the National Academy of Sciences.

[33]  B. Meyer,et al.  Specific and complete human genome amplification with improved yield achieved by phi29 DNA polymerase and a novel primer at elevated temperature , 2009, BMC Research Notes.

[34]  V. Vasiliev,et al.  N.BspD6I DNA nickase strongly stimulates template-independent synthesis of non-palindromic repetitive DNA by Bst DNA polymerase , 2007, Biological chemistry.

[35]  M. Frank-Kamenetskii,et al.  Very efficient template/primer-independent DNA synthesis by thermophilic DNA polymerase in the presence of a thermophilic restriction endonuclease. , 2004, Biochemistry.

[36]  R. J. Simpson,et al.  Isothermal whole genome amplification from single and small numbers of cells: a new era for preimplantation genetic diagnosis of inherited disease. , 2004, Molecular human reproduction.

[37]  Rameen Beroukhim,et al.  Genome coverage and sequence fidelity of phi29 polymerase-based multiple strand displacement whole genome amplification. , 2004, Nucleic acids research.

[38]  Roger S Lasken,et al.  Whole genome amplification: abundant supplies of DNA from precious samples or clinical specimens. , 2003, Trends in biotechnology.

[39]  Daniel Pinkel,et al.  Whole genome analysis of genetic alterations in small DNA samples using hyperbranched strand displacement amplification and array-CGH. , 2003, Genome research.

[40]  N. Ogata,et al.  Genetic information 'created' by archaebacterial DNA polymerase. , 1997, The Biochemical journal.

[41]  J. Leamon,et al.  Bias in Whole Genome Amplification: Causes and Considerations. , 2015, Methods in molecular biology.

[42]  T. Kroneis Whole Genome Amplification , 2015, Methods in Molecular Biology.

[43]  P. Forterre,et al.  Synergistic template-free synthesis of dsDNA by Thermococcus nautili primase PolpTN2, DNA polymerase PolB, and pTN2 helicase , 2014, Extremophiles.

[44]  Jianqiao Liu,et al.  Single‐nucleotide polymorphism array coupled with multiple displacement amplification: Accuracy and spatial resolution for analysis of chromosome copy numbers in few cells , 2012, Biotechnology and applied biochemistry.

[45]  T. Stockwell,et al.  Mechanism of chimera formation during the Multiple Displacement Amplification reaction , 2007 .

[46]  L. Blanco,et al.  Purification of bacteriophage phi 29 DNA polymerase. , 1995, Methods in enzymology.

[47]  C Garmendia,et al.  Highly efficient DNA synthesis by the phage phi 29 DNA polymerase , 1989 .

[48]  C. Richardson,et al.  Template recognition sequence for RNA primer synthesis by gene 4 protein of bacteriophage T7. , 1981, Proceedings of the National Academy of Sciences of the United States of America.