A multiple-alignment based primer design algorithm for genetically highly variable DNA targets

BackgroundPrimer design for highly variable DNA sequences is difficult, and experimental success requires attention to many interacting constraints. The advent of next-generation sequencing methods allows the investigation of rare variants otherwise hidden deep in large populations, but requires attention to population diversity and primer localization in relatively conserved regions, in addition to recognized constraints typically considered in primer design.ResultsDesign constraints include degenerate sites to maximize population coverage, matching of melting temperatures, optimizing de novo sequence length, finding optimal bio-barcodes to allow efficient downstream analyses, and minimizing risk of dimerization. To facilitate primer design addressing these and other constraints, we created a novel computer program (PrimerDesign) that automates this complex procedure. We show its powers and limitations and give examples of successful designs for the analysis of HIV-1 populations.ConclusionsPrimerDesign is useful for researchers who want to design DNA primers and probes for analyzing highly variable DNA populations. It can be used to design primers for PCR, RT-PCR, Sanger sequencing, next-generation sequencing, and other experimental protocols targeting highly variable DNA samples.

[1]  Masud Mansuripur,et al.  Introduction to information theory , 1986 .

[2]  M. Zuker,et al.  OligoArray 2.0: design of oligonucleotide probes for DNA microarrays using a thermodynamic approach. , 2003, Nucleic acids research.

[3]  Cassandra B. Jabara,et al.  Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID , 2011, Proceedings of the National Academy of Sciences.

[4]  Gayathri Athreya,et al.  Unequal Evolutionary Rates in the Human Immunodeficiency Virus Type 1 (HIV-1) Pandemic: the Evolutionary Rate of HIV-1 Slows Down When the Epidemic Rate Increases , 2007, Journal of Virology.

[5]  Annelise E Barron,et al.  Quantitative experimental determination of primer–dimer formation risk by free‐solution conjugate electrophoresis , 2012, Electrophoresis.

[6]  Alan S. Perelson,et al.  Early Low-Titer Neutralizing Antibodies Impede HIV-1 Replication and Select for Virus Escape , 2012, PLoS pathogens.

[7]  J. Butler,et al.  AutoDimer: a screening tool for primer-dimer and hairpin structures. , 2004, BioTechniques.

[8]  Ruslan Kalendar,et al.  Java web tools for PCR, in silico PCR, and oligonucleotide assembly and analysis. , 2011, Genomics.

[9]  Burkhard Morgenstern,et al.  The role of recombination in the emergence of a complex and dynamic HIV epidemic , 2010, Retrovirology.

[10]  Huldrych F. Günthard,et al.  Whole Genome Deep Sequencing of HIV-1 Reveals the Impact of Early Minor Variants Upon Immune Recognition During Acute Infection , 2012, PLoS pathogens.

[11]  BMC Bioinformatics , 2005 .

[12]  W. Rychlik,et al.  OLIGO 7 primer analysis software. , 2007, Methods in molecular biology.

[13]  Robert Giegerich,et al.  GeneFisher-Software Support for the Detection of Postulated Genes , 1996, ISMB.

[14]  J. SantaLucia,et al.  Nearest neighbor thermodynamic parameters for internal G.A mismatches in DNA. , 1998, Biochemistry.

[15]  Joakim Lundeberg,et al.  Monitoring Resistance to Human Immunodeficiency Virus Type 1 Protease Inhibitors by Pyrosequencing , 2001, Journal of Clinical Microbiology.

[16]  M. Ronaghi,et al.  Discovery of Single Nucleotide Polymorphisms and Mutations by Pyrosequencing , 2002, Comparative and functional genomics.

[17]  David L. Robertson,et al.  The Evolutionary Analysis of Emerging Low Frequency HIV-1 CXCR4 Using Variants through Time—An Ultra-Deep Approach , 2010, PLoS Comput. Biol..

[18]  John W. Mellors,et al.  Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences , 1997 .

[19]  Alan S. Perelson,et al.  Transmission of Single HIV-1 Genomes and Dynamics of Early Immune Escape Revealed by Ultra-Deep Sequencing , 2010, PloS one.

[20]  Jan Albert,et al.  Tempo and Mode of Nucleotide Substitution in gag andenv Gene Fragments in Human Immunodeficiency Virus Type 1 Populations with a Known Transmission History , 1998, Journal of Virology.

[21]  B. Faircloth,et al.  Primer3—new capabilities and interfaces , 2012, Nucleic acids research.

[22]  L. Castellano,et al.  Deep sequencing of small RNAs identifies canonical and non-canonical miRNA and endogenous siRNAs in mammalian somatic tissues , 2013, Nucleic acids research.

[23]  C. Mbogo,et al.  Deep sequencing reveals extensive variation in the gut microbiota of wild mosquitoes from Kenya , 2012, Molecular ecology.

[24]  J. Lundeberg,et al.  Dynamics of HIV-1 Quasispecies during Antiviral Treatment Dissected Using Ultra-Deep Pyrosequencing , 2010, PloS one.

[25]  J. SantaLucia,et al.  Improved nearest-neighbor parameters for predicting DNA duplex stability. , 1996, Biochemistry.

[26]  Peter De Rijk,et al.  SNPbox: a modular software package for large-scale primer design , 2005, Bioinform..

[27]  Thomas Leitner,et al.  Recombination Rate and Selection Strength in HIV Intra-patient Evolution , 2009, PLoS Comput. Biol..

[28]  Bette Korber,et al.  Epitope-Specific CD8+ T Lymphocytes Cross-Recognize Mutant Simian Immunodeficiency Virus (SIV) Sequences but Fail To Contain Very Early Evolution and Eventual Fixation of Epitope Escape Mutations during SIV Infection , 2011, Journal of Virology.

[29]  Accurate variant detection across non-amplified and whole genome amplified DNA using targeted next generation sequencing , 2012, BMC Genomics.

[30]  Alan S. Perelson,et al.  The first T cell response to transmitted/founder virus contributes to the control of acute viremia in HIV-1 infection , 2009, The Journal of experimental medicine.

[31]  B. Korber,et al.  Genetic differences between blood- and brain-derived viral sequences from human immunodeficiency virus type 1-infected patients: evidence of conserved elements in the V3 region of the envelope protein of brain-derived sequences , 1994, Journal of virology.

[32]  James Theiler,et al.  Quantitative Deep Sequencing Reveals Dynamic HIV-1 Escape and Large Population Shifts during CCR5 Antagonist Therapy In Vivo , 2009, PloS one.

[33]  Jakob Fredslund,et al.  A general pipeline for the development of anchor markers for comparative genomics in plants , 2006, BMC Genomics.