Improved assay-dependent searching of nucleic acid sequence databases

Nucleic acid-based biochemical assays are crucial to modern biology. Key applications, such as detection of bacterial, viral and fungal pathogens, require detailed knowledge of assay sensitivity and specificity to obtain reliable results. Improved methods to predict assay performance are needed for exploiting the exponentially growing amount of DNA sequence data and for reducing the experimental effort required to develop robust detection assays. Toward this goal, we present an algorithm for the calculation of sequence similarity based on DNA thermodynamics. In our approach, search queries consist of one to three oligonucleotide sequences representing either a hybridization probe, a pair of Padlock probes or a pair of PCR primers with an optional TaqMan™ probe (i.e. in silico or 'virtual' PCR). Matches are reported if the query and target satisfy both the thermodynamics of the assay (binding at a specified hybridization temperature and/or change in free energy) and the relevant biological constraints (assay sequences binding to the correct target duplex strands in the required orientations). The sensitivity and specificity of our method are evaluated by comparing predicted to known sequence tagged sites in the human genome. Free energy is shown to be a more sensitive and specific match criterion than hybridization temperature.
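The two match conditions described above (a thermodynamic threshold plus strand and orientation constraints) can be illustrated with a minimal Python sketch. It is not the algorithm presented here: it assumes the unified nearest-neighbor parameters of SantaLucia (1998) at 1 M NaCl, ignores salt and symmetry corrections, and replaces full mismatch thermodynamics with a much cruder rule that scores only the perfectly matched stretch anchored at the primer's 3' end. The names `duplex_dg`, `binding_sites` and `virtual_pcr`, and the parameters `max_dg` and `max_amplicon`, are hypothetical and chosen for illustration only.

```python
# Unified nearest-neighbor parameters (SantaLucia 1998, 1 M NaCl):
# 5'->3' dinucleotide -> (dH in kcal/mol, dS in cal/(mol*K)).
NN = {
    "AA": (-7.9, -22.2), "TT": (-7.9, -22.2), "AT": (-7.2, -20.4),
    "TA": (-7.2, -21.3), "CA": (-8.5, -22.7), "TG": (-8.5, -22.7),
    "GT": (-8.4, -22.4), "AC": (-8.4, -22.4), "CT": (-7.8, -21.0),
    "AG": (-7.8, -21.0), "GA": (-8.2, -22.2), "TC": (-8.2, -22.2),
    "CG": (-10.6, -27.2), "GC": (-9.8, -24.4), "GG": (-8.0, -19.9),
    "CC": (-8.0, -19.9),
}
INIT_GC = (0.1, -2.8)  # initiation term for a terminal G.C pair
INIT_AT = (2.3, 4.1)   # initiation term for a terminal A.T pair
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def duplex_dg(seq, temp_c=37.0):
    """dG (kcal/mol) of seq paired with its perfect complement."""
    terms = [INIT_GC if end in "GC" else INIT_AT for end in (seq[0], seq[-1])]
    terms += [NN[seq[i:i + 2]] for i in range(len(seq) - 1)]
    dh = sum(h for h, _ in terms)
    ds = sum(s for _, s in terms)
    return dh - (temp_c + 273.15) * ds / 1000.0


def binding_sites(strand, primer, max_dg, temp_c=37.0):
    """Sites where the primer anneals to the complement of `strand`.

    Simplification (an assumption of this sketch): only the perfectly
    matched stretch anchored at the primer's 3' end is scored.
    Returns (start, end, dG) tuples in `strand` coordinates.
    """
    hits = []
    for end in range(2, len(strand) + 1):
        n = 0
        while n < min(len(primer), end) and primer[-1 - n] == strand[end - 1 - n]:
            n += 1
        if n >= 2:
            dg = duplex_dg(primer[-n:], temp_c)
            if dg <= max_dg:
                hits.append((end - n, end, dg))
    return hits


def virtual_pcr(plus, fwd, rev, max_dg, max_amplicon=1000, temp_c=37.0):
    """Predicted amplicons as (start, stop, fwd_dG, rev_dG), plus-strand coords.

    Orientation constraint: fwd reads along the plus strand, rev along the
    minus strand, and the two 3' ends must face each other without overlap.
    Only this one primer/strand assignment is searched.
    """
    minus, length = revcomp(plus), len(plus)
    products = []
    for f_start, f_end, f_dg in binding_sites(plus, fwd, max_dg, temp_c):
        for r_start, r_end, r_dg in binding_sites(minus, rev, max_dg, temp_c):
            stop = length - r_start      # one past the amplicon's last base
            rev_3prime = length - r_end  # plus-strand index of rev's 3' end
            if f_end <= rev_3prime and stop - f_start <= max_amplicon:
                products.append((f_start, stop, f_dg, r_dg))
    return products


if __name__ == "__main__":
    fwd, rev = "CAGGTCGACTCTAG", "GGATCCTTGCATGC"
    template = "TTT" + fwd + "A" * 20 + revcomp(rev) + "GGG"
    print(virtual_pcr(template, fwd, rev, max_dg=-10.0))
```

Tightening `max_dg` (a more negative value) demands a longer or more GC-rich matched stretch, which is how a free-energy threshold acts as the match criterion in this simplified setting. Placing the reverse primer's own sequence, rather than its reverse complement, on the plus strand yields no product, because both primers would then bind the same strand.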
