Free energy estimation of short DNA duplex hybridizations

BackgroundEstimation of DNA duplex hybridization free energy is widely used for predicting cross-hybridizations in DNA computing and microarray experiments. A number of software programs based on different methods and parametrizations are available for the theoretical estimation of duplex free energies. However, significant differences in free energy values are sometimes observed among estimations obtained with various methods, thus being difficult to decide what value is the accurate one.ResultsWe present in this study a quantitative comparison of the similarities and differences among four published DNA/DNA duplex free energy calculation methods and an extended Nearest-Neighbour Model for perfect matches based on triplet interactions. The comparison was performed on a benchmark data set with 695 pairs of short oligos that we collected and manually curated from 29 publications. Sequence lengths range from 4 to 30 nucleotides and span a large GC-content percentage range. For perfect matches, we propose an extension of the Nearest-Neighbour Model that matches or exceeds the performance of the existing ones, both in terms of correlations and root mean squared errors. The proposed model was trained on experimental data with temperature, sodium and sequence concentration characteristics that span a wide range of values, thus conferring the model a higher power of generalization when used for free energy estimations of DNA duplexes under non-standard experimental conditions.ConclusionsBased on our preliminary results, we conclude that no statistically significant differences exist among free energy approximations obtained with 4 publicly available and widely used programs, when benchmarked against a collection of 695 pairs of short oligos collected and curated by the authors of this work based on 29 publications. The extended Nearest-Neighbour Model based on triplet interactions presented in this work is capable of performing accurate estimations of free energies for perfect match duplexes under both standard and non-standard experimental conditions and may serve as a baseline for further developments in this area of research.

[1]  J. SantaLucia,et al.  Thermodynamic parameters for DNA sequences with dangling ends. , 2000, Nucleic acids research.

[2]  K. Breslauer,et al.  Thermodynamic consequences of an abasic lesion in duplex DNA are strongly dependent on base sequence. , 1998, Biochemistry.

[3]  J. SantaLucia,et al.  Nearest neighbor thermodynamic parameters for internal G.A mismatches in DNA. , 1998, Biochemistry.

[4]  Florent Barbault,et al.  A new peculiar DNA structure: NMR solution structure of a DNA kissing complex , 2002, Journal of biomolecular structure & dynamics.

[5]  M C Pirrung,et al.  A universal, photocleavable DNA base: nitropiperonyl 2'-deoxyriboside. , 2001, The Journal of organic chemistry.

[6]  N. Sugimoto,et al.  Nucleic acid duplex stability: influence of base composition on cation effects. , 1999, Nucleic acids research.

[7]  Francisco Melo,et al.  Comparison of different melting temperature calculation methods for short DNA sequences , 2005, Bioinform..

[8]  M J Doktycz,et al.  Studies of DNA dumbbells. I. Melting curves of 17 DNA dumbbells with different duplex stem sequences linked by T4 endloops: Evaluation of the nearest‐neighbor stacking interactions in DNA , 1992, Biopolymers.

[9]  J. SantaLucia,et al.  Nearest-neighbor thermodynamics of internal A.C mismatches in DNA: sequence dependence and pH effects. , 1998, Biochemistry.

[10]  N. Arnheim,et al.  Stability of intrastrand hairpin structures formed by the CAG/CTG class of DNA triplet repeats associated with neurological diseases. , 1996, Nucleic acids research.

[11]  M. D. Morris,et al.  Optical Melting of 128 Octamer DNA Duplexes , 1995, The Journal of Biological Chemistry.

[12]  R. Blake,et al.  Stacking energies in DNA. , 1991, The Journal of biological chemistry.

[13]  D A LeBlanc,et al.  Thermodynamic characterization of deoxyribooligonucleotide duplexes containing bulges. , 1991, Biochemistry.

[14]  Richard Owczarzy,et al.  Studies of DNA dumbbells VII: Evaluation of the next‐nearest‐neighbor sequence‐dependent interactions in duplex DNA , 1999, Biopolymers.

[15]  J. SantaLucia,et al.  Thermodynamics of internal C.T mismatches in DNA. , 1998, Nucleic acids research.

[16]  W D Wilson,et al.  Unusual duplex formation in purine rich oligodeoxyribonucleotides. , 1988, Nucleic acids research.

[17]  J. SantaLucia,et al.  Thermodynamics and NMR of internal G.T mismatches in DNA. , 1997, Biochemistry.

[18]  J. McCaskill The equilibrium partition function and base pair binding probabilities for RNA secondary structure , 1990, Biopolymers.

[19]  BMC Bioinformatics , 2005 .

[20]  A. Condon,et al.  Secondary structure prediction of interacting RNA molecules. , 2005, Journal of molecular biology.

[21]  C. W. Hilbers,et al.  The effect of single base-pair mismatches on the duplex stability of d(T-A-T-T-A-A-T-A-T-C-A-A-G-T-T-G) . d(C-A-A-C-T-T-G-A-T-A-T-T-A-A-T-A). , 1984, European journal of biochemistry.

[22]  K. Breslauer,et al.  Influence of an exocyclic guanine adduct on the thermal stability, conformation, and melting thermodynamics of a DNA duplex. , 1992, Biochemistry.

[23]  N. Sugimoto,et al.  Improved thermodynamic parameters and helix initiation factor to predict stability of DNA duplexes. , 1996, Nucleic acids research.

[24]  J. SantaLucia,et al.  A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Raad Z Gharaibeh,et al.  Software Note: Using probe secondary structure information to enhance Affymetrix GeneChip background estimates , 2007, Comput. Biol. Chem..

[26]  I. Tinoco,et al.  Comparison between DNA melting thermodynamics and DNA polymerase fidelity. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[27]  H. Blöcker,et al.  Predicting DNA duplex stability from the base sequence. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Naoki Sugimoto,et al.  Temperature dependence of thermodynamic properties for DNA/DNA and RNA/DNA duplex formation. , 2002, European journal of biochemistry.

[29]  Myron F. Goodman,et al.  Gene Targeting in Rat Embryo Fibroblasts Promoted by the Polyomavirus Large T Antigen Associated With Neurological Diseases , 1996 .

[30]  Kevin P. Murphy,et al.  Efficient parameter estimation for RNA secondary structure prediction , 2007, ISMB/ECCB.

[31]  Naoki Sugimoto,et al.  Application of the Thermodynamic Parameters of DNA Stability Prediction to Double-Helix Formation of Deoxyribooligonucleotides , 1994 .

[32]  P. Vallone,et al.  Predicting sequence-dependent melting stability of short duplex DNA oligomers. , 1997, Biopolymers.

[33]  B R Amirikyan,et al.  Allowance for heterogeneous stacking in the DNA helix-coil transition theory. , 1984, Journal of biomolecular structure & dynamics.

[34]  G A Leonard,et al.  Structural and thermodynamic studies on the adenine.guanine mismatch in B-DNA. , 1990, Nucleic acids research.

[35]  Michael Zuker,et al.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information , 1981, Nucleic Acids Res..

[36]  Michael Zuker,et al.  DINAMelt web server for nucleic acid melting prediction , 2005, Nucleic Acids Res..

[37]  W D Wilson,et al.  Sequence specific thermodynamic and structural properties for DNA.RNA duplexes. , 1994, Biochemistry.

[38]  D. Turner,et al.  Measuring the thermodynamics of RNA secondary structure formation. , 1997, Biopolymers.

[39]  S. Müller,et al.  RNA double cleavage by a hairpin-derived twin ribozyme. , 2000, Nucleic acids research.

[40]  Naoki Sugimoto,et al.  Double-Helix Melting of Octamers of Deoxyriboadenylic and Deoxyribothymidylic Acids in the Presence of Ethidium , 1991 .

[41]  D. Gray,et al.  CD, absorption and thermodynamic analysis of repeating dinucleotide DNA, RNA and hybrid duplexes [d/r(AC)]12.[d/r(GT/U)]12 and the influence of phosphorothioate substitution. , 1997, Nucleic acids research.

[42]  F. Seela,et al.  The N(8)-(2'-deoxyribofuranoside) of 8-aza-7-deazaadenine: a universal nucleoside forming specific hydrogen bonds with the four canonical DNA constituents. , 2000, Nucleic acids research.

[43]  W D Wilson,et al.  Thermodynamics of DNA duplexes with adjacent G.A mismatches. , 1991, Biochemistry.

[44]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[45]  J. SantaLucia,et al.  Nearest-neighbor thermodynamics and NMR of DNA sequences with internal A.A, C.C, G.G, and T.T mismatches. , 1999, Biochemistry.

[46]  J. SantaLucia,et al.  Improved nearest-neighbor parameters for predicting DNA duplex stability. , 1996, Biochemistry.

[47]  Yusaku Tagashira,et al.  Stabilities of nearest‐neighbor doublets in double‐helical DNA determined by fitting calculated melting profiles to observed profiles , 1981 .

[48]  Mirela Andronescu,et al.  Algorithms for predicting the secondary structure of pairs and combinatorial sets of nucleic acid strands , 2003 .

[49]  E. Lesnik,et al.  Relative thermodynamic stability of DNA, RNA, and DNA:RNA hybrid duplexes: relationship with base composition and structure. , 1995, Biochemistry.

[50]  P. Schuster,et al.  Complete suboptimal folding of RNA and the stability of secondary structures. , 1999, Biopolymers.

[51]  Ignacio Tinoco,et al.  Base-base mismatches. Thermodynamics of double helix formation for dCA3XA3G + dCT3YT3G (X, Y = A, C, G, T) , 1985, Nucleic Acids Res..

[52]  Masahito Yamamoto,et al.  Thermodynamic parameters based on a nearest-neighbor model for DNA sequences with a single-bulge loop. , 2004, Biochemistry.