Microsatellites that violate Chargaff's second parity rule have base order-dependent asymmetries in the folding energies of complementary DNA strands and may not drive speciation.

Models for meiotic recombination based on Crick's "unpairing postulate" require symmetrical extrusion of stem-loop structures from homologous DNA duplexes. The potential for such extrusion is abundant in many species and, for a given single-strand segment, can be quantified as the "folding of natural sequence" (FONS) energy value. This value, in turn, can be decomposed into base order-dependent and base composition-dependent components. In most Caenorhabditis elegans segments the FONS values of the top and bottom strands are close, as are the corresponding base order-dependent and base composition-dependent components; any discrepancies are in the base composition-dependent component. This suggests that the two strands would extrude with similar kinetics. However, interspersed among these segments and at the ends of chromosomes (telomeres) are segments containing short tandem repeats (microsatellites) which, by virtue of their high variability, have been postulated to inhibit the pairing of homologous chromosomes and hence to drive speciation. In these segments there are usually wide discrepancies between the FONS values of the top and bottom strands, mainly attributable to differences in the base order-dependent components. Analyses of artificial microsatellites of different unit sizes and base compositions show that this asymmetrical distribution of folding potential is greatest when the repeat units are short and violate Chargaff's second parity rule. It is proposed that, when there is folding asymmetry, recombination proceeds by special, strand-biased, somatic mechanisms analogous to those operating with Chi sequences in Escherichia coli. If meiotic recombination in the germ-line requires extrusion symmetry, then a general inhibitory influence of microsatellite-containing segments could mask the antirecombinational influence of their variability. Thus, microsatellites may not have driven speciation.
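The two quantities the abstract relies on, the split of a FONS value into composition-dependent and order-dependent parts and the degree to which a repeat unit departs from Chargaff's second parity rule, can be illustrated with a minimal Python sketch. It is not the authors' software, and the names `pr2_skew`, `decompose`, and `energy_fn` are hypothetical. The sketch assumes the convention used in related folding-energy analyses: the composition-dependent component is the mean energy of shuffled versions of the segment, and the order-dependent component is the FONS value minus that mean. The folding-energy calculation itself is left to a caller-supplied function, since the abstract does not specify one.

```python
import random
from collections import Counter
from statistics import mean

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the bottom-strand sequence (5'->3') for a top-strand sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def pr2_skew(unit):
    """Illustrative measure of departure from Chargaff's second parity rule.

    Within a single strand the rule expects A ~ T and G ~ C.
    Returns (|A - T| + |G - C|) / length: 0 means the unit obeys the rule.
    """
    counts = Counter(unit.upper())
    skew = abs(counts["A"] - counts["T"]) + abs(counts["G"] - counts["C"])
    return skew / len(unit)


def decompose(seq, energy_fn, n_shuffles=10, seed=0):
    """Split a folding energy into composition- and order-dependent parts.

    energy_fn is a caller-supplied placeholder for a real folding-energy
    calculation (an assumption; none is implemented here).
    Returns (fons, composition_component, order_component).
    """
    rng = random.Random(seed)
    fons = energy_fn(seq)
    shuffled_energies = []
    for _ in range(n_shuffles):
        bases = list(seq)
        rng.shuffle(bases)  # keeps base composition, destroys base order
        shuffled_energies.append(energy_fn("".join(bases)))
    composition_component = mean(shuffled_energies)
    order_component = fons - composition_component
    return fons, composition_component, order_component
```

For a dinucleotide unit such as GT, `pr2_skew("GT")` is maximal and the bottom strand carries the different unit AC, whereas a unit such as AT obeys the rule and is its own reverse complement. Comparing `decompose(top, energy_fn)` with `decompose(reverse_complement(top), energy_fn)` for a chosen energy function would then show which component accounts for any strand asymmetry.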
