Purine loading, stem-loops and Chargaff’s second parity rule: a discussion of the application of elementary principles to early chemical observations

DNA base compositions were determined chemically long before sequencing technologies permitted the direct counting of bases. Some recent observations made using modern sequencing technologies could have been deduced by application of elementary principles to early chemical observations. This paper draws attention to the fact that the potential for significant stem-loop structure is a general property of single-stranded DNA (genic and nongenic) and hence for any corresponding transcripts, whether they function by virtue of their structure (eg rRNA) or as mRNA. Furthermore, there is Chargaff’s second parity rule: in single strands, the percentage of purines approximately equals the percentage of pyrimidines. Since, in stems, purines match pyrimidines, Szybalski’s rule that transcripts violate the second parity rule in favour of purines, must apply to loops. Since purine loading occurs in both mesophilic and thermophilic species, genes with transcripts that need stable secondary structures for functioning at high temperatures must achieve this by selectively increasing the GC percentage (GC%) of stems, while retaining purine loading of loops. Arguments that purine loading is specific for the loops of RNA-synonymous strands of genes whose transcripts function by virtue of their secondary structure (ie rRNAs, not mRNAs) need to take into account, as controls, the loop regions of mRNA-synonymous strands. Entire genes, or entire genomes where gene orientation is not considered, are not appropriate controls.

[1]  D. Hickey,et al.  Evidence for strong selective constraint acting on the nucleotide composition of 16S ribosomal RNA genes. , 2002, Nucleic acids research.

[2]  D. Forsdyke,et al.  Introns resolve the conflict between base order-dependent stem-loop potential and the encoding of RNA or protein: further evidence from overlapping genes. , 2001, Gene.

[3]  D. Forsdyke,et al.  A stem-loop "kissing" model for the initiation of recombination and the origin of introns. , 1995, Molecular biology and evolution.

[4]  E. Nevo,et al.  Adaptive role of increased frequency of polypurine tracts in mRNA sequences of thermophilic prokaryotes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[5]  D. Chessel,et al.  Internal correspondence analysis of codon and amino-acid usage in thermophilic bacteria. , 2003, Journal of applied genetics.

[6]  Forsdyke Conservation of Stem-Loop Potential in Introns of Snake Venom Phospholipase A2 Genes: An Application of FORS-D Analysis , 1995 .

[7]  W Szybalski,et al.  Pyrimidine clusters on the transcribing strand of DNA and their possible role in the initiation of RNA synthesis. , 1966, Cold Spring Harbor symposia on quantitative biology.

[8]  J. Mortimer,et al.  Chargaff's legacy. , 2000, Gene.

[9]  Steven Skiena,et al.  Natural Selection and Algorithmic Design of mRNA , 2003, J. Comput. Biol..

[10]  D. Lynn,et al.  Synonymous codon usage is subject to selection in thermophilic bacteria. , 2002, Nucleic acids research.

[11]  C. Burge,et al.  Widespread selection for local RNA secondary structure in coding regions of bacterial genes. , 2003, Genome research.

[12]  J. Mortimer,et al.  Comparison of responses by bacteriophages and bacteria to pressures on the base composition of open reading frames. , 2003, Applied bioinformatics.

[13]  D. Battistutta,et al.  Codon usage in Plasmodium falciparum. , 1988, Molecular and biochemical parasitology.

[14]  E. Chargaff,et al.  Evidence of common regularities in the composition of pentose nucleic acids. , 1955, Biochimica et biophysica acta.

[15]  D. Forsdyke,et al.  Deviations from Chargaff's second parity rule correlate with direction of transcription. , 1999, Journal of theoretical biology.

[16]  D. Forsdyke The origin of species, revisited , 2001 .

[17]  David W. Digby,et al.  mRNAs have greater negative folding free energies than shuffled or codon choice randomized sequences. , 1999, Nucleic acids research.

[18]  Ruth Nussinov,et al.  Strong doublet preferences in nucleotide sequences and DNA geometry , 2005, Journal of Molecular Evolution.

[19]  Christopher A. Madill,et al.  Immunity as a function of the unicellular state: implications of emerging genomic data. , 2002, Trends in immunology.

[20]  G. Bernardi,et al.  Evolutionary Genomics of Vertebrates and Its Implications , 1999, Annals of the New York Academy of Sciences.

[21]  G. Singer,et al.  Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content. , 2003, Gene.

[22]  Paolo Del Giudice,et al.  Pentamer vocabularies characterizing introns and intron-like intergenic tracts from Caenorhabditis elegans and Drosophila melanogaster. , 2003, Gene.

[23]  G Bernardi,et al.  Isochores and the evolutionary genomics of vertebrates. , 2000, Gene.

[24]  R. Gutell,et al.  A story: unpaired adenosine bases in ribosomal RNAs. , 2000, Journal of molecular biology.

[25]  V. Prabhu Symmetry observations in long nucleotide sequences. , 1993, Nucleic acids research.

[26]  Donald R. Forsdyke Symmetry observations in long nucleotide sequences: a commentary on the Discovery Note of Qi and Cuticchia , 2002, Bioinform..

[27]  D. Forsdyke An alternative way of thinking about stem-loops in DNA. A case study of the human G0S2 gene. , 1998, Journal of theoretical biology.

[28]  J. Mortimer,et al.  Optimum growth temperature and the base composition of open reading frames in prokaryotes , 2003, Extremophiles.

[29]  D. Forsdyke,et al.  Relative roles of primary sequence and (G + C)% in determining the hierarchy of frequencies of complementary trinucleotide pairs in DNAs of different species , 1995, Journal of Molecular Evolution.

[30]  Pierre Baldi,et al.  Why are complementary DNA strands symmetric? , 2002, Bioinform..

[31]  D. Forsdyke,et al.  Thermophilic bacteria strictly obey Szybalski's transcription direction rule and politely purine-load RNAs with both adenine and guanine. , 2000, Genome research.

[32]  G. Beer,et al.  Evolution of Life , 1965, Nature.

[33]  Jerry L. Slightom,et al.  Base substitutions, length differences and DNA strand asymmetries in the human G γ and A γ fetal globin gene region , 1981, Cell.