Asymptotically increasing compliance of genomes with Chargaff's second parity rules through inversions and inverted transpositions

Chargaff's second parity rules for mononucleotides and oligonucleotides (CIImono and CIIoligo rules) state that a sufficiently long (>100 kb) strand of genomic DNA that contains N copies of a mono- or oligonucleotide, also contains N copies of its reverse complementary mono- or oligonucleotide on the same strand. There is very strong support in the literature for the validity of the rules in coding and noncoding regions, especially for the CIImono rule. Because the experimental support for the CIIoligo rule is much less complete, the present article, focusing on the special case of trinucleotides (triplets), examined several gigabases of genome sequences from a wide range of species and kingdoms including organelles such as mitochondria and chloroplasts. I found that all genomes, with the only exception of certain mitochondria, complied with the CIItriplet rule at a very high level of accuracy in coding and noncoding regions alike. Based on the growing evidence that genomes may contain up to millions of copies of interspersed repetitive elements, I propose in this article a quantitative formulation of the hypothesis that inversions and inverted transposition could be a major contributing if not dominant factor in the almost universal validity of the rules.

[1]  B F Lang,et al.  Mitochondrial genome evolution and the origin of eukaryotes. , 1999, Annual review of genetics.

[2]  David Mitchell,et al.  A test of Chargaff's second rule. , 2006, Biochemical and biophysical research communications.

[3]  V. Prabhu Symmetry observations in long nucleotide sequences. , 1993, Nucleic acids research.

[4]  D. Labuda,et al.  CORE-SINEs: eukaryotic short interspersed retroposing elements with common sequence motifs. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[6]  D. Forsdyke,et al.  Deviations from Chargaff's second parity rule correlate with direction of transcription. , 1999, Journal of theoretical biology.

[7]  J. Sánchez,et al.  Analysis of bilateral inverse symmetry in whole bacterial chromosomes. , 2002, Biochemical and biophysical research communications.

[8]  D. Forsdyke,et al.  Relative roles of primary sequence and (G + C)% in determining the hierarchy of frequencies of complementary trinucleotide pairs in DNAs of different species , 1995, Journal of Molecular Evolution.

[9]  Michael Pheasant,et al.  Transposon-free regions in mammalian genomes. , 2005, Genome research.

[10]  David R. Wolf,et al.  Base compositional structure of genomes. , 1992, Genomics.

[11]  D. Forsdyke,et al.  Chargaff difference analysis of the bithorax complex of Drosophila melanogaster. , 1998, Biochemistry and cell biology = Biochimie et biologie cellulaire.

[12]  Mme. Irène Joliot-Curie , 1956, Nature.

[13]  B. Mcclintock,et al.  The significance of responses of the genome to challenge. , 1984, Science.

[14]  Toshikazu Ebisuzaki,et al.  Structure and dynamics of RNA polymerase II elongation complex. , 2006, Biochemical and biophysical research communications.

[15]  S. Boissinot,et al.  The structures of mouse and human L1 elements reflect their insertion mechanism , 2005, Cytogenetic and Genome Research.