Codon Distribution in Error-Detecting Circular Codes

In 1957, Francis Crick et al. suggested an ingenious explanation for the process of frame maintenance. The idea was based on the notion of comma-free codes. Although Crick’s hypothesis proved to be wrong, in 1996, Arquès and Michel discovered the existence of a weaker version of such codes in eukaryote and prokaryote genomes, namely the so-called circular codes. Since then, circular code theory has invariably evoked great interest and made significant progress. In this article, the codon distributions in maximal comma-free, maximal self-complementary C3 and maximal self-complementary circular codes are discussed, i.e., we investigate in how many of such codes a given codon participates. As the main (and surprising) result, it is shown that the codons can be separated into very few classes (three, or five, or six) with respect to their frequency. Moreover, the distribution classes can be hierarchically ordered as refinements from maximal comma-free codes via maximal self-complementary C3 codes to maximal self-complementary circular codes.

[1]  Giuseppe Pirillo,et al.  A permuted set of a trinucleotide circular code coding the 20 amino acids in variant nuclear codes. , 2013, Journal of theoretical biology.

[2]  Hervé Seligmann,et al.  The ambush hypothesis: hidden stop codons prevent off-frame gene reading. , 2004, DNA and cell biology.

[3]  H. Seligmann Sharp switches between regular and swinger mitochondrial replication: 16S rDNA systematically exchanging nucleotides A<->T+C<->G in the mitogenome of Kamimuria wangi , 2016, Mitochondrial DNA. Part A, DNA mapping, sequencing, and analysis.

[4]  S. Golomb,et al.  Comma-Free Codes , 1958, Canadian Journal of Mathematics.

[5]  Hervé Seligmann,et al.  Bijective transformation circular codes and nucleotide exchanging RNA transcription , 2014, Biosyst..

[7]  M. Giulio A Blind Empiricism Against the Coevolution Theory of the Origin of the Genetic Code , 2001, Journal of Molecular Evolution.

[8]  J. C. Shepherd Method to determine the reading frame of a protein from the purine/pyrimidine genome sequence and its possible evolutionary justification. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Lutz Strüngmann,et al.  On the hierarchy of trinucleotide n-circular codes and their corresponding amino acids. , 2015, Journal of theoretical biology.

[10]  L. Hurst,et al.  Early fixation of an optimal genetic code. , 2000, Molecular biology and evolution.

[11]  Christian J Michel,et al.  A genetic scale of reading frame coding. , 2014, Journal of theoretical biology.

[12]  Peter Schuster,et al.  A principle of natural self-organization , 1977, Naturwissenschaften.

[13]  H. Seligmann Species radiation by DNA replication that systematically exchanges nucleotides? , 2014, Journal of theoretical biology.

[14]  Giuseppe Pirillo,et al.  A relation between trinucleotide comma-free codes and trinucleotide circular codes , 2008, Theor. Comput. Sci..

[15]  F H Crick,et al.  CODES WITHOUT COMMAS. , 1957, Proceedings of the National Academy of Sciences of the United States of America.

[16]  A. Kempf,et al.  Degeneracy in the Genetic Code : How and Why ? , 2009 .

[17]  Giuseppe Pirillo,et al.  Identification of all trinucleotide circular codes , 2010, Comput. Biol. Chem..

[18]  H. G. Khorana,et al.  Polynucleotide synthesis and the genetic code. , 1966, Harvey lectures.

[19]  C J Michel,et al.  A complementary circular code in the protein coding genes. , 1996, Journal of theoretical biology.

[20]  Massimo Di Giulio,et al.  An extension of the coevolution theory of the origin of the genetic code , 2008, Biology Direct.

[21]  Antonin Guilloux,et al.  The genetic code and its optimization for kinetic energy conservation in polypeptide chains , 2012, Biosyst..

[22]  C J Michel,et al.  Analysis of a circular code model. , 2001, Journal of theoretical biology.

[23]  Tzipe Govezensky,et al.  An Extended RNA Code and its Relationship to the Standard Genetic Code: An Algebraic and Geometrical Approach , 2007, Bulletin of mathematical biology.

[24]  Hervé Seligmann,et al.  Mitochondrial swinger replication: DNA replication systematically exchanging nucleotides and short 16S ribosomal DNA swinger inserts , 2014, Biosyst..

[25]  Christian J Michel,et al.  The maximal C(3) self-complementary trinucleotide circular code X in genes of bacteria, eukaryotes, plasmids and viruses. , 2015, Journal of theoretical biology.

[26]  Peculiar symmetry of DNA sequences and evidence suggesting its evolutionary origin in a primeval genetic code , 2001 .

[27]  Tobias Warnecke,et al.  Error prevention and mitigation as forces in the evolution of genes and genomes , 2011, Nature Reviews Genetics.

[28]  M. Eigen,et al.  The Hypercycle: A principle of natural self-organization , 2009 .

[29]  Marshall W. Nirenberg,et al.  The dependence of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides , 1961, Proceedings of the National Academy of Sciences.

[30]  Lutz Strüngmann,et al.  Circular codes, symmetries and transformations , 2015, Journal of mathematical biology.

[31]  Justin Pearson,et al.  Comma-free codes , 2003 .

[32]  Hervé Seligmann Systematic exchanges between nucleotides: Genomic swinger repeats and swinger transcription in human mitochondria. , 2015, Journal of theoretical biology.