On the hierarchy of trinucleotide n-circular codes and their corresponding amino acids.

Circular codes are putative remnants of primeval comma-free codes and are potentially involved in detecting and maintaining the normal reading frame in protein coding sequences. In Michel and Pirillo (2013a) it was shown by computer algorithm that no maximal trinucleotide circular code can encode more than 18 different amino acids under the standard version of the genetic code. For comma-free codes the maximum is even less, namely 13 (Michel, 2014). The main purpose of this paper is to investigate these facts from a mathematical point of view and to show why the codes with the best-known error detecting properties are limited in the number of amino acids they can encode. We introduce five hierarchically ordered classes of trinucleotide codes including the well-known comma-free and circular codes and prove combinatorically that it is impossible to encode all amino acids using codes from four out of the five classes that have the strongest error detecting properties. However, it is possible to encode all 20 amino acids using codes from the largest class with the weakest properties. Additionally, we develop a handy criterion for circularity. As an application, it is shown that all codes from a special class of trinucleotide codes which includes the RNY-primeval code (Shepherd, 1986) are automatically circular. We also list which amino acids these codes encode.

[1]  L. Welch,et al.  CONSTRUCTION AND PROPERTIES OF COMMA-FREE CODES , 2015 .

[2]  V. Nanjundiah George Gamow and the genetic code , 2004 .

[3]  Diego L. González,et al.  The Mathematical Structure of the Genetic Code , 2008 .

[4]  F. Crick,et al.  Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid , 1953, Nature.

[5]  Christian J Michel,et al.  A genetic scale of reading frame coding. , 2014, Journal of theoretical biology.

[6]  Giuseppe Pirillo,et al.  A classification of 20-trinucleotide circular codes , 2012, Inf. Comput..

[7]  F. Crick,et al.  A structure for deoxyribose nucleic acid , 2017 .

[8]  Giuseppe Pirillo,et al.  Dinucleotide Circular Codes , 2013 .

[9]  Simone Giannerini,et al.  Strong short-range correlations and dichotomic codon classes in coding DNA sequences. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Giuseppe Pirillo,et al.  Identification of all trinucleotide circular codes , 2010, Comput. Biol. Chem..

[11]  Christian J. Michel,et al.  A necklace algorithm to determine the growth function of trinucleotide circular codes , 2013 .

[12]  Rumer IuB Systematization of codons in the genetic code , 1969 .

[13]  Brian Hayes,et al.  THE INVENTION OF THE GENETIC CODE , 1998 .

[14]  Giuseppe Pirillo,et al.  Varieties of comma-free codes , 2008, Comput. Math. Appl..

[15]  Tidjani Negadi,et al.  The Genetic Code Degeneracy and the Amino Acids Chemical Composition are Connected , 2008, 0903.4131.

[16]  Jean-Luc Jestin,et al.  Degeneracy in the genetic code and its symmetries by base substitutions. , 2003, Comptes rendus biologies.

[17]  S. Giannerini,et al.  Circular codes revisited: a statistical approach. , 2011, Journal of theoretical biology.

[18]  A. Sciarrino A mathematical model accounting for the organization in multiplets of the genetic code. , 2001, Bio Systems.

[19]  S. Golomb,et al.  Comma-Free Codes , 1958, Canadian Journal of Mathematics.

[20]  A J Koch,et al.  About a symmetry of the genetic code. , 1997, Journal of theoretical biology.

[21]  Iu B Rumer [Systematization of codons in the genetic code]. , 1968, Doklady Akademii nauk SSSR.

[22]  Giuseppe Pirillo,et al.  A relation between trinucleotide comma-free codes and trinucleotide circular codes , 2008, Theor. Comput. Sci..

[23]  Giuseppe Pirillo,et al.  A permuted set of a trinucleotide circular code coding the 20 amino acids in variant nuclear codes. , 2013, Journal of theoretical biology.

[24]  Christian J. Michel,et al.  Circular code motifs in transfer and 16S ribosomal RNAs: A possible translation code in genes , 2012, Comput. Biol. Chem..

[25]  C J Michel,et al.  A complementary circular code in the protein coding genes. , 1996, Journal of theoretical biology.