Maximal dinucleotide and trinucleotide circular codes.

We determine here the number and the list of maximal dinucleotide and trinucleotide circular codes. We prove that there is no maximal dinucleotide circular code with strictly fewer than 6 elements (6 being the maximum size of a dinucleotide circular code). On the other hand, a computer calculation shows that there are maximal trinucleotide circular codes with fewer than 20 elements (20 being the maximum size of a trinucleotide circular code). More precisely, there are maximal trinucleotide circular codes with 14, 15, 16, 17, 18 and 19 elements, and none with fewer than 14 elements. We give the same information for the maximal self-complementary dinucleotide and trinucleotide circular codes. The amino acid distribution of maximal trinucleotide circular codes is also determined.
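The dinucleotide statement can be explored with a short brute-force enumeration. The sketch below is not the authors' procedure. It assumes the graph characterization known from the circular-code literature: a dinucleotide code is circular exactly when the directed graph with one edge (first base to second base) per dinucleotide has no cycle. The names `is_circular` and `maximal_circular_codes` are hypothetical helpers introduced for illustration only.

```python
from itertools import combinations, product

BASES = "ACGT"
DINUCLEOTIDES = ["".join(p) for p in product(BASES, repeat=2)]  # the 16 dinucleotides


def is_circular(code):
    """Assumed criterion: the code is circular iff its base graph is acyclic."""
    edges = {(w[0], w[1]) for w in code}  # one edge per dinucleotide
    nodes = set(BASES)
    # Repeatedly strip vertices without incoming edges; a cycle leaves none to strip.
    while nodes:
        sources = {
            n for n in nodes
            if not any(v == n and u in nodes for u, v in edges)
        }
        if not sources:
            return False  # every remaining vertex lies on or behind a cycle
        nodes -= sources
    return True


def maximal_circular_codes():
    """Return all circular codes that cannot be extended by any dinucleotide."""
    circular = [
        frozenset(c)
        for k in range(1, len(DINUCLEOTIDES) + 1)
        for c in combinations(DINUCLEOTIDES, k)
        if is_circular(c)
    ]
    # Subsets of a circular code are circular, so testing single additions suffices.
    return [
        c for c in circular
        if not any(is_circular(c | {w}) for w in DINUCLEOTIDES if w not in c)
    ]


if __name__ == "__main__":
    codes = maximal_circular_codes()
    print(len(codes), sorted({len(c) for c in codes}))
```

Under that assumption, every code returned has 6 elements, which agrees with the claim that no maximal dinucleotide circular code is smaller than the maximum size. The same exhaustive approach does not scale directly to trinucleotides, where the search space is far larger.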
