Frameshift Signals in Genes Associated with the Circular Code

Three sets of 20 trinucleotides are preferentially associated with the reading frames and their 2 shifted frames of both eukaryotic and prokaryotic genes. These 3 sets are circular codes. They allow retrieval of any frame in genes (containing these circular code words), locally anywhere in the 3 frames and in particular without start codons in the reading frame, and automatically with the reading of a few nucleotides. The circular code in the reading frame, noted X, which can deduce the 2 other circular codes in the shifted frames by permutation, is the information used for analysing frameshift genes, i. e. genes with a change of reading frame during translation. This work studies the circular code signal around their frameshift sites. Two scoring methods are developed, a function P based on this code X and a function Q based both on this code X and the 4 trinucleotides with identical nucleotides. They detect a significant correlation between the code X and the -1 frameshift signals in both eukaryotic and prokaryotic genes, and the +1 frameshift signals in eukaryotic genes.

[1]  R. Weiss,et al.  Recoding: reprogrammed genetic decoding. , 1992, Science.

[2]  Olivier Fayet,et al.  Recoding in bacteriophages and bacterial IS elements. , 2006, Trends in genetics : TIG.

[3]  Christian J. Michel,et al.  A 2006 review of circular codes in genes , 2008, Comput. Math. Appl..

[4]  C. Thomas Caskey,et al.  Translational frameshifting: Where will it stop? , 1987, Cell.

[5]  Raymond F. Gesteland,et al.  RECODE: a database of frameshifting, bypassing and codon redefinition utilized for gene expression , 2001, Nucleic Acids Res..

[6]  R. Hendrix,et al.  Conserved translational frameshift in dsDNA bacteriophage tail assembly genes. , 2004, Molecular cell.

[7]  C J Michel,et al.  A complementary circular code in the protein coding genes. , 1996, Journal of theoretical biology.

[8]  J. Parker,et al.  Errors and alternatives in reading the universal genetic code. , 1989, Microbiological reviews.

[9]  R. Weiss,et al.  Ribosome gymnastics—Degree of difficulty 9.5, style 10.0 , 1990, Cell.

[10]  M. Gerstein,et al.  A small reservoir of disabled ORFs in the Saccharomyces cerevisiae genome and its implications for the dynamics of proteome evolution , 2001 .

[11]  O. Namy,et al.  Reprogrammed genetic decoding in cellular gene expression. , 2004, Molecular cell.

[12]  M. Rossi,et al.  Recoding in Archaea , 2004, Molecular microbiology.

[13]  P. Farabaugh Programmed translational frameshifting. , 1996, Annual review of genetics.

[14]  Mark Gerstein,et al.  A small reservoir of disabled ORFs in the yeast genome and its implications for the dynamics of proteome evolution. , 2002, Journal of molecular biology.

[15]  Christian J. Michel,et al.  Identification of circular codes in bacterial genomes and their use in a factorization method for retrieving the reading frames of genes , 2006, Comput. Biol. Chem..

[16]  B. Prum,et al.  Identification of programmed ribosomal-1 frameshifting sites in the genome of Saccharomyces cerevisiae , 2005 .