Identification and simulation of shifted periodicities common to protein coding genes of eukaryotes, prokaryotes and viruses.

The distribution of nucleotides in protein coding genes is studied with autocorrelation functions. The autocorrelation function YRY(N)iYRY, analysing the occurrence probability of the i-motif YRY(N)iYRY (two motifs YRY separated by any i bases N, R = purine = Adenine or Guanine, Y = pyrimidine = Cytosine or Thymine, N = R or Y) in the protein coding genes of eukaryotes, prokaryotes and viruses, reveals the classical periodicity 0 modulo 3 associated with the normal frame 0 (maximal values of the function at i = 0, 3, 6, etc). The specification of YRY(N)iYRY on the alphabet [A, C, G, T] leads to 64 i-motifs: CAC(N)iCAC, CAC(N)iCAT, ..., TGF(N)iTGT. The 64 autocorrelation functions associated with these 64 i-motifs in protein coding genes have all the periodicity modulo 3, but, surprisingly, not always the expected periodicity 0 modulo 3. Two new types of periodicities are identified: a periodicity 1 modulo 3 associated with the shifted frame +1 (maximal values of the function at i = 1, 4, 7, etc) and a periodicity 2 modulo 3 associated with the shifted frame -1 (maximal values of the function at i = 2, 5, 8 etc). Furthermore, the classification of i-motifs according to the type of periodicity demonstrates a strong coherence relation between the 64 i-motifs, which is, in addition, common to the three gene populations, as the same i-motifs in the three gene populations have the same periodicities. The three periodicities 0, 1 and 2 modulo 3 can be simulated by an evolutionary model at two successive processes. The simulated genes are generated by a process of gene construction, with a stochastic automaton followed by a process of gene evolution with random insertions and deletions of trinucleotides simulating RNA editing. For almost all i-motifs, the autocorrelation functions in these simulated genes are strongly correlated with those in protein coding genes, for both the type and the probability level of periodicities. This paper describes the process of ribosomal frameshifting leading to the shifted periodicities, which may reveal overlapping genes or concatenated genes from different frames. It also presents the evolutionary aspects of the shifted periodicities. The shifted periodicities cannot be associated with the RNY model (Eigen & Schuster, 1978, Naturwissenschaften 65, 341-369) or the RRY model (Crick et al., 1976, Origins of Life 7, 389-397), but are compatible with the oligonucleotide mixing model (Arquès & Michel, 1990, Bull. math. Biol. 52, 741-772). Finally, a variant of the primitive translation model of Crick et al. (1976) is proposed to explain the shifted periodicities.

[1]  J. F. Atkins,et al.  tRNA anticodon replacement experiments show that ribosomal frameshifting can be caused by doublet decoding. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[2]  T. Fox,et al.  Leaky +1 and −1 frameshift mutations at the same site in a yeast mitochondrial gene , 1980, Nature.

[3]  U. Lagerkvist Unorthodox codon reading and the evolution of the genetic code , 1981, Cell.

[4]  L F Landweber,et al.  Phylogenetic analysis of RNA editing: a primitive genetic phenomenon. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[5]  R. Benne,et al.  Major transcript of the frameshifted coxll gene from trypanosome mitochondria contains four nucleotides that are not encoded in the DNA , 1986, Cell.

[6]  R. Benne,et al.  RNA editing in trypanosome mitochondria: guidelines for models , 1993, FEBS letters.

[7]  Philip J. Farabaugh,et al.  Ribosomal frameshifting in the yeast retrotransposon Ty: tRNAs induce slippage on a 7 nucleotide minimal site , 1990, Cell.

[8]  M Yarus,et al.  Translational efficiency of transfer RNA's: uses of an extended anticodon. , 1982, Science.

[9]  D. Crothers,et al.  On the physical basis for ambiguity in genetic coding interactions. , 1978, Proceedings of the National Academy of Sciences of the United States of America.

[10]  R. Weiss,et al.  Slippery runs, shifty stops, backward steps, and forward hops: -2, -1, +1, +2, +5, and +6 ribosomal frameshifting. , 1987, Cold Spring Harbor symposia on quantitative biology.

[11]  T. Cech,et al.  RNA editing: World's smallest introns? , 1991, Cell.

[12]  D. Dunn,et al.  Reading frame switch caused by base‐pair formation between the 3′ end of 16S rRNA and the mRNA during elongation of protein synthesis in Escherichia coli. , 1988, The EMBO journal.

[13]  R. Gaber,et al.  Codon recognition during frameshift suppression in Saccharomyces cerevisiae , 1984, Molecular and cellular biology.

[14]  A. Kingsman,et al.  A retrovirus-like strategy for expression of a fusion protein encoded by yeast transposon Ty1 , 1985, Nature.

[15]  J. Feagin RNA editing in kinetoplastid mitochondria. , 1990, The Journal of biological chemistry.

[16]  D. Arquès,et al.  Periodicities in coding and noncoding regions of the genes. , 1990, Journal of theoretical biology.

[17]  R. Weiss,et al.  On the mechanism of ribosomal frameshifting at hungry codons. , 1988, Journal of molecular biology.

[18]  K. Stuart RNA editing in mitochondrial mRNA of trypanosomatids. , 1991, Trends in biochemical sciences.

[19]  B. Suter,et al.  Queuosine modification of the wobble base in tRNAHis influences ‘in vivo’ decoding properties. , 1985, The EMBO journal.

[20]  W. Fuller,et al.  Conformation of the Anticodon Loop in tRNA , 1967, Nature.

[21]  U Lagerkvist,et al.  "Two out of three": an alternative method for codon reading. , 1978, Proceedings of the National Academy of Sciences of the United States of America.

[22]  D. Riddle,et al.  Frameshift suppression: a nucleotide addition in the anticodon of a glycine transfer RNA. , 1973, Nature: New biology.

[23]  J. Roth,et al.  A Salmonella frameshift suppressor that acts at runs of A residues in the messenger RNA. , 1978, Journal of molecular biology.

[24]  C. Caskey,et al.  Trinucleotide repeats and genome variation. , 1993, Current opinion in genetics & development.

[25]  D. Arquès,et al.  Analytical expression of the purine/pyrimidine autocorrelation function after and before random mutations. , 1994, Mathematical biosciences.

[26]  J. F. Atkins,et al.  Seven, eight and nine-membered anticodon loop mutants of tRNA(2Arg) which cause +1 frameshifting. Tolerance of DHU arm and other secondary mutations. , 1992, Journal of molecular biology.

[27]  L. Bossi,et al.  Four-base codons ACCA, ACCU and ACCC are recognized by frameshift suppressor sufJ , 1981, Cell.

[28]  D. Arquès,et al.  A model of DNA sequence evolution , 1990 .

[29]  A. Hizi,et al.  Characterization of mouse mammary tumor virus gag-pro gene products and the ribosomal frameshift site by protein sequencing. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[30]  M. Yarus,et al.  Transfer RNA mutation and the malleability of the genetic code. , 1994, Journal of molecular biology.

[31]  D. Arquès,et al.  A simulation of the genetic periodicities modulo 2 and 3 with processes of nucleotide insertions and deletions. , 1992, Journal of theoretical biology.

[32]  J. F. Atkins,et al.  tRNA hopping: enhancement by an expanded anticodon. , 1989, The EMBO journal.

[33]  C. Woese Molecular Mechanics of Translation : a Reciprocating Ratchet Mechanism , 1970, Nature.

[34]  M. Eigen,et al.  The Hypercycle: A principle of natural self-organization , 2009 .

[35]  P. Covello,et al.  On the evolution of RNA editing. , 1993, Trends in genetics : TIG.

[36]  H. Varmus,et al.  Signals for ribosomal frameshifting in the rous sarcoma virus gag-pol region , 1988, Cell.

[37]  F. Crick Codon--anticodon pairing: the wobble hypothesis. , 1966, Journal of molecular biology.

[38]  D. Prüfer,et al.  Ribosomal frameshifting in plants: a novel signal directs the -1 frameshift in the synthesis of the putative viral replicase of potato leafroll luteovirus. , 1992, The EMBO journal.

[39]  D. Arquès,et al.  Analytical expression of the purine/pyrimidine codon probability after and before random mutations , 1993 .

[40]  R. Weiss,et al.  Ribosomal frameshifting from -2 to +50 nucleotides. , 1990, Progress in nucleic acid research and molecular biology.

[41]  C J Michel,et al.  A purine-pyrimidine motif verifying an identical presence in almost all gene taxonomic groups. , 1987, Journal of theoretical biology.

[42]  Laura F. Landweber,et al.  RNA editing as a source of genetic variation , 1993, Nature.

[43]  V. Morell The puzzle of the triple repeats. , 1993, Science.

[44]  D. Riddle,et al.  Frameshift suppressors: II. Genetic mapping and dominance studies☆☆☆ , 1972 .

[45]  L Simpson,et al.  RNA editing--a novel genetic phenomenon? , 1990, Science.

[46]  S. Le,et al.  Thermodynamic stability and statistical significance of potential stem-loop structures situated at the frameshift sites of retroviruses. , 1989, Nucleic acids research.