Codon Usage Bias and Mutation Constraints Reduce the Level of ErrorMinimization of the Genetic Code

Studies on the origin of the genetic code compare measures of the degree of error minimization of the standard code with measures produced by random variant codes but do not take into account codon usage, which was probably highly biased during the origin of the code. Codon usage bias could play an important role in the minimization of the chemical distances between amino acids because the importance of errors depends also on the frequency of the different codons. Here I show that when codon usage is taken into account, the degree of error minimization of the standard code may be dramatically reduced, and shifting to alternative codes often increases the degree of error minimization. This is especially true with a high CG content, which was probably the case during the origin of the code. I also show that the frequency of codes that perform better than the standard code, in terms of relative efficiency, is much higher in the neighborhood of the standard code itself, even when not considering codon usage bias; therefore alternative codes that differ only slightly from the standard code are more likely to evolve than some previous analyses suggested. My conclusions are that the standard genetic code is far from being an optimum with respect to error minimization and must have arisen for reasons other than error minimization.

[1]  M. Di Giulio The universal ancestor lived in a thermophilic or hyperthermophilic environment. , 2000, Journal of theoretical biology.

[2]  J. Wong A co-evolution theory of the genetic code. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[3]  M Di Giulio Genetic code origin and the strength of natural selection. , 2000, Journal of theoretical biology.

[4]  D. G. George,et al.  Mutation data matrix and its uses. , 1990, Methods in enzymology.

[5]  M. Di Giulio,et al.  The extension reached by the minimization of the polarity distances during the evolution of the genetic code. , 1989, Journal of molecular evolution.

[6]  W. Fitch,et al.  The phylogeny of tRNA sequences provides evidence for ambiguity reduction in the origin of the genetic code. , 1987, Cold Spring Harbor symposia on quantitative biology.

[7]  The coevolution theory of the origin of the genetic code , 2004 .

[8]  C. Woese,et al.  Bacterial evolution , 1987, Microbiological reviews.

[9]  C. Woese The universal ancestor. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[10]  C. Epstein,et al.  Role of the Amino-Acid ‘Code’ and of Selection for Conformation in the Evolution of Proteins , 1966, Nature.

[11]  D. Haydon,et al.  The Genetic Code: What Is It Good For? An Analysis of the Effects of Selection Pressures on Genetic Codes , 1999, Journal of Molecular Evolution.

[12]  John P. Overington,et al.  Environment‐specific amino acid substitution tables: Tertiary templates and prediction of protein folds , 1992, Protein science : a publication of the Protein Society.

[13]  C. Woese,et al.  Evolution of the genetic code , 2004, The Science of Nature.

[14]  S A Benner,et al.  Amino acid substitution during functionally constrained divergent evolution of protein sequences. , 1994, Protein engineering.

[15]  A. Mclachlan Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c 551 . , 1971, Journal of molecular biology.

[16]  L F Landweber,et al.  Measuring adaptation within the genetic code. , 2000, Trends in biochemical sciences.

[17]  Massimo Di Giulio,et al.  The origin of the genetic code cannot be studied using measurements based on the PAM matrix because this matrix reflects the code itself, making any such analyses tautologous. , 2001 .

[18]  F. Wright The 'effective number of codons' used in a gene. , 1990, Gene.

[19]  R S Root-Bernstein On the origin of the genetic code. , 1982, Journal of theoretical biology.

[20]  C. Woese,et al.  Were the original eubacteria thermophiles? , 1987, Systematic and applied microbiology.

[21]  T. M. Sonneborn Degeneracy of the Genetic Code: Extent, Nature, and Genetic Implications , 1965 .

[22]  M. O. Dayhoff,et al.  22 A Model of Evolutionary Change in Proteins , 1978 .

[23]  M. Schluchter,et al.  Evolutionary conservation of both the hydrophilic and hydrophobic nature of transmembrane residues. , 1995, Journal of theoretical biology.

[24]  J. Risler,et al.  Amino acid substitutions in structurally related proteins. A pattern recognition approach. Determination of a new and efficient scoring matrix. , 1988, Journal of molecular biology.

[25]  Laurence D. Hurst,et al.  A quantitative measure of error minimization in the genetic code , 1991, Journal of Molecular Evolution.

[26]  L. Hurst,et al.  Early fixation of an optimal genetic code. , 2000, Molecular biology and evolution.

[27]  L F Landweber,et al.  Selection, history and chemistry: the three faces of the genetic code. , 1999, Trends in biochemical sciences.

[28]  Massimo Di Giulio On the origin of the genetic code. , 1992 .

[29]  L. Hurst,et al.  The Genetic Code Is One in a Million , 1998, Journal of Molecular Evolution.