Evolution of the genetic code: partial optimization of a random code for robustness to translation error in a rugged fitness landscape

BackgroundThe standard genetic code table has a distinctly non-random structure, with similar amino acids often encoded by codons series that differ by a single nucleotide substitution, typically, in the third or the first position of the codon. It has been repeatedly argued that this structure of the code results from selective optimization for robustness to translation errors such that translational misreading has the minimal adverse effect. Indeed, it has been shown in several studies that the standard code is more robust than a substantial majority of random codes. However, it remains unclear how much evolution the standard code underwent, what is the level of optimization, and what is the likely starting point.ResultsWe explored possible evolutionary trajectories of the genetic code within a limited domain of the vast space of possible codes. Only those codes were analyzed for robustness to translation error that possess the same block structure and the same degree of degeneracy as the standard code. This choice of a small part of the vast space of possible codes is based on the notion that the block structure of the standard code is a consequence of the structure of the complex between the cognate tRNA and the codon in mRNA where the third base of the codon plays a minimum role as a specificity determinant. Within this part of the fitness landscape, a simple evolutionary algorithm, with elementary evolutionary steps comprising swaps of four-codon or two-codon series, was employed to investigate the optimization of codes for the maximum attainable robustness. The properties of the standard code were compared to the properties of four sets of codes, namely, purely random codes, random codes that are more robust than the standard code, and two sets of codes that resulted from optimization of the first two sets. The comparison of these sets of codes with the standard code and its locally optimized version showed that, on average, optimization of random codes yielded evolutionary trajectories that converged at the same level of robustness to translation errors as the optimization path of the standard code; however, the standard code required considerably fewer steps to reach that level than an average random code. When evolution starts from random codes whose fitness is comparable to that of the standard code, they typically reach much higher level of optimization than the standard code, i.e., the standard code is much closer to its local minimum (fitness peak) than most of the random codes with similar levels of robustness. Thus, the standard genetic code appears to be a point on an evolutionary trajectory from a random point (code) about half the way to the summit of the local peak. The fitness landscape of code evolution appears to be extremely rugged, containing numerous peaks with a broad distribution of heights, and the standard code is relatively unremarkable, being located on the slope of a moderate-height peak.ConclusionThe standard code appears to be the result of partial optimization of a random code for robustness to errors of translation. The reason the code is not fully optimized could be the trade-off between the beneficial effect of increasing robustness to translation errors and the deleterious effect of codon series reassignment that becomes increasingly severe with growing complexity of the evolving system. Thus, evolution of the code can be represented as a combination of adaptation and frozen accident.ReviewersThis article was reviewed by David Ardell, Allan Drummond (nominated by Laura Landweber), and Rob Knight.Open Peer ReviewThis article was reviewed by David Ardell, Allan Drummond (nominated by Laura Landweber), and Rob Knight.

[1]  G. Gamow Possible Relation between Deoxyribonucleic Acid and Protein Structures , 1954, Nature.

[2]  S. Pestka,et al.  On the Coding of Genetic Information , 1963 .

[3]  W. Gilbert,et al.  STREPTOMYCIN, SUPPRESSION, AND THE CODE. , 1964, Proceedings of the National Academy of Sciences of the United States of America.

[4]  I. Weinstein,et al.  LACK OF FIDELITY IN THE TRANSLATION OF SYNTHETIC POLYRIBONUCLEOTIDES. , 1964, Proceedings of the National Academy of Sciences of the United States of America.

[5]  S. R. Pelc Correlation between Coding-Triplets and Amino-Acids , 1965, Nature.

[6]  C R Woese,et al.  The molecular basis for the genetic code. , 1966, Proceedings of the National Academy of Sciences of the United States of America.

[7]  C. Woese,et al.  On the fundamental nature and evolution of the genetic code. , 1966, Cold Spring Harbor symposia on quantitative biology.

[8]  F. Crick Codon--anticodon pairing: the wobble hypothesis. , 1966, Journal of molecular biology.

[9]  C. Woese The genetic code : the molecular basis for genetic expression , 1967 .

[10]  F. H. C. CRICK,et al.  Origin of the Genetic Code , 1967, Nature.

[11]  F. Crick Origin of the Genetic Code , 1967, Nature.

[12]  J. L. King,et al.  Non-Darwinian evolution. , 1969, Science.

[13]  C. Alff-Steinberger,et al.  The genetic code and error transmission. , 1969, Proceedings of the National Academy of Sciences of the United States of America.

[14]  T. Jukes,et al.  Arginine as an evolutionary intruder into protein synthesis. , 1973, Biochemical and biophysical research communications.

[15]  T. Jukes Possibilities for the Evolution of the Genetic Code from a Preceding Form , 1973, Nature.

[16]  J. Wong A co-evolution theory of the genetic code. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[17]  J. Wong,et al.  Role of minimization of chemical distances between amino acids in the evolution of the genetic code. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[18]  R. Swanson A unifying concept for the amino acid code. , 1984, Bulletin of mathematical biology.

[19]  M Yarus,et al.  A specific amino acid binding site composed of RNA. , 1988, Science.

[20]  J. Parker,et al.  Errors and alternatives in reading the universal genetic code. , 1989, Microbiological reviews.

[21]  S A Benner,et al.  Amino acid substitution during functionally constrained divergent evolution of protein sequences. , 1994, Protein engineering.

[22]  M. R. Capobianco,et al.  On the optimization of the physicochemical distances between amino acids in the evolution of the genetic code. , 1994, Journal of theoretical biology.

[23]  S. Brunak,et al.  Neural network model of the genetic code is strongly correlated to the GES scale of amino acid transfer free energies. , 1994, Journal of molecular biology.

[24]  Michael Yarus,et al.  Amino Acids as RNA Ligands: A Direct-RNA-Template Theory for the Code's Origin , 1998, Journal of Molecular Evolution.

[25]  J. Bieker,et al.  Regulation of Erythroid Krüppel-like Factor (EKLF) Transcriptional Activity by Phosphorylation of a Protein Kinase Casein Kinase II Site within Its Interaction Domain* , 1998, The Journal of Biological Chemistry.

[26]  L. Hurst,et al.  The Genetic Code Is One in a Million , 1998, Journal of Molecular Evolution.

[27]  M. Di Giulio,et al.  The Historical Factor: The Biosynthetic Relationships Between Amino Acids and Their Physicochemical Properties in the Origin of the Genetic Code , 1998, Journal of Molecular Evolution.

[28]  S. J. Freeland,et al.  Load minimization of the genetic code: history does not explain the pattern , 1998, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[29]  David H. Ardell,et al.  On Error Minimization in a Sequential Origin of the Standard Genetic Code , 1998, Journal of Molecular Evolution.

[30]  B. K. Davis Evolution of the genetic code. , 1999, Progress in biophysics and molecular biology.

[31]  M Di Giulio The Coevolution Theory of the Origin of the Genetic Code , 1999, Journal of molecular evolution.

[32]  Laurence D. Hurst,et al.  A Quantitative Measure of Error Minimization in the Genetic Code , 1999, Journal of Molecular Evolution.

[33]  L F Landweber,et al.  Selection, history and chemistry: the three faces of the genetic code. , 1999, Trends in biochemical sciences.

[34]  M Yarus,et al.  RNA-ligand chemistry: a testable source for the genetic code. , 2000, RNA.

[35]  L. Hurst,et al.  Early fixation of an optimal genetic code. , 2000, Molecular biology and evolution.

[36]  L F Landweber,et al.  Measuring adaptation within the genetic code. , 2000, Trends in biochemical sciences.

[37]  M Di Giulio Genetic code origin and the strength of natural selection. , 2000, Journal of theoretical biology.

[38]  M Di Giulio The origin of the genetic code cannot be studied using measurements based on the PAM matrix because this matrix reflects the code itself, making any such analyses tautologous. , 2001, Journal of theoretical biology.

[39]  Guy Sella,et al.  On the Evolution of Redundancy in Genetic Codes , 2001, Journal of Molecular Evolution.

[40]  V. Ramakrishnan,et al.  Recognition of Cognate Transfer RNA by the 30S Ribosomal Subunit , 2001, Science.

[41]  Serge Massar,et al.  Optimality of the genetic code with respect to protein stability and amino-acid frequencies , 2001, Genome Biology.

[42]  G. Sella,et al.  The Impact of Message Mutation on the Fitness of a Genetic Code , 2002, Journal of Molecular Evolution.

[43]  Guy Sella,et al.  No accident: genetic codes freeze in error-correcting patterns of the standard genetic code. , 2002, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[44]  V. Chechetkin,et al.  Block structure and stability of the genetic code. , 2003, Journal of theoretical biology.

[45]  Chen-Tseh Zhu,et al.  Codon Usage Decreases the Error Minimization Within the Genetic Code , 2003, Journal of Molecular Evolution.

[46]  V. Ramakrishnan,et al.  Insights into the decoding mechanism from recent ribosome structures. , 2003, Trends in biochemical sciences.

[47]  T. Miyata,et al.  On the antisymmetry of the amino acid code table , 1980, Origins of life.

[48]  H. Goodarzi,et al.  On the optimality of the genetic code, with the consideration of termination codons. , 2004, Bio Systems.

[49]  Guy Sella,et al.  The Coevolution of Genes and Genetic Codes: Crick’s Frozen Accident Revisited , 2006, Journal of Molecular Evolution.

[50]  Marco Archetti,et al.  Codon Usage Bias and Mutation Constraints Reduce the Level of ErrorMinimization of the Genetic Code , 2004, Journal of Molecular Evolution.

[51]  Nick Goldman,et al.  Further results on error minimization in the genetic code , 1993, Journal of Molecular Evolution.

[52]  J. Caporaso,et al.  Error Minimization and Coding Triplet/Binding Site Associations Are Independent Features of the Canonical Genetic Code , 2005, Journal of Molecular Evolution.

[53]  S. Freeland,et al.  The Case for an Error Minimizing Standard Genetic Code , 2003, Origins of life and evolution of the biosphere.

[54]  R. Knight,et al.  Origins of the genetic code: the escaped triplet theory. , 2005, Annual review of biochemistry.

[55]  H. Najafabadi,et al.  On the optimality of the genetic code, with the consideration of coevolution theory by comparison of prominent cost measure matrices. , 2005, Journal of theoretical biology.

[56]  Rob Knight,et al.  Do universal codon-usage patterns minimize the effects of mutation and translation error? , 2005, Genome Biology.

[57]  Massimo Di Giulio,et al.  The origin of the genetic code: theories and their relationships, a review. , 2005, Bio Systems.

[58]  V. Ramakrishnan,et al.  First published online as a Review in Advance on February 25, 2005 STRUCTURAL INSIGHTS INTO TRANSLATIONAL , 2022 .

[59]  Noorossadat Torabi,et al.  On the coevolution of genes and genetic code. , 2005, Gene.

[60]  P. B. Milanov,et al.  On the optimality of the genetic code , 1986, Origins of life and evolution of the biosphere.

[61]  On the origin of the translation system and the genetic code in the RNA world by means of natural selection, exaptation, and subfunctionalization , 2007, Biology Direct.

[62]  Dieter Söll,et al.  Natural expansion of the genetic code. , 2007, Nature chemical biology.

[63]  P. Farabaugh,et al.  The frequency of translational misreading errors in E. coli is largely determined by tRNA competition. , 2006, RNA.