Origin and evolution of the genetic code: The universal enigma

The genetic code is nearly universal, and the arrangement of the codons in the standard codon table is highly nonrandom. The three main concepts on the origin and evolution of the code are the stereochemical theory, according to which codon assignments are dictated by physicochemical affinity between amino acids and the cognate codons (anticodons); the coevolution theory, which posits that the code structure coevolved with amino acid biosynthesis pathways; and the error minimization theory, under which selection to minimize the adverse effect of point mutations and translation errors was the principal factor in the code's evolution. These theories are not mutually exclusive and are also compatible with the frozen accident hypothesis, that is, the notion that the standard code might have no special properties but was fixed simply because all extant life forms share a common ancestor, with subsequent changes to the code mostly precluded by the deleterious effect of codon reassignment. Mathematical analysis of the structure and possible evolutionary trajectories of the code shows that it is highly robust to translational misreading but that numerous more robust codes exist, so the standard code could potentially have evolved from a random code via a short sequence of codon series reassignments. Thus, much of the evolution that led to the standard code could be a combination of frozen accident with selection for error minimization, although contributions from coevolution of the code with metabolic pathways and from weak affinities between amino acids and nucleotide triplets cannot be ruled out. However, such scenarios for code evolution are based on formal schemes whose relevance to the actual primordial evolution is uncertain. A real understanding of the code's origin and evolution is likely to be attainable only in conjunction with a credible scenario for the evolution of the coding principle itself and of the translation system. © 2008 IUBMB. IUBMB Life, 61(2): 99–111, 2009.
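The robustness comparison described above can be illustrated with a minimal sketch. It is not the analysis from the paper: the cost function (mean squared difference in Kyte-Doolittle hydropathy over all single-nucleotide substitutions between sense codons) is an assumption chosen for illustration, and published studies often use other measures such as polar requirement. The names `error_cost`, `shuffled_code`, and `fraction_better` are hypothetical. Random codes are generated by permuting the 20 amino acids among the synonymous codon blocks of the standard code while keeping stop codons fixed.

```python
import itertools
import random

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons listed in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in itertools.product(BASES, repeat=3)]
STANDARD = dict(zip(CODONS, AMINO))

# Kyte-Doolittle hydropathy index; using it as the cost measure is an
# assumption of this sketch, not a choice taken from the source text.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def error_cost(code):
    """Mean squared hydropathy change over single-base substitutions.

    Substitutions to or from stop codons are skipped.
    """
    total, count = 0.0, 0
    for codon, aa in code.items():
        if aa == "*":
            continue
        for pos in range(3):
            for base in BASES:
                if base == codon[pos]:
                    continue
                neighbor = codon[:pos] + base + codon[pos + 1:]
                other = code[neighbor]
                if other == "*":
                    continue
                total += (HYDROPATHY[aa] - HYDROPATHY[other]) ** 2
                count += 1
    return total / count


def shuffled_code(rng):
    """Permute amino acids among synonymous blocks; stops stay in place."""
    aas = sorted(HYDROPATHY)
    permuted = aas[:]
    rng.shuffle(permuted)
    mapping = dict(zip(aas, permuted))
    mapping["*"] = "*"
    return {codon: mapping[aa] for codon, aa in STANDARD.items()}


def fraction_better(n_samples=1000, seed=0):
    """Fraction of shuffled codes with a lower cost than the standard code."""
    rng = random.Random(seed)
    baseline = error_cost(STANDARD)
    better = sum(
        error_cost(shuffled_code(rng)) < baseline for _ in range(n_samples)
    )
    return better / n_samples


if __name__ == "__main__":
    print("standard code cost:", error_cost(STANDARD))
    print("fraction of shuffled codes that are more robust:", fraction_better())
```

A small value returned by `fraction_better` would be in line with the claim that the standard code is highly robust, while any value above zero would be in line with the claim that more robust codes exist. Both readings depend on the assumed cost measure and on the restricted set of random codes sampled here.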
