On Error Minimization in a Sequential Origin of the Standard Genetic Code

Abstract. Distances between amino acids were derived from the polar requirement measure of amino acid polarity and Benner and co-workers' (1994) 74-100 PAM matrix. These distances were used to examine the average effects of amino acid substitutions due to single-base errors in the standard genetic code and equally degenerate randomized variants of the standard code. Second-position transitions conserved all distances on average, an order of magnitude more than did second-position transversions. In contrast, first-position transitions and transversions were about equally conservative. In comparison with randomized codes, second-position transitions in the standard code significantly conserved mean square differences in polar requirement and mean Benner matrix-based distances, but mean absolute value differences in polar requirement were not significantly conserved. The discrepancy suggests that these commonly used distance measures may be insufficient for strict hypothesis testing without more information. The translational consequences of single-base errors were then examined in different codon contexts, and similarities between these contexts explored with a hierarchical cluster analysis. In one cluster of codon contexts corresponding to the RNY and GNR codons, second-position transversions between C and G and transitions between C and U were most conservative of both polar requirement and the matrix-based distance. In another cluster of codon contexts, second-position transitions between A and G were most conservative. Despite the claims of previous authors to the contrary, it is shown theoretically that the standard code may have been shaped by position-invariant forces such as mutation and base content. 
These forces may have left heterogeneous signatures in the code because of differences in translational fidelity by codon position. A scenario for the origin of the code is presented wherein selection for error minimization could have occurred multiple times in disjoint parts of the code through a phyletic process of competition between lineages. This process permits error minimization without the disruption of previously useful messages, and does not predict that the code is optimally error-minimizing with respect to modern error. Instead, the code may be a record of genetic process and patterns of mutation before the radiation of modern organisms and organelles.
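The kind of comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure. It assumes that (a) the polar requirement values are those commonly tabulated from Woese and co-workers, which should be checked against the primary source, (b) substitutions to or from stop codons are skipped while synonymous substitutions count as zero distance, and (c) "equally degenerate" randomization means permuting the 20 amino acids among the existing synonymous codon sets. The names `mean_sq_diff`, `shuffle_code`, and `fraction_as_conservative` are hypothetical.

```python
import itertools
import random

BASES = "UCAG"
# Standard genetic code, codons in UCAG order (third base varies fastest); '*' = stop.
AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD = {"".join(c): aa for c, aa in zip(itertools.product(BASES, repeat=3), AAS)}

# Polar requirement per amino acid (assumed values; verify against the primary source).
POLAR_REQ = {
    "A": 7.0, "R": 9.1, "N": 10.0, "D": 13.0, "C": 4.8, "Q": 8.6, "E": 12.5,
    "G": 7.9, "H": 8.4, "I": 4.9, "L": 4.9, "K": 10.1, "M": 5.3, "F": 5.0,
    "P": 6.6, "S": 7.5, "T": 6.6, "W": 5.2, "Y": 5.4, "V": 5.6,
}

TRANSITIONS = {frozenset("UC"), frozenset("AG")}  # pyrimidine and purine pairs


def mean_sq_diff(code, scale, position, kind):
    """Mean squared difference in `scale` over single-base errors.

    position: 0, 1 or 2 (codon position); kind: "transition" or "transversion".
    Errors involving a stop codon are skipped (an assumption of this sketch).
    """
    total, count = 0.0, 0
    for codon, aa in code.items():
        if aa == "*":
            continue
        for base in BASES:
            if base == codon[position]:
                continue
            is_transition = frozenset((base, codon[position])) in TRANSITIONS
            if is_transition != (kind == "transition"):
                continue
            mutant = codon[:position] + base + codon[position + 1:]
            if code[mutant] == "*":
                continue
            total += (scale[aa] - scale[code[mutant]]) ** 2
            count += 1
    return total / count


def shuffle_code(code, rng):
    """Permute amino acids among the synonymous codon sets; stops stay fixed."""
    amino_acids = sorted(set(code.values()) - {"*"})
    mapping = dict(zip(amino_acids, rng.sample(amino_acids, len(amino_acids))))
    mapping["*"] = "*"
    return {codon: mapping[aa] for codon, aa in code.items()}


def fraction_as_conservative(code, scale, position, kind, trials=1000, seed=0):
    """Fraction of randomized codes scoring at or below the given code."""
    rng = random.Random(seed)
    observed = mean_sq_diff(code, scale, position, kind)
    hits = sum(
        mean_sq_diff(shuffle_code(code, rng), scale, position, kind) <= observed
        for _ in range(trials)
    )
    return hits / trials
```

Calling `mean_sq_diff(STANDARD, POLAR_REQ, 1, "transition")` and the same with `"transversion"` gives the two second-position averages to compare, and `fraction_as_conservative` estimates how often a randomized code does at least as well. Swapping the squared difference for an absolute difference, or `POLAR_REQ` for a matrix-based distance, would give the other measures mentioned in the abstract.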

[1]  J. Davies Streptomycin and the genetic code. , 1966, Cold Spring Harbor symposia on quantitative biology.

[2]  R. Wolfenden,et al.  Water, protein folding, and the genetic code. , 1979, Science.

[3]  H. Khorana,et al.  A further study of misreading of codons induced by streptomycin and neomycin using ribopolynucleotides containing two nucleotides in alternating sequence as templates. , 1966, Journal of molecular biology.

[4]  S. Brunak,et al.  Neural network model of the genetic code is strongly correlated to the GES scale of amino acid transfer free energies. , 1994, Journal of molecular biology.

[5]  V. Sitaramam Genetic code preferentially conserves long‐range interactions among the amino acids , 1989, FEBS letters.

[6]  M. Hirabayashi,et al.  Nature of magnesium-induced miscoding. , 1969, Journal of molecular biology.

[7]  D. Turner,et al.  Thermodynamics of base pairing. , 1996, Current opinion in structural biology.

[8]  W. Fitch Evidence suggesting a partial, internal duplication in the ancestral gene for heme-containing globins. , 1966, Journal of molecular biology.

[9]  A. Sancar,et al.  Effect of base, pentose, and phosphodiester backbone structures on binding and repair of pyrimidine dimers by Escherichia coli DNA photolyase. , 1991, Biochemistry.

[10]  P. Strigini,et al.  Analysis of specific misreading in Escherichia coli. , 1973, Journal of molecular biology.

[11]  H. Nürnberg The Hypercycle. A Principle of Natural Self Organization. , 1981 .

[12]  F. Crick Origin of the Genetic Code , 1967, Nature.

[13]  R. Grantham Amino Acid Difference Formula to Help Explain Protein Evolution , 1974, Science.

[14]  T. Jukes,et al.  Arginine as an evolutionary intruder into protein synthesis. , 1973, Biochemical and biophysical research communications.

[15]  Laurence D. Hurst,et al.  A Quantitative Measure of Error Minimization in the Genetic Code , 1999, Journal of Molecular Evolution.

[16]  F. Taylor,et al.  The code within the codons. , 1989, Bio Systems.

[17]  J. Wong A co-evolution theory of the genetic code. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[18]  S. Pestka,et al.  On the Coding of Genetic Information , 1963 .

[19]  John Maynard Smith,et al.  The major evolutionary transitions , 1995, Nature.

[20]  R. A. Goldstein,et al.  Mutation matrices and physical‐chemical properties: Correlations and implications , 1997, Proteins.

[21]  S. Altschul Amino acid substitution matrices from an information theoretic perspective , 1991, Journal of Molecular Biology.

[22]  W. Fitch,et al.  The phylogeny of tRNA sequences provides evidence for ambiguity reduction in the origin of the genetic code. , 1987, Cold Spring Harbor symposia on quantitative biology.

[23]  W. Gilbert,et al.  STREPTOMYCIN, SUPPRESSION, AND THE CODE. , 1964, Proceedings of the National Academy of Sciences of the United States of America.

[24]  S. Rodin,et al.  The presence of codon-anticodon pairs in the acceptor stem of tRNAs. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[25]  M. Di Giulio,et al.  Was it an ancient gene codifying for a hairpin RNA that, by means of direct duplication, gave rise to the primitive tRNA molecule? , 1995, Journal of theoretical biology.

[26]  A. Goldberg,et al.  Genetic Code: Aspects of Organization , 1966, Science.

[27]  D. Nègre,et al.  Differential pattern of misreading induced by streptomycin in vitro. , 1988, Journal of molecular biology.

[28]  M. R. Capobianco,et al.  On the optimization of the physicochemical distances between amino acids in the evolution of the genetic code. , 1994, Journal of theoretical biology.

[29]  C. Epstein,et al.  Role of the Amino-Acid ‘Code’ and of Selection for Conformation in the Evolution of Proteins , 1966, Nature.

[30]  S. A. Benner,et al.  Amino acid substitution during functionally constrained divergent evolution of protein sequences. , 1994, Protein engineering.

[31]  David P. Bartel,et al.  RNA-catalysed RNA polymerization using nucleoside triphosphates , 1996, Nature.

[32]  J. Parker,et al.  Errors and alternatives in reading the universal genetic code. , 1989, Microbiological reviews.

[33]  M. Kanehisa,et al.  Cluster analysis of amino acid indices for prediction of protein structure and function. , 1988, Protein engineering.

[34]  M. Di Giulio,et al.  The extension reached by the minimization of the polarity distances during the evolution of the genetic code. , 1989, Journal of molecular evolution.

[35]  S. Osawa,et al.  Recent evidence for evolution of the genetic code , 1992, Microbiological reviews.

[36]  J. L. King,et al.  Non-Darwinian evolution. , 1969, Science.

[37]  R. Swanson A unifying concept for the amino acid code. , 1984, Bulletin of mathematical biology.

[38]  M. Di Giulio Some aspects of the organization and evolution of the genetic code. , 1989, Journal of molecular evolution.

[39]  M. Eigen,et al.  The Hypercycle: A principle of natural self-organization , 2009 .

[40]  D. Turner,et al.  A periodic table of symmetric tandem mismatches in RNA. , 1995, Biochemistry.

[41]  S. Miller,et al.  Which organic compounds could have occurred on the prebiotic earth? , 1987, Cold Spring Harbor Symposia on Quantitative Biology.

[42]  C. Woese,et al.  On the fundamental nature and evolution of the genetic code. , 1966, Cold Spring Harbor symposia on quantitative biology.

[43]  C. Alff-Steinberger,et al.  The genetic code and error transmission. , 1969, Proceedings of the National Academy of Sciences of the United States of America.

[44]  M. Kanehisa,et al.  Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. , 1996, Protein engineering.

[45]  D. Turner,et al.  RNA structure prediction. , 1988, Annual review of biophysics and biophysical chemistry.

[46]  D. Crothers,et al.  On the physical basis for ambiguity in genetic coding interactions. , 1978, Proceedings of the National Academy of Sciences of the United States of America.

[47]  S. Kuge,et al.  Strong inclination toward transition mutation in nucleotide substitutions by poliovirus replicase. , 1989, Journal of molecular biology.

[48]  T. M. Sonneborn Degeneracy of the Genetic Code: Extent, Nature, and Genetic Implications , 1965 .

[49]  B. K. Davis Evolution of the genetic code. , 1999, Progress in biophysics and molecular biology.

[50]  J. Wong,et al.  Role of minimization of chemical distances between amino acids in the evolution of the genetic code. , 1980, Proceedings of the National Academy of Sciences of the United States of America.