The hypercube structure of the genetic code explains conservative and non-conservative aminoacid substitutions in vivo and in vitro.

A representation of the genetic code as a six-dimensional Boolean hypercube is described. This structure is the result of the hierarchical order of the interaction energies of the bases in codon-anticodon recognition. In this paper it is applied to study molecular evolution in vivo and in vitro. In the first case we compared aligned positions in homologous protein sequences and found two different behaviors: (a) There are sites in which the different amino acids may be explained by one or two 'attractor nodes' (coding for the dominating amino acid(s)) and their one-bit neighbors in the codon hypercube; and (b) There are sites in which the amino acids correspond to codons located in closed paths in the hypercube. In the second case we studied the 'Sexual PCR'1 experiment described by Stemmer [Stemmer (1994)] and found that the success of this combination of usual PCR and recombination is in part due to the Gray code structure of the genetic code.

[1]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .

[2]  M. Seashore The Genetic Mechanism and the Origin of Life , 1980, The Yale Journal of Biology and Medicine.

[3]  Zellig S. Harris,et al.  Language and information , 1988 .

[4]  J. Wong A co-evolution theory of the genetic code. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[5]  M. Kanehisa,et al.  Cluster analysis of amino acid indices for prediction of protein structure and function. , 1988, Protein engineering.

[6]  George P Smith The progeny of sexual PCR , 1994, Nature.

[7]  J. Jungck,et al.  Group graph of the genetic code. , 1979, The Journal of heredity.

[8]  Masatoshi Nei,et al.  Evolution of genes and proteins. , 1983 .

[9]  J. Wong The evolution of a universal genetic code. , 1976, Proceedings of the National Academy of Sciences of the United States of America.

[10]  U. Lagerkvist Unorthodox codon reading and the evolution of the genetic code , 1981, Cell.

[11]  E. Kabat,et al.  Sequences of proteins of immunological interest , 1991 .

[12]  A. Lesk,et al.  Determinants of a protein fold. Unique features of the globin amino acid sequences. , 1987, Journal of molecular biology.

[13]  R. Swanson A unifying concept for the amino acid code. , 1984, Bulletin of mathematical biology.

[14]  F. Crick Codon--anticodon pairing: the wobble hypothesis. , 1966, Journal of molecular biology.

[15]  Werner Ebeling,et al.  Physik der Selbstorganisation und Evolution , 1984 .

[16]  R. Grantham Amino Acid Difference Formula to Help Explain Protein Evolution , 1974, Science.

[17]  W. Stemmer Rapid evolution of a protein in vitro by DNA shuffling , 1994, Nature.

[18]  M. Jiménez-Montaño On the syntactic structure and redundancy distribution of the genetic code. , 1994, Bio Systems.

[19]  R. Curnow,et al.  Origin of noncoding DNA sequences: molecular fossils of genome evolution. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[20]  M. Kainosho,et al.  Frontiers of NMR in Molecular Biology , 1992 .

[21]  J. David Schaffer,et al.  Representation and Hidden Bias: Gray vs. Binary Coding for Genetic Algorithms , 1988, ML.

[22]  U Lagerkvist,et al.  "Two out of three": an alternative method for codon reading. , 1978, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Geoffrey J. Barton,et al.  Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation , 1993, Comput. Appl. Biosci..

[24]  M. Guéron,et al.  Applications of imino proton exchange to nucleic acid kinetics and structures , 1990 .