Second codon positions of genes and the secondary structures of proteins. Relationships and implications for the origin of the genetic code.

The nucleotide frequencies in the second codon positions of genes are remarkably different for the coding regions that correspond to different secondary structures in the encoded proteins, namely, helix, beta-strand and aperiodic structures. Indeed, hydrophobic and hydrophilic amino acids are encoded by codons having U or A, respectively, in their second position. Moreover, the beta-strand structure is strongly hydrophobic, while aperiodic structures contain more hydrophilic amino acids. The relationship between nucleotide frequencies and protein secondary structures is associated not only with the physico-chemical properties of these structures but also with the organisation of the genetic code. In fact, this organisation seems to have evolved so as to preserve the secondary structures of proteins by preventing deleterious amino acid substitutions that could modify the physico-chemical properties required for an optimal structure.

[1]  T C Ghosh,et al.  Studies on the relationships between the synonymous codon usage and protein secondary structural units. , 2000, Biochemical and biophysical research communications.

[2]  M. Yarus An RNA-amino acid complex and the origin of the genetic code. , 1991, The New biologist.

[3]  R L Jernigan,et al.  Short‐range conformational energies, secondary structure propensities, and recognition of correct sequence‐structure matches , 1997, Proteins.

[4]  P. Y. Chou,et al.  Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins. , 1974, Biochemistry.

[5]  Francis Crick,et al.  The Genetic Code , 1962 .

[6]  F. Taylor,et al.  The code within the codons. , 1989, Bio Systems.

[7]  CHARLES J. EPSTEIN,et al.  Non-randomness of Ammo-acid Changes in the Evolution of Homologous Proteins , 1967, Nature.

[8]  S. Brunak,et al.  Neural network model of the genetic code is strongly correlated to the GES scale of amino acid transfer free energies. , 1994, Journal of molecular biology.

[9]  Stephen Neidle,et al.  An Integrated Sequence-Structure Database incorporating matching mRNA sequence, amino acid sequence and protein three-dimensional structure data , 1998, Nucleic Acids Res..

[10]  L. Orgel,et al.  β structures of alternating polypeptides and their possible prebiotic significance , 1975, Nature.

[11]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[12]  Jianzhi Zhang,et al.  Rates of Conservative and Radical Nonsynonymous Nucleotide Substitutions in Mammalian Nuclear Genes , 2000, Journal of Molecular Evolution.

[13]  Carl R. Woese,et al.  The Present Status of the Genetic Code , 1967 .

[14]  J W Prothero,et al.  Correlation between the distribution of amino acids and alpha helices. , 1966, Biophysical journal.

[15]  Massimo Di Giulio On the origin of the genetic code. , 1992 .

[16]  T P Chirpich,et al.  Rates of protein evolution: a function of amino acid composition. , 1975, Science.

[17]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[18]  M. Levitt Conformational preferences of amino acids in globular proteins. , 1978, Biochemistry.

[19]  R S Root-Bernstein On the origin of the genetic code. , 1982, Journal of theoretical biology.

[20]  B. Havsteen A study of the correlation between the amino acid composition and the helical content of proteins. , 1966, Journal of theoretical biology.

[21]  A. Szent-Gyorgyi,et al.  Role of proline in polypeptide chain configuration of proteins. , 1957, Science.

[22]  R. Wolfenden,et al.  Water, protein folding, and the genetic code. , 1979, Science.

[23]  S R Jordan,et al.  Structural convergence during protein evolution. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Beta turns in early evolution: chirality, genetic code, and biosynthetic pathways. , 1987, Cold Spring Harbor symposia on quantitative biology.

[25]  M. Di Giulio The beta-sheets of proteins, the biosynthetic relationships between amino acids, and the origin of the genetic code. , 1996, Origins of life and evolution of the biosphere : the journal of the International Society for the Study of the Origin of Life.

[26]  C R Woese,et al.  The molecular basis for the genetic code. , 1966, Proceedings of the National Academy of Sciences of the United States of America.

[27]  R. Grantham Amino Acid Difference Formula to Help Explain Protein Evolution , 1974, Science.

[28]  A. Guzzo,et al.  The influence of amino-acid sequence on protein structure. , 1965, Biophysical journal.

[29]  D. Goldsack Relation of amino acid composition and the moffitt parameters to the secondary structure of proteins , 1969, Biopolymers.

[30]  D. A. Cook,et al.  The relation between amino acid sequence and protein conformation. , 1967, Journal of molecular biology.