Specific correlations between relative synonymous codon usage and protein secondary structure.

We found significant species-specific correlations between the use of two synonymous codons and protein secondary structure units by comparing the three-dimensional structures of human and Escherichia coli proteins with their mRNA sequences. The correlations are not explained by codon-context, expression level, GC/AU content, or positional effects. The E. coli correlation is between Asn AAC and the C-terminal regions of beta-sheet segments; it may result from selection for translational accuracy, suggesting the hypothesis that downstream Asn residues are important for beta-sheet formation. The correlation in human proteins is between Asp GAU and the N termini of alpha-helices; it may be important for eukaryote-specific sequential, cotranslational folding. The kingdom-specific correlations may reflect kingdom-specific differences in translational mechanisms. The correlations may help identify residues that are important for secondary structure formation, be useful in secondary structure prediction algorithms, and have implications for recombinant gene expression.

[1]  B. Friguet,et al.  Folding on the ribosome of Escherichia coli tryptophan synthase beta subunit nascent chains probed with a conformation-dependent monoclonal antibody. , 1992, Journal of molecular biology.

[2]  A. Brown,et al.  The efficiency of folding of some proteins is increased by controlled rates of translation in vivo. A hypothesis. , 1987, Journal of molecular biology.

[3]  S. Karlin,et al.  What drives codon choices in human genes? , 1996, Journal of molecular biology.

[4]  G Humphreys,et al.  Codon usage can affect efficiency of translation of genes in Escherichia coli. , 1984, Nucleic acids research.

[5]  R. Abagyan,et al.  Biased probability Monte Carlo conformational searches and electrostatic calculations for peptides and proteins. , 1994, Journal of molecular biology.

[6]  F. Hartl Molecular chaperones in cellular protein folding , 1996, Nature.

[7]  S. Ochoa,et al.  Translation of the genetic message. VII. Role of initiation factors in formation of the chain initiation complex with Escherichia coli ribosomes. , 1968, Archives of biochemistry and biophysics.

[8]  K. Gardiner Human genome organization. , 1995, Current opinion in genetics & development.

[9]  W. Gilbert,et al.  The exon theory of genes. , 1987, Cold Spring Harbor symposia on quantitative biology.

[10]  D. Crothers,et al.  On the physical basis for ambiguity in genetic coding interactions. , 1978, Proceedings of the National Academy of Sciences of the United States of America.

[11]  G. W. Hatfield,et al.  Nonrandom utilization of codon pairs in Escherichia coli. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Søren Brunak,et al.  Protein structure and the sequential structure of mRNA: α‐Helix and β‐sheet signals at the nucleotide level , 1996 .

[13]  M. Sørensen,et al.  Absolute in vivo translation rates of individual codons in Escherichia coli. The two glutamic acid codons GAA and GAG are translated with a threefold difference in rate. , 1991, Journal of molecular biology.

[14]  D. McPherson Codon preference reflects mistranslational constraints: a proposal. , 1988, Nucleic acids research.

[15]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[16]  Manolo Gouy,et al.  Codon catalog usage is a genome strategy modulated for gene expressivity , 1981, Nucleic Acids Res..

[17]  W. Gilbert,et al.  Intron phase correlations and the evolution of the intron/exon structure of genes. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[18]  The yeast pyruvate kinase gene does not contain a string of non‐ preferred codons: Revised nucleotide sequence , 1989, FEBS letters.

[19]  K. Nishikawa,et al.  The difference in the type of codon-anticodon base pairing at the ribosomal P-site is one of the determinants of the translational rate. , 1990, Journal of biochemistry.

[20]  R Nussinov,et al.  Some rules in the ordering of nucleotides in the DNA. , 1980, Nucleic acids research.

[21]  F. Hartl,et al.  Recombination of protein domains facilitated by co-translational folding in eukaryotes , 1997, Nature.

[22]  P Argos,et al.  Protein secondary structural types are differentially coded on messenger RNA , 1996, Protein science : a publication of the Protein Society.

[23]  S. Nishimura,et al.  Possible anticodon sequences of tRNA His , tRNA Asm , and tRNA Asp from Escherichia coli B. Universal presence of nucleoside Q in the first postion of the anticondons of these transfer ribonucleic acids. , 1972, Biochemistry.

[24]  M. Gouy,et al.  Codon usage in bacteria: correlation with gene expressivity. , 1982, Nucleic acids research.

[25]  J. Richardson,et al.  Amino acid preferences for specific locations at the ends of alpha helices. , 1988, Science.

[26]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[27]  Stephen Neidle,et al.  Non‐random usage of ‘degenerate’ codons is related to protein three‐dimensional structure , 1996, FEBS letters.

[28]  C. Blake Exons and the evolution of proteins. , 1985, International review of cytology.

[29]  K. Wüthrich,et al.  Three-dimensional structure of human [113Cd7]metallothionein-2 in solution determined by nuclear magnetic resonance spectroscopy. , 1990, Journal of molecular biology.

[30]  G. Rose,et al.  Helix stop signals in proteins and peptides: the capping box. , 1993, Biochemistry.

[31]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[32]  W. Fiers,et al.  On codon usage , 1979, Nature.

[33]  R. Thompson,et al.  Codon choice and gene expression: synonymous codons differ in their ability to direct aminoacylated-transfer RNA binding to ribosomes in vitro. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[34]  C. Harley,et al.  Model for messenger RNA translation during amino acid starvation applied to the calculation of protein synthetic error rates. , 1981, The Journal of biological chemistry.

[35]  G. Rose,et al.  Helix signals in proteins. , 1988, Science.

[36]  W. Fiers,et al.  Codon usage and mistranslation. In vivo basal level misreading of the MS2 coat protein message. , 1983, The Journal of biological chemistry.

[37]  F. Marston The purification of eukaryotic polypeptides synthesized in Escherichia coli. , 1986, The Biochemical journal.

[38]  T. D. Schneider,et al.  Features of spliceosome evolution and function inferred from an analysis of the information at human splice sites. , 1992, Journal of molecular biology.

[39]  R Nussinov,et al.  Doublet frequencies in evolutionary distinct groups. , 1984, Nucleic acids research.

[40]  G Bernardi,et al.  The gene distribution of the human genome. , 1996, Gene.

[41]  Antonio Marín,et al.  Preference for guanosine at first codon position in highly expressed Escherichia coli genes. A relationship with translational efficiency , 1996, Nucleic Acids Res..

[42]  I. Adzhubei,et al.  Nonuniform size distribution of nascent globin peptides, evidence for pause localization sites, and a cotranslational protein-folding model , 1991, Journal of protein chemistry.

[43]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[44]  R Nussinov,et al.  Nearest neighbor nucleotide patterns. Structural and biological implications. , 1981, The Journal of biological chemistry.

[45]  M. Bulmer The selection-mutation-drift theory of synonymous codon usage. , 1991, Genetics.

[46]  The chaperonin containing t-complex polypeptide 1 (TCP-1). Multisubunit machinery assisting in protein folding and assembly in the eukaryotic cytosol. , 1995 .

[47]  T. Ikemura Codon usage and tRNA content in unicellular and multicellular organisms. , 1985, Molecular biology and evolution.

[48]  K. Jensen,et al.  Translation rates of individual codons are not correlated with tRNA abundances or with frequencies of utilization in Escherichia coli , 1989, Journal of bacteriology.

[49]  D. Labuda,et al.  Codon-induced transfer RNA association. A property of transfer RNA involved in its adaptor function? , 1983, Journal of molecular biology.

[50]  H. Scheraga,et al.  Analysis of Conformations of Amino Acid Residues and Prediction of Backbone Topography in Proteins , 1974 .

[51]  S. Karlin,et al.  Dinucleotide relative abundance extremes: a genomic signature. , 1995, Trends in genetics : TIG.

[52]  D. Labuda,et al.  Mechanism of codon recognition by transfer RNA and codon-induced tRNA association. , 1984, Journal of molecular biology.

[53]  D. Lipman,et al.  Rapid similarity searches of nucleic acid and protein data banks. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[54]  H. Akashi Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. , 1994, Genetics.

[55]  Robert L. Baldwin,et al.  Tests of the helix dipole model for stabilization of α-helices , 1987, Nature.

[56]  J. Frank,et al.  A model of protein synthesis based on cryo-electron microscopy of the E. coli ribosome , 1995, Nature.

[57]  C. Anfinsen,et al.  The kinetics of formation of native ribonuclease during oxidation of the reduced polypeptide chain. , 1961, Proceedings of the National Academy of Sciences of the United States of America.

[58]  A. Brown,et al.  Protein folding within the cell is influenced by controlled rates of polypeptide elongation. , 1992, Journal of molecular biology.

[59]  M Yarus,et al.  Rates of aminoacyl-tRNA selection at 29 sense codons in vivo. , 1989, Journal of molecular biology.

[60]  S Karlin,et al.  Heterogeneity of genomes: measures and values. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[61]  W. Fiers,et al.  Folding of the MS2 coat protein in Escherichia coli is modulated by translational pauses resulting from mRNA secondary structure and codon usage: a hypothesis. , 1993, Journal of theoretical biology.

[62]  R A Sayle,et al.  RASMOL: biomolecular graphics for all. , 1995, Trends in biochemical sciences.

[63]  M. Goldberg The second translation of the genetic message: protein folding and assembly , 1985 .

[64]  G. Holmquist,et al.  Organization of mutations along the genome: a prime determinant of genome evolution. , 1994, Trends in ecology & evolution.

[65]  J. Parker,et al.  Missense misreading of asparagine codons as a function of codon identity and context. , 1987, The Journal of biological chemistry.

[66]  G. W. Hatfield,et al.  Codon Pair Utilization Biases Influence Translational Elongation Step Times (*) , 1995, The Journal of Biological Chemistry.

[67]  J. Sambrook,et al.  Protein folding in the cell , 1992, Nature.

[68]  The chaperonin containing t-complex polypeptide 1 (TCP-1). Multisubunit machinery assisting in protein folding and assembly in the eukaryotic cytosol. , 1995, European journal of biochemistry.

[69]  W. Fiers,et al.  Preferential codon usage in prokaryotic genes: the optimal codon-anticodon interaction energy and the selective codon usage in efficiently expressed genes. , 1982, Gene.

[70]  S. Lewis,et al.  Chaperonin-mediated folding of actin and tubulin , 1996, The Journal of cell biology.

[71]  S. Nishimura Structure, biosynthesis, and function of queuosine in transfer RNA. , 1983, Progress in nucleic acid research and molecular biology.

[72]  P. Sharp,et al.  Codon usage and genome evolution. , 1994, Current opinion in genetics & development.

[73]  P Argos,et al.  Ribosome‐mediated translational pause and protein domain organization , 1996, Protein science : a publication of the Protein Society.