Relationship of codon bias to mRNA concentration and protein length in Saccharomyces cerevisiae

In 1982, Ikemura reported a strikingly unequal usage of different synonymous codons, in five Saccharomyces cerevisiae nuclear genes having high protein levels. To study this trend in detail, we examined data from three independent studies that used oligonucleotide arrays or SAGE to estimate mRNA concentrations for nearly all genes in the genome. Correlation coefficients were calculated for the relationship of mRNA concentration to four commonly used measures of synonymous codon usage bias: the codon adaptation index (CAI), the codon bias index (CBI), the frequency of optimal codons (Fop), and the effective number of codons (N̂c). mRNA concentration was best approximated as an exponential function of each of these four measures. Of the four, the CAI was the most strongly correlated with mRNA concentration (rs=0.62±0.01, n=2525, p<10−17). When we controlled for CAI, mRNA concentration and protein length were negatively correlated (partial rs=–0.23±0.01, n=4765, p<10−17). This may result from selection to reduce the size of abundant proteins to minimize transcriptional and translational costs. When we controlled for mRNA concentration, protein length and CAI were positively correlated (partial rs=0.16±0.01, n=4765, p<10−17). This may reflect more effective selection in longer genes against missense errors during translation. The correlation coefficients between the mRNA levels of individual genes, as measured by different investigators and methods, were low, in the range rs=0.39–0.68. Copyright © 2000 John Wiley & Sons, Ltd.

[1]  Wei Zhou,et al.  Characterization of the Yeast Transcriptome , 1997, Cell.

[2]  P. Sharp,et al.  Codon usage: mutational bias, translational selection, or both? , 1993, Biochemical Society transactions.

[3]  X. Xia How optimized is the translational machinery in Escherichia coli, Salmonella typhimurium and Saccharomyces cerevisiae? , 1998, Genetics.

[4]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.

[5]  P. Sharp,et al.  G+C content variation along and among Saccharomyces cerevisiae chromosomes. , 1999, Molecular biology and evolution.

[6]  T. Ikemura Correlation between the abundance of yeast transfer RNAs and the occurrence of the respective codons in protein genes. Differences in synonymous codon choice patterns of yeast and Escherichia coli with reference to the abundance of isoaccepting transfer RNAs. , 1982, Journal of molecular biology.

[7]  James D. Lawrey,et al.  Statistical Methods in Biology , 1996 .

[8]  P. Sharp,et al.  Evolution of codon usage patterns: the extent and nature of divergence between Candida albicans and Saccharomyces cerevisiae. , 1992, Nucleic acids research.

[9]  J. Davis The Logic Of Causal Order , 1987 .

[10]  Norman T. J. Bailey,et al.  Statistical Methods in Biology , 1959 .

[11]  Stephen K. Campbell Flaws and Fallacies in Statistical Thinking , 1973 .

[12]  C. Kurland,et al.  Strategies for efficiency and accuracy in gene expression , 1987 .

[13]  X. Xia Maximizing transcription efficiency causes codon usage bias. , 1996, Genetics.

[14]  William H. Press,et al.  Numerical Recipes in C, 2nd Edition , 1992 .

[15]  A. Pavesi,et al.  Transfer RNA gene redundancy and translational selection in Saccharomyces cerevisiae. , 1997, Journal of molecular biology.

[16]  Etsuko N. Moriyama,et al.  Gene length and codon usage bias in Drosophila melanogaster, Saccharomyces cerevisiae and Escherichia coli , 1998, Nucleic Acids Res..

[17]  M. Gouy,et al.  Codon catalog usage and the genome hypothesis. , 1980, Nucleic acids research.

[18]  H. Akashi,et al.  Molecular evolution between Drosophila melanogaster and D. simulans: reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster. , 1996, Genetics.

[19]  P. Sharp,et al.  Synonymous codon usage in Saccharomyces cerevisiae , 1991, Yeast.

[20]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[21]  L. Duret,et al.  Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[22]  A. Eyre-Walker,et al.  Synonymous codon bias is related to gene length in Escherichia coli: selection for translational accuracy? , 1996, Molecular biology and evolution.

[23]  A. Brown,et al.  Protein folding within the cell is influenced by controlled rates of polypeptide elongation. , 1992, Journal of molecular biology.

[24]  Paul M. Sharp,et al.  Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes , 1986, Nucleic Acids Res..

[25]  B. Futcher,et al.  A Sampling of the Yeast Proteome , 1999, Molecular and Cellular Biology.

[26]  M. Holland,et al.  Targeted deletion of a yeast enolase structural gene. Identification and isolation of yeast enolase isozymes. , 1982, The Journal of biological chemistry.

[27]  Josep M. Comeron,et al.  An Evaluation of Measures of Synonymous Codon Usage Bias , 1998, Journal of Molecular Evolution.

[28]  Ji Huang,et al.  [Serial analysis of gene expression]. , 2002, Yi chuan = Hereditas.

[29]  M. Holland,et al.  Transcriptional regulation by an upstream repression sequence from the yeast enolase gene ENO1 , 1995, Yeast.

[30]  J. Powell,et al.  Evolution of codon usage bias in Drosophila. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Getting hip to the chip , 1998, Nature Genetics.

[32]  Temple F. Smith,et al.  Biology's new Rosetta stone , 1997, Nature.

[33]  J Solomovici,et al.  Does Escherichia coli optimize the economics of the translation process? , 1997, Journal of theoretical biology.

[34]  E S Lander,et al.  Ploidy regulation of gene expression. , 1999, Science.

[35]  L. Wodicka,et al.  Genome-wide expression monitoring in Saccharomyces cerevisiae , 1997, Nature Biotechnology.

[36]  P. Sharp,et al.  CODONS: a microcomputer program for codon usage analysis. , 1992, The Journal of heredity.

[37]  M. Bulmer The effect of context on synonymous codon usage in genes with low codon usage bias. , 1990, Nucleic acids research.

[38]  J. Bennetzen,et al.  Codon selection in yeast. , 1982, The Journal of biological chemistry.

[39]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[40]  E. Lander Array of hope , 1999, Nature Genetics.

[41]  Angelo Pavesi,et al.  Relationships Between Transcriptional and Translational Control of Gene Expression in Saccharomyces cerevisiae: A Multiple Regression Analysis , 1999, Journal of Molecular Evolution.

[42]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[43]  R. Blumenthal,et al.  Mapping regulatory networks in microbial cells. , 1999, Trends in microbiology.

[44]  R H Hruban,et al.  Gene expression profiles in normal and cancer cells. , 1997, Science.

[45]  M. Bulmer Codon usage and intragenic position. , 1988, Journal of theoretical biology.

[46]  T. Ikemura Codon usage and tRNA content in unicellular and multicellular organisms. , 1985, Molecular biology and evolution.

[47]  Michael R. Green,et al.  Dissecting the Regulatory Circuitry of a Eukaryotic Genome , 1998, Cell.

[48]  F. Wright The 'effective number of codons' used in a gene. , 1990, Gene.