Why highly expressed proteins evolve slowly.

Much recent work has explored molecular and population-genetic constraints on the rate of protein sequence evolution. The best predictor of evolutionary rate is expression level, for reasons that have remained unexplained. Here, we hypothesize that selection to reduce the burden of protein misfolding will favor protein sequences with increased robustness to translational missense errors. Pressure for translational robustness increases with expression level and constrains sequence evolution. Using several sequenced yeast genomes, global expression and protein abundance data, and sets of paralogs traceable to an ancient whole-genome duplication in yeast, we rule out several confounding effects and show that expression level explains roughly half the variation in Saccharomyces cerevisiae protein evolutionary rates. We examine causes for expression's dominant role and find that genome-wide tests favor the translational robustness explanation over existing hypotheses that invoke constraints on function or translational efficiency. Our results suggest that proteins evolve at rates largely unrelated to their functions and can explain why highly expressed proteins evolve slowly across the tree of life.
