Impact of translational error-induced and error-free misfolding on the rate of protein evolution

What determines the rate of protein evolution is a fundamental question in biology. Recent genomic studies revealed a surprisingly strong anticorrelation between the expression level of a protein and its rate of sequence evolution. This observation is currently explained by the translational robustness hypothesis in which the toxicity of translational error‐induced protein misfolding selects for higher translational robustness of more abundant proteins, which constrains sequence evolution. However, the impact of error‐free protein misfolding has not been evaluated. We estimate that a non‐negligible fraction of misfolded proteins are error free and demonstrate by a molecular‐level evolutionary simulation that selection against protein misfolding results in a greater reduction of error‐free misfolding than error‐induced misfolding. Thus, an overarching protein‐misfolding‐avoidance hypothesis that includes both sources of misfolding is superior to the translational robustness hypothesis. We show that misfolding‐minimizing amino acids are preferentially used in highly abundant yeast proteins and that these residues are evolutionarily more conserved than other residues of the same proteins. These findings provide unambiguous support to the role of protein‐misfolding‐avoidance in determining the rate of protein sequence evolution.

[1]  Tobias Warnecke,et al.  GroEL dependency affects codon usage—support for a critical role of misfolding in gene evolution , 2010, Molecular systems biology.

[2]  Eugene V Koonin,et al.  Systemic determinants of gene evolution and function , 2005, Molecular systems biology.

[3]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[4]  K. Holsinger The neutral theory of molecular evolution , 2004 .

[5]  Jianzhi Zhang,et al.  Significant impact of protein dispensability on the instantaneous rate of protein evolution. , 2005, Molecular biology and evolution.

[6]  Eugene V Koonin,et al.  Universal distribution of protein evolution rates as a consequence of protein folding physics , 2010, Proceedings of the National Academy of Sciences.

[7]  T. Ohta,et al.  On some principles governing molecular evolution. , 1974, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Richard A Goldstein,et al.  Why are proteins so robust to site mutations? , 2002, Journal of molecular biology.

[9]  Liran Carmel,et al.  Unifying measures of gene function and evolution , 2006, Proceedings of the Royal Society B: Biological Sciences.

[10]  A. E. Hirsh,et al.  Protein dispensability and rate of evolution , 2001, Nature.

[11]  Francesc X. Avilés,et al.  AGGRESCAN: a server for the prediction and evaluation of "hot spots" of aggregation in polypeptides , 2007, BMC Bioinform..

[12]  R. Sauer,et al.  The structural stability of a protein is an important determinant of its proteolytic susceptibility in Escherichia coli. , 1989, The Journal of biological chemistry.

[13]  C. Dobson Protein folding and misfolding , 2003, Nature.

[14]  A. Eyre-Walker,et al.  Synonymous codon usage in Escherichia coli: selection for translational accuracy. , 2006, Molecular biology and evolution.

[15]  R. Kaufman,et al.  The mammalian unfolded protein response. , 2003, Annual review of biochemistry.

[16]  Sudhir Kumar,et al.  Gene Expression Intensity Shapes Evolutionary Rates of the Proteins Encoded by the Vertebrate Genome , 2004, Genetics.

[17]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[18]  D. M. Taverna,et al.  Why are proteins marginally stable? , 2002, Proteins.

[19]  C. Dobson,et al.  Inherent toxicity of aggregates implies a common mechanism for protein misfolding diseases , 2002, Nature.

[20]  C. Pál,et al.  Highly expressed genes in yeast evolve slowly. , 2001, Genetics.

[21]  Michael Y. Galperin,et al.  Sequence ― Evolution ― Function: Computational Approaches in Comparative Genomics , 2010 .

[22]  Akinori Sarai,et al.  ProTherm, version 4.0: thermodynamic database for proteins and mutants , 2004, Nucleic Acids Res..

[23]  Eugene V Koonin,et al.  No simple dependence between protein evolution rate and the number of protein-protein interactions: only the most prolific interactors tend to evolve slowly , 2003, BMC Evolutionary Biology.

[24]  C. Wilke Molecular clock in neutral protein evolution , 2004, BMC Genetics.

[25]  Salvador Ventura,et al.  Protein Aggregation Profile of the Bacterial Cytosol , 2010, PloS one.

[26]  Ruth Hershberg,et al.  Selection on codon bias. , 2008, Annual review of genetics.

[27]  Christian Brochmann,et al.  Refugia, differentiation and postglacial migration in arctic‐alpine Eurasia, exemplified by the mountain avens (Dryas octopetala L.) , 2006, Molecular ecology.

[28]  H. Akashi Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. , 1994, Genetics.

[29]  Wen-Hsiung Li,et al.  Mammalian housekeeping genes evolve more slowly than tissue-specific genes. , 2004, Molecular biology and evolution.

[30]  E. Marcotte,et al.  Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation , 2007, Nature Biotechnology.

[31]  K. Sharp,et al.  Potential energy functions for protein design. , 2007, Current opinion in structural biology.

[32]  David Tollervey,et al.  Coding-Sequence Determinants of Gene Expression in Escherichia coli , 2009, Science.

[33]  C. Wilke,et al.  Why highly expressed proteins evolve slowly. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Jianzhi Zhang,et al.  Impact of Extracellularity on the Evolutionary Rate of Mammalian Proteins , 2010, Genome biology and evolution.

[35]  Eugene V Koonin,et al.  Comparable contributions of structural-functional constraints and expression level to the rate of protein sequence evolution , 2008, Biology Direct.

[36]  R. Sauer,et al.  Genetic analysis of protein stability and function. , 1989, Annual review of genetics.

[37]  Jianzhi Zhang Evolution by gene duplication: an update , 2003 .

[38]  L. Pauling,et al.  Evolutionary Divergence and Convergence in Proteins , 1965 .

[39]  Ben-Yang Liao,et al.  Impacts of gene essentiality, expression pattern, and gene compactness on the evolutionary rate of mammalian proteins. , 2006, Molecular biology and evolution.

[40]  M. Kimura Evolutionary Rate at the Molecular Level , 1968, Nature.

[41]  J. L. Cherry Highly Expressed and Slowly Evolving Proteins Share Compositional Properties with Thermophilic Proteins , 2009, Molecular biology and evolution.

[42]  J. M. Sanchez-Ruiz,et al.  Protein kinetic stability. , 2010, Biophysical chemistry.

[43]  L. Serrano,et al.  Prediction of sequence-dependent and mutational effects on the aggregation of peptides and proteins , 2004, Nature Biotechnology.

[44]  K. Dill,et al.  The Protein Folding Problem , 1993 .

[45]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[46]  Nikolay V Dokholyan,et al.  Natural selection against protein aggregation on self-interacting and essential proteins in yeast, fly, and worm. , 2008, Molecular biology and evolution.

[47]  M. Nei Molecular Evolutionary Genetics , 1987 .

[48]  Michael R. Green,et al.  Dissecting the Regulatory Circuitry of a Eukaryotic Genome , 1998, Cell.

[49]  M. Nei,et al.  Molecular Evolution and Phylogenetics , 2000 .

[50]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[51]  C. Wilke,et al.  A single determinant dominates the rate of yeast protein evolution. , 2006, Molecular biology and evolution.

[52]  Takashi Gojobori,et al.  Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Eduardo P C Rocha,et al.  An analysis of determinants of amino acids substitution rates in bacterial proteins. , 2004, Molecular biology and evolution.

[54]  Dan S. Tawfik,et al.  Chaperonin overexpression promotes genetic variation and enzyme evolution , 2009, Nature.

[55]  Jianzhi Zhang,et al.  Why Is the Correlation between Gene Importance and Gene Evolutionary Rate So Weak? , 2009, PLoS genetics.

[56]  Sydney Anne Cameron,et al.  Molecular Evolution: A Phylogenetic Approach.—Roderic D. M. Page and Edward C. Holmes. , 2002 .

[57]  Andreas Wagner,et al.  Energy constraints on the evolution of gene expression. , 2005, Molecular biology and evolution.

[58]  Claus O. Wilke,et al.  Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution , 2008, Cell.

[59]  N. Friedman,et al.  Natural history and evolutionary principles of gene duplication in fungi , 2007, Nature.

[60]  E. O’Shea,et al.  Global analysis of protein expression in yeast , 2003, Nature.

[61]  Piero Fariselli,et al.  I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure , 2005, Nucleic Acids Res..

[62]  Tong Zhou,et al.  Translationally optimal codons associate with structurally sensitive sites in proteins. , 2009, Molecular biology and evolution.

[63]  L. Hurst,et al.  The Genetic Code Is One in a Million , 1998, Journal of Molecular Evolution.

[64]  D. Lipman,et al.  Relative Contributions of Intrinsic Structural–Functional Constraints and Translation Rate to the Evolution of Protein-Coding Genes , 2010, Genome biology and evolution.

[65]  T. Jukes Non-Darwinian Evolution , 2001 .

[66]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[67]  Kara Dolinski,et al.  Saccharomyces Genome Database provides mutant phenotype data , 2009, Nucleic Acids Res..

[68]  Michele Vendruscolo,et al.  Life on the edge: a link between gene expression levels and aggregation rates of human proteins. , 2007, Trends in biochemical sciences.

[69]  C. Wilke,et al.  The evolutionary consequences of erroneous protein synthesis , 2009, Nature Reviews Genetics.

[70]  A. E. Hirsh,et al.  Functional genomic analysis of the rates of protein evolution. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[71]  A. E. Hirsh,et al.  Evolutionary Rate in the Protein Interaction Network , 2002, Science.

[72]  Laurence D. Hurst,et al.  Do essential genes evolve slowly? , 1999, Current Biology.

[73]  C. Bystroff,et al.  Identifying the subproteome of kinetically stable proteins via diagonal 2D SDS/PAGE , 2007, Proceedings of the National Academy of Sciences.

[74]  M. Kimura The Neutral Theory of Molecular Evolution: Introduction , 1983 .