Determinants of the rate of protein sequence evolution

The rate and mechanism of protein sequence evolution have been central questions in evolutionary biology since the 1960s. Although the rate of protein sequence evolution depends primarily on the level of functional constraint, exactly what determines functional constraint has remained unclear. The increasing availability of genomic data has enabled much needed empirical examinations on the nature of functional constraint. These studies found that the evolutionary rate of a protein is predominantly influenced by its expression level rather than functional importance. A combination of theoretical and empirical analyses has identified multiple mechanisms behind these observations and demonstrated a prominent role in protein evolution of selection against errors in molecular and cellular processes.

[1]  Michael R. Green,et al.  Dissecting the Regulatory Circuitry of a Eukaryotic Genome , 1998, Cell.

[2]  Laurence D. Hurst,et al.  Genomic function (communication arising): Rate of evolution and gene dispensability , 2003, Nature.

[3]  C. Wilke,et al.  The evolutionary consequences of erroneous protein synthesis , 2009, Nature Reviews Genetics.

[4]  Joshua G. Dunn,et al.  Ribosome profiling reveals pervasive and regulated stop codon readthrough in Drosophila melanogaster , 2013, eLife.

[5]  Svetlana A. Shabalina,et al.  Negative Correlation between Expression Level and Evolutionary Rate of Long Intergenic Noncoding RNAs , 2011, Genome biology and evolution.

[6]  D C Shields,et al.  Chromosomal location and evolutionary rate variation in enterobacterial genes. , 1989, Science.

[7]  M. Nei,et al.  Molecular Evolution and Phylogenetics , 2000 .

[8]  Eugene V. Koonin,et al.  Coupling Between Protein Level Selection and Codon Usage Optimization in the Evolution of Bacteria and Archaea , 2014, mBio.

[9]  J. Zhang,et al.  Correlation between the substitution rate and rate variation among sites in protein evolution. , 1998, Genetics.

[10]  T. Ota,et al.  Positive selection is a general phenomenon in the evolution of abalone sperm lysin. , 1995, Molecular biology and evolution.

[11]  Christopher J. Oldfield,et al.  Evolutionary Rate Heterogeneity in Proteins with Long Disordered Regions , 2002, Journal of Molecular Evolution.

[12]  D. Hartl,et al.  Misfolded proteins impose a dosage-dependent fitness cost and trigger a cytosolic unfolded protein response in yeast , 2010, Proceedings of the National Academy of Sciences.

[13]  D. Kahn,et al.  The Relationship among Gene Expression, the Evolution of Gene Dosage, and the Rate of Protein Evolution , 2010, PLoS genetics.

[14]  J. L. Cherry Expression Level, Evolutionary Rate, and the Cost of Expression , 2010, Genome biology and evolution.

[15]  L. Orgel,et al.  Biochemical Evolution , 1971, Nature.

[16]  Jianzhi Zhang,et al.  Toward a Molecular Understanding of Pleiotropy , 2006, Genetics.

[17]  D. Vitkup,et al.  Influence of metabolic network structure and function on enzyme evolution , 2006, Genome Biology.

[18]  P. Hanawalt,et al.  Transcription-coupled DNA repair: two decades of progress and surprises , 2008, Nature Reviews Molecular Cell Biology.

[19]  Jian-Rong Yang,et al.  Differential requirements for mRNA folding partially explain why highly expressed proteins evolve slowly , 2013, Proceedings of the National Academy of Sciences.

[20]  J. Moult,et al.  SNPs, protein structure, and disease , 2001, Human mutation.

[21]  Nayun Kim,et al.  Transcription as a source of genome instability , 2012, Nature Reviews Genetics.

[22]  Laurence D. Hurst,et al.  Do essential genes evolve slowly? , 1999, Current Biology.

[23]  Gabriela R. Moura,et al.  The Yeast PNC1 Longevity Gene Is Up-Regulated by mRNA Mistranslation , 2009, PloS one.

[24]  A. E. Hirsh,et al.  Functional genomic analysis of the rates of protein evolution. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Mark Gerstein,et al.  Integrated Assessment of Genomic Correlates of Protein Evolutionary Rate , 2009, PLoS Comput. Biol..

[26]  I. Simon,et al.  Modulation of the vitamin D3 response by cancer-associated mutant p53. , 2010, Cancer cell.

[27]  Jianzhi Zhang,et al.  Impact of gene expression noise on organismal fitness and the efficacy of natural selection , 2011, Proceedings of the National Academy of Sciences.

[28]  Claus O. Wilke,et al.  Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution , 2008, Cell.

[29]  A. E. Hirsh,et al.  Evolutionary Rate in the Protein Interaction Network , 2002, Science.

[30]  J. Shendure,et al.  Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data , 2011, Nature Reviews Genetics.

[31]  David J. Lipman,et al.  Why Does a Protein’s Evolutionary Rate Vary over Time? , 2013, Genome biology and evolution.

[32]  Jianzhi Zhang,et al.  High Expression Hampers Horizontal Gene Transfer , 2012, Genome biology and evolution.

[33]  D. M. Krylov,et al.  Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution. , 2003, Genome research.

[34]  B. Williams,et al.  From single-cell to cell-pool transcriptomes: Stochasticity in gene expression and RNA splicing , 2014, Genome research.

[35]  R. Varadarajan,et al.  Residue depth: a novel parameter for the analysis of protein structure and stability. , 1999, Structure.

[36]  Tobias Warnecke,et al.  Error prevention and mitigation as forces in the evolution of genes and genomes , 2011, Nature Reviews Genetics.

[37]  Matthew W. Hahn,et al.  Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. , 2005, Molecular biology and evolution.

[38]  S. Bergmann,et al.  The Hourglass and the Early Conservation Models—Co-Existing Patterns of Developmental Constraints in Vertebrates , 2013, PLoS genetics.

[39]  Tamir Tuller,et al.  Strong association between mRNA folding strength and protein abundance in S. cerevisiae , 2012, EMBO reports.

[40]  C. Wilke,et al.  Why highly expressed proteins evolve slowly. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[41]  M. Nei,et al.  Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection , 1988, Nature.

[42]  Philip M. Kim,et al.  Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights , 2006, Science.

[43]  M. Elowitz,et al.  Functional roles for noise in genetic circuits , 2010, Nature.

[44]  C. Pál,et al.  Genomic function: Rate of evolution and gene dispensability. , 2003, Nature.

[45]  Jian-Rong Yang,et al.  Impact of translational error-induced and error-free misfolding on the rate of protein evolution , 2010, Molecular systems biology.

[46]  C. Wilke,et al.  A single determinant dominates the rate of yeast protein evolution. , 2006, Molecular biology and evolution.

[47]  M. Kimura The Neutral Theory of Molecular Evolution: Introduction , 1983 .

[48]  Frances H Arnold,et al.  Structural determinants of the rate of protein evolution in yeast. , 2006, Molecular biology and evolution.

[49]  Takashi Gojobori,et al.  Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Eduardo P C Rocha,et al.  An analysis of determinants of amino acids substitution rates in bacterial proteins. , 2004, Molecular biology and evolution.

[51]  Ben Lehner,et al.  Intrinsic Protein Disorder and Interaction Promiscuity Are Widely Associated with Dosage Sensitivity , 2009, Cell.

[52]  Jianzhi Zhang,et al.  Low rates of expression profile divergence in highly expressed genes and tissue-specific genes during mammalian evolution. , 2006, Molecular biology and evolution.

[53]  E. Koonin,et al.  Essential genes are more evolutionarily conserved than are nonessential genes in bacteria. , 2002, Genome research.

[54]  Patrick C Phillips,et al.  Evolutionary rates and centrality in the yeast gene regulatory network , 2009, Genome Biology.

[55]  Susan Lindquist,et al.  Quantitative Analysis of Hsp90-Client Interactions Reveals Principles of Substrate Recognition , 2012, Cell.

[56]  T. Ohta,et al.  On some principles governing molecular evolution. , 1974, Proceedings of the National Academy of Sciences of the United States of America.

[57]  T. Helleday,et al.  Transcription-associated recombination in eukaryotes: link between transcription, replication and recombination. , 2009, Mutagenesis.

[58]  Alan M. Moses,et al.  In vivo enhancer analysis of human conserved non-coding sequences , 2006, Nature.

[59]  Jianzhi Zhang,et al.  Why Is the Correlation between Gene Importance and Gene Evolutionary Rate So Weak? , 2009, PLoS genetics.

[60]  Jianzhi Zhang,et al.  Human coding RNA editing is generally nonadaptive , 2014, Proceedings of the National Academy of Sciences.

[61]  M. Gerstein,et al.  Comparing protein abundance and mRNA expression levels on a genomic scale , 2003, Genome Biology.

[62]  Marc Robinson-Rechavi,et al.  Determinants of protein evolutionary rates in light of ENCODE functional genomics , 2014, BMC Bioinformatics.

[63]  H. Akashi Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. , 1994, Genetics.

[64]  Diego J. Zea,et al.  Protein conformational diversity correlates with evolutionary rate. , 2013, Molecular biology and evolution.

[65]  Subhajyoti De,et al.  Cellular crowding imposes global constraints on the chemistry and evolution of proteomes , 2012, Proceedings of the National Academy of Sciences.

[66]  Jian-Rong Yang,et al.  Protein misinteraction avoidance causes highly expressed proteins to evolve slowly , 2012, Proceedings of the National Academy of Sciences.

[67]  Jianzhi Zhang,et al.  Impact of Extracellularity on the Evolutionary Rate of Mammalian Proteins , 2010, Genome biology and evolution.

[68]  Eugene V Koonin,et al.  Comparable contributions of structural-functional constraints and expression level to the rate of protein sequence evolution , 2008, Biology Direct.

[69]  E. Marcotte,et al.  Insights into the regulation of protein abundance from proteomic and transcriptomic analyses , 2012, Nature Reviews Genetics.

[70]  Yu Xia,et al.  Structural determinants of protein evolution are context-sensitive at the residue level. , 2009, Molecular biology and evolution.

[71]  A. Oudenaarden,et al.  Nature, Nurture, or Chance: Stochastic Gene Expression and Its Consequences , 2008, Cell.

[72]  D. Rickwood,et al.  Cell and Molecular Biology , 1998, The Journal of Steroid Biochemistry and Molecular Biology.

[73]  Z. Yang,et al.  Among-site rate variation and its impact on phylogenetic analyses. , 1996, Trends in ecology & evolution.

[74]  Jianzhi Zhang,et al.  No gene-specific optimization of mutation rate in Escherichia coli. , 2013, Molecular biology and evolution.

[75]  L. Orgel,et al.  The maintenance of the accuracy of protein synthesis and its relevance to ageing. , 1963, Proceedings of the National Academy of Sciences of the United States of America.

[76]  Wenfeng Qian,et al.  Positive selection for elevated gene expression noise in yeast , 2009, Molecular systems biology.

[77]  Michael T. Laub,et al.  Pervasive degeneracy and epistasis in a protein-protein interface , 2015, Science.

[78]  J. L. Cherry Highly Expressed and Slowly Evolving Proteins Share Compositional Properties with Thermophilic Proteins , 2009, Molecular biology and evolution.

[79]  Steven A Frank,et al.  Somatic Mosaicism and Disease , 2014, Current Biology.

[80]  Ben-Yang Liao,et al.  Impacts of gene essentiality, expression pattern, and gene compactness on the evolutionary rate of mammalian proteins. , 2006, Molecular biology and evolution.

[81]  M. Kimura Evolutionary Rate at the Molecular Level , 1968, Nature.

[82]  Jianzhi Zhang,et al.  Genomic evidence for elevated mutation rates in highly expressed genes , 2012, EMBO reports.

[83]  Wen-Hsiung Li,et al.  The relationships among microRNA regulation, intrinsically disordered regions, and other indicators of protein evolutionary rate. , 2011, Molecular biology and evolution.

[84]  K. Holsinger The neutral theory of molecular evolution , 2004 .

[85]  D. Andersson,et al.  Whole-genome mutational biases in bacteria , 2008, Proceedings of the National Academy of Sciences.

[86]  Jian-Rong Yang,et al.  Codon-by-Codon Modulation of Translational Speed and Accuracy Via mRNA Folding , 2014, PLoS biology.

[87]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[88]  D. Futuyma,et al.  Evolution Since Darwin: The First 150 Years , 2010 .

[89]  R. Ranganathan,et al.  Evolvability as a Function of Purifying Selection in TEM-1 β-Lactamase , 2015, Cell.

[90]  Sidney W. Fox Evolving genes and proteins (Bryson, Vernon; Vogel, Henry J.; eds.) , 1966 .

[91]  Jianzhi Zhang,et al.  Significant impact of protein dispensability on the instantaneous rate of protein evolution. , 2005, Molecular biology and evolution.

[92]  Andrew Ying-Fei Chang,et al.  Assessing determinants of exonic evolutionary rates in mammals. , 2012, Molecular biology and evolution.

[93]  C. Wilke,et al.  The look-ahead effect of phenotypic mutations , 2007, Biology Direct.

[94]  Ronald W. Davis,et al.  Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. , 1999, Science.

[95]  Samuel H. Vohr,et al.  Evolutionary Rates and Gene Dispensability Associate with Replication Timing in the Archaeon Sulfolobus islandicus , 2010, Genome biology and evolution.

[96]  Jianzhi Zhang,et al.  Yeast mutation accumulation experiment supports elevated mutation rates at highly transcribed sites , 2014, Proceedings of the National Academy of Sciences.

[97]  Liran Carmel,et al.  Unifying measures of gene function and evolution , 2006, Proceedings of the Royal Society B: Biological Sciences.

[98]  A. E. Hirsh,et al.  Protein dispensability and rate of evolution , 2001, Nature.

[99]  Jenn-Kang Hwang,et al.  Site-specific structural constraints on protein sequence evolutionary divergence: local packing density versus solvent exposure. , 2014, Molecular biology and evolution.

[100]  Rui Jiang,et al.  Integrating Multiple Genomic Data to Predict Disease-Causing Nonsynonymous Single Nucleotide Variants in Exome Sequencing Studies , 2014, PLoS genetics.

[101]  David Bogumil,et al.  Chaperonin-Dependent Accelerated Substitution Rates in Prokaryotes , 2010, Genome biology and evolution.

[102]  N. Takahata,et al.  Molecular Clock: An Anti-neo-Darwinian Legacy , 2007, Genetics.

[103]  J. L. King,et al.  Non-Darwinian evolution. , 1969, Science.

[104]  C. Pál,et al.  An integrated view of protein evolution , 2006, Nature Reviews Genetics.

[105]  Trees-Juen Chuang,et al.  Impacts of Pretranscriptional DNA Methylation, Transcriptional Transcription Factor, and Posttranscriptional microRNA Regulations on Protein Evolutionary Rate , 2014, Genome biology and evolution.

[106]  Pier Paolo Pandolfi,et al.  Aberrant mRNA translation in cancer pathogenesis: an old concept revisited comes finally of age , 2004, Oncogene.

[107]  L. Pauling,et al.  Evolutionary Divergence and Convergence in Proteins , 1965 .

[108]  Suhua Shi,et al.  Testing hypotheses on the rate of molecular evolution in relation to gene expression using microRNAs , 2011, Proceedings of the National Academy of Sciences.

[109]  Albert J. Vilella,et al.  A high-resolution map of human evolutionary constraint using 29 mammals , 2011, Nature.

[110]  Sergei Maslov,et al.  Constraints imposed by non-functional protein–protein interactions on gene expression and proteome size , 2008, Molecular systems biology.

[111]  C. Adami,et al.  Apparent dependence of protein evolutionary rate on number of interactions is linked to biases in protein–protein interactions data sets , 2003, BMC Evolutionary Biology.

[112]  Sudhir Kumar,et al.  Molecular clocks: four decades of evolution , 2005, Nature Reviews Genetics.

[113]  V. Rotter,et al.  Mutant p53 gain-of-function in cancer. , 2010, Cold Spring Harbor perspectives in biology.

[114]  C. Pál,et al.  Highly expressed genes in yeast evolve slowly. , 2001, Genetics.

[115]  Søren Vang,et al.  Protein misfolding and human disease. , 2006, Annual review of genomics and human genetics.

[116]  Joel Dudley,et al.  TimeTree: a public knowledge-base of divergence times among organisms , 2006, Bioinform..

[117]  Mark Gerstein,et al.  The relationship between the evolution of microRNA targets and the length of their UTRs , 2009, BMC Genomics.

[118]  Wen-Hsiung Li,et al.  Mammalian housekeeping genes evolve more slowly than tissue-specific genes. , 2004, Molecular biology and evolution.