Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution

Strikingly consistent correlations between rates of coding-sequence evolution and gene expression levels are apparent across taxa, but the biological causes behind the selective pressures on coding-sequence evolution remain controversial. Here, we demonstrate conserved patterns of simple covariation between sequence evolution, codon usage, and mRNA level in E. coli, yeast, worm, fly, mouse, and human that suggest that all observed trends stem largely from a unified underlying selective pressure. In metazoans, these trends are strongest in tissues composed of neurons, whose structure and lifetime confer extreme sensitivity to protein misfolding. We propose, and demonstrate using a molecular-level evolutionary simulation, that selection against toxicity of misfolded proteins generated by ribosome errors suffices to create all of the observed covariation. The mechanistic model of molecular evolution that emerges yields testable biochemical predictions, calls into question the use of nonsynonymous-to-synonymous substitution ratios (Ka/Ks) to detect functional selection, and suggests how mistranslation may contribute to neurodegenerative disease.

[1]  J. Dow,et al.  Using FlyAtlas to identify better Drosophila melanogaster models of human disease , 2007, Nature Genetics.

[2]  M. Lynch,et al.  The Origins of Genome Complexity , 2003, Science.

[3]  L. Duret,et al.  Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[4]  J. M. Comeron,et al.  Selective and Mutational Patterns Associated With Gene Expression in Humans , 2004, Genetics.

[5]  R. Greenberg Biometry , 1969, The Yale Journal of Biology and Medicine.

[6]  Sudhir Kumar,et al.  Gene Expression Intensity Shapes Evolutionary Rates of the Proteins Encoded by the Vertebrate Genome , 2004, Genetics.

[7]  S. Sugano,et al.  Rate of Evolution in Brain-Expressed Genes in Humans and Other Primates , 2006, PLoS biology.

[8]  Araxi O. Urrutia,et al.  Splicing and the Evolution of Proteins in Mammals , 2007, Nature Reviews Genetics.

[9]  R. Bowser,et al.  Common molecular signature in SOD1 for both sporadic and familial amyotrophic lateral sclerosis , 2007, Proceedings of the National Academy of Sciences.

[10]  C. Luo,et al.  A new method for estimating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes. , 1985, Molecular biology and evolution.

[11]  Andreas Prlic,et al.  Ensembl 2006 , 2005, Nucleic Acids Res..

[12]  Christine M. Malcom,et al.  Accelerated Evolution of Nervous System Genes in the Origin of Homo sapiens , 2004, Cell.

[13]  V. Ramakrishnan,et al.  First published online as a Review in Advance on February 25, 2005 STRUCTURAL INSIGHTS INTO TRANSLATIONAL , 2022 .

[14]  C. Ball,et al.  Saccharomyces Genome Database. , 2002, Methods in enzymology.

[15]  E. Rocha Codon usage bias from tRNA's point of view: redundancy, specialization, and efficient decoding for translation optimization. , 2004, Genome research.

[16]  R. Sauer,et al.  Genetic analysis of protein stability and function. , 1989, Annual review of genetics.

[17]  C. Wilke,et al.  Why highly expressed proteins evolve slowly. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[18]  T J White,et al.  Biochemical evolution. , 1977, Annual review of biochemistry.

[19]  C. Anfinsen The Molecular Basis of Evolution , 1959, The Yale Journal of Biology and Medicine.

[20]  P. Schimmel,et al.  An editing-defective aminoacyl-tRNA synthetase is mutagenic in aging bacteria via the SOS response , 2007, Proceedings of the National Academy of Sciences.

[21]  P. Sharp,et al.  Synonymous codon usage in Saccharomyces cerevisiae , 1991, Yeast.

[22]  C Cruz,et al.  Genetic studies of the lac repressor. XIV. Analysis of 4000 altered Escherichia coli lac repressors reveals essential and non-essential residues, as well as "spacers" which do not require a specific sequence. , 1994, Journal of molecular biology.

[23]  M. Stefani Generic Cell Dysfunction in Neurodegenerative Disorders: Role of Surfaces in Early Protein Misfolding, Aggregation, and Aggregate Cytotoxicity , 2007, The Neuroscientist : a review journal bringing neurobiology, neurology and psychiatry.

[24]  Antony Michael,et al.  The Selective Cause of an Ancient Adaptation , 2007 .

[25]  C. Dobson,et al.  Inherent toxicity of aggregates implies a common mechanism for protein misfolding diseases , 2002, Nature.

[26]  C. Pál,et al.  Highly expressed genes in yeast evolve slowly. , 2001, Genetics.

[27]  Sudhir Kumar,et al.  Neutral substitutions occur at a faster rate in exons than in noncoding DNA in primate genomes. , 2003, Genome research.

[28]  R. Hilborn,et al.  Supplementary Materials , 2019 .

[29]  Daniel M. Stoebel,et al.  The Cost of Expression of Escherichia coli lac Operon Proteins Is in the Process, Not in the Products , 2008, Genetics.

[30]  Anders Blomberg,et al.  High-resolution yeast phenomics resolves different physiological features in the saline response , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[31]  J. Wakeley,et al.  The excess of transitions among nucleotide substitutions: new methods of estimating transition bias underscore its significance. , 1996, Trends in ecology & evolution.

[32]  C. Wilke Molecular clock in neutral protein evolution , 2004, BMC Genetics.

[33]  L. Hurst,et al.  The Genetic Code Is One in a Million , 1998, Journal of Molecular Evolution.

[34]  Eugene V Koonin,et al.  Evolutionary systems biology: links between gene evolution and function. , 2006, Current opinion in biotechnology.

[35]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[36]  Pasch,et al.  References and Notes Supporting Online Material Evolution of Hormone-receptor Complexity by Molecular Exploitation , 2022 .

[37]  A. Bird DNA methylation and the frequency of CpG in animal DNA. , 1980, Nucleic acids research.

[38]  Kimberly Van Auken,et al.  WormBase 2007 , 2007, Nucleic Acids Res..

[39]  Wen-Hsiung Li,et al.  Mammalian housekeeping genes evolve more slowly than tissue-specific genes. , 2004, Molecular biology and evolution.

[40]  Ben-Yang Liao,et al.  Impacts of gene essentiality, expression pattern, and gene compactness on the evolutionary rate of mammalian proteins. , 2006, Molecular biology and evolution.

[41]  A. Eyre-Walker,et al.  The problem of counting sites in the estimation of the synonymous and nonsynonymous substitution rates: implications for the correlation between the synonymous substitution rate and codon usage bias. , 2003, Genetics.

[42]  S. Ackerman,et al.  Editing-defective tRNA synthetase causes protein misfolding and neurodegeneration , 2006, Nature.

[43]  Claus O Wilke,et al.  Population Genetics of Translational Robustness , 2005, Genetics.

[44]  C. Pál,et al.  An integrated view of protein evolution , 2006, Nature Reviews Genetics.

[45]  Ziheng Yang,et al.  Mutation-selection models of codon substitution and their use to estimate selective strengths on codon usage. , 2008, Molecular biology and evolution.

[46]  Laurent Duret,et al.  A new perspective on isochore evolution. , 2006, Gene.

[47]  T. Borza,et al.  Evolutionary Rates and Expression Level in Chlamydomonas , 2006, Genetics.

[48]  A. E. Hirsh,et al.  Functional genomic analysis of the rates of protein evolution. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Claudio Soto,et al.  Unfolding the role of protein misfolding in neurodegenerative diseases , 2003, Nature Reviews Neuroscience.

[50]  Markus J. Herrgård,et al.  Integrating high-throughput and computational data elucidates bacterial networks , 2004, Nature.

[51]  Blake C Meyers,et al.  Effects of gene expression on molecular evolution in Arabidopsis thaliana and Arabidopsis lyrata. , 2004, Molecular biology and evolution.

[52]  Wen-Hsiung Li,et al.  The rate of synonymous substitution in enterobacterial genes is inversely related to codon usage bias. , 1987, Molecular biology and evolution.

[53]  E. Brown,et al.  Genomic analysis of gene expression in C. elegans. , 2000, Science.

[54]  A. Orth,et al.  Large-scale analysis of the human and mouse transcriptomes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[55]  H. Akashi Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. , 1994, Genetics.

[56]  J. Kelly,et al.  The Biological and Chemical Basis for Tissue-Selective Amyloid Disease , 2005, Cell.

[57]  S. Ackerman,et al.  Protein accumulation and neurodegeneration in the woozy mutant mouse is caused by disruption of SIL1, a cochaperone of BiP , 2005, Nature Genetics.

[58]  Ziheng Yang,et al.  Computational Molecular Evolution , 2006 .

[59]  Tao Pan,et al.  Tissue-Specific Differences in Human Transfer RNA Expression , 2006, PLoS genetics.

[60]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[61]  Jong Bhak,et al.  Impact of Transcriptional Properties on Essentiality and Evolutionary Rate , 2007, Genetics.

[62]  L. Hurst,et al.  Hearing silence: non-neutral evolution at synonymous sites in mammals , 2006, Nature Reviews Genetics.

[63]  C. Hollenberg,et al.  Characterization of a glucose-repressed pyruvate kinase (Pyk2p) in Saccharomyces cerevisiae that is catalytically insensitive to fructose-1,6-bisphosphate , 1997, Journal of bacteriology.

[64]  Bernardo Lemos,et al.  Evolution of proteins and gene expression levels are coupled in Drosophila and are independently associated with mRNA abundance, protein length, and number of protein-protein interactions. , 2005, Molecular biology and evolution.

[65]  A. Dean,et al.  Mechanistic approaches to the study of evolution: the functional synthesis , 2007, Nature Reviews Genetics.

[66]  Michael R. Green,et al.  Dissecting the Regulatory Circuitry of a Eukaryotic Genome , 1998, Cell.

[67]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[68]  J H Miller,et al.  Genetic studies of the lac repressor. I. Correlation of mutational sites with specific amino acid residues: construction of a colinear gene-protein map. , 1977, Journal of molecular biology.

[69]  Erin Beck,et al.  The comprehensive microbial resource , 2000, Nucleic Acids Res..

[70]  L. Wernisch,et al.  Solving the riddle of codon usage preferences: a test for translational selection. , 2004, Nucleic acids research.

[71]  C. Wilke,et al.  A single determinant dominates the rate of yeast protein evolution. , 2006, Molecular biology and evolution.

[72]  Eduardo P C Rocha,et al.  An analysis of determinants of amino acids substitution rates in bacterial proteins. , 2004, Molecular biology and evolution.

[73]  Melanie A. Huntley,et al.  Evolution of genes and genomes on the Drosophila phylogeny , 2007, Nature.

[74]  B. Charlesworth,et al.  Correlated Evolution of Synonymous and Nonsynonymous Sites in Drosophila , 2004, Journal of Molecular Evolution.

[75]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[76]  F. Arnold,et al.  Protein stability promotes evolvability. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[77]  Pär K Ingvarsson,et al.  Gene expression and protein length influence codon usage and rates of sequence evolution in Populus tremula. , 2006, Molecular biology and evolution.

[78]  C. Dobson,et al.  Protein aggregation and aggregate toxicity: new insights into protein folding, misfolding diseases and biological evolution , 2003, Journal of Molecular Medicine.

[79]  Ronald W. Davis,et al.  Functional profiling of the Saccharomyces cerevisiae genome , 2002, Nature.

[80]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[81]  J. Parker,et al.  Errors and alternatives in reading the universal genetic code. , 1989, Microbiological reviews.

[82]  A. Eyre-Walker,et al.  Synonymous codon usage in Escherichia coli: selection for translational accuracy. , 2006, Molecular biology and evolution.

[83]  T. Ikemura Codon usage and tRNA content in unicellular and multicellular organisms. , 1985, Molecular biology and evolution.

[84]  Juno Choe,et al.  Protein tolerance to random amino acid change. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[85]  D. M. Krylov,et al.  Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution. , 2003, Genome research.

[86]  L. Duret,et al.  Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. , 2000, Molecular biology and evolution.

[87]  P. Farabaugh,et al.  The frequency of translational misreading errors in E. coli is largely determined by tRNA competition. , 2006, RNA.

[88]  Emile Zuckerkandl,et al.  Evolutionary processes and evolutionary noise at the molecular level , 1976, Journal of Molecular Evolution.

[89]  J. Kourie,et al.  Ion channel formation and membrane‐linked pathologies of misfolded hydrophobic proteins: The role of dangerous unchaperoned molecules , 2002, Clinical and experimental pharmacology & physiology.

[90]  Christine M. Malcom,et al.  A highly unexpected strong correlation between fixation probability of nonsynonymous mutations and mutation rate. , 2005, Trends in genetics : TIG.

[91]  Simone Campanoni Competition , 1866, Nature.