Codon usage influences fitness through RNA toxicity

Significance Synonymous mutations in genes do not change protein sequence, but they may affect gene expression and cellular function. Here we describe an unexpected toxic effect of synonymous mutations in Escherichia coli, with potentially large implications for bacterial physiology and evolution. Unlike previously studied effects of synonymous mutations, the effect that we discovered is independent of translation, but it depends on the production of toxic mRNA molecules. We hypothesize that the mechanism we identified influences the evolution of endogenous genes in bacteria by imposing selective constraints on synonymous mutations that arise in the genome. Of interest for biotechnology and synthetic biology, we identify bacterial strains and growth conditions that alleviate RNA toxicity, thus allowing efficient overexpression of heterologous proteins. Many organisms are subject to selective pressure that gives rise to unequal usage of synonymous codons, known as codon bias. To experimentally dissect the mechanisms of selection on synonymous sites, we expressed several hundred synonymous variants of the GFP gene in Escherichia coli, and used quantitative growth and viability assays to estimate bacterial fitness. Unexpectedly, we found many synonymous variants whose expression was toxic to E. coli. Unlike previously studied effects of synonymous mutations, the effect that we discovered is independent of translation, but it depends on the production of toxic mRNA molecules. We identified RNA sequence determinants of toxicity and evolved suppressor strains that can tolerate the expression of toxic GFP variants. Genome sequencing of these suppressor strains revealed a cluster of promoter mutations that prevented toxicity by reducing mRNA levels. We conclude that translation-independent RNA toxicity is a previously unrecognized obstacle in bacterial gene expression.

[1]  Sriram Kosuri,et al.  Causes and Effects of N-Terminal Codon Bias in Bacterial Genes , 2013, Science.

[2]  J. Plotkin,et al.  Synonymous but not the same: the causes and consequences of codon bias , 2011, Nature Reviews Genetics.

[3]  Nathan Morris,et al.  Codon Optimality Is a Major Determinant of mRNA Stability , 2015, Cell.

[4]  Gerald Striedner,et al.  Preventing T7 RNA polymerase read-through transcription-A synthetic termination signal capable of improving bioprocess stability. , 2015, ACS synthetic biology.

[5]  Christopher A. Voigt,et al.  Automated Design of Synthetic Ribosome Binding Sites to Precisely Control Protein Expression , 2009, Nature Biotechnology.

[6]  Michael Zuker,et al.  UNAFold: software for nucleic acid folding and hybridization. , 2008, Methods in molecular biology.

[7]  Claus O. Wilke,et al.  Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution , 2008, Cell.

[8]  Mian Zhou,et al.  Codon usage is an important determinant of gene expression levels largely through its effects on transcription , 2016, Proceedings of the National Academy of Sciences.

[9]  H. Akashi Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. , 1994, Genetics.

[10]  M. Raponi,et al.  Synonymous mutations in CFTR exon 12 affect splicing and are not neutral in evolution. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[12]  Eric D. Kelsic,et al.  RNA Structural Determinants of Optimal Codons Revealed by MAGE-Seq. , 2016, Cell systems.

[13]  H. Ochman,et al.  A selective force favoring increased G+C content in bacterial genes , 2012, Proceedings of the National Academy of Sciences.

[14]  K. Sobczak,et al.  Triplet repeat RNA structure and its role as pathogenic agent and therapeutic target , 2011, Nucleic acids research.

[15]  A. Greener,et al.  Site-directed mutagenesis using double-stranded plasmid DNA templates. , 1996, Methods in molecular biology.

[16]  D. Hughes,et al.  The Selective Advantage of Synonymous Codon Usage Bias in Salmonella , 2016, PLoS genetics.

[17]  Gaetano T. Montelione,et al.  Codon influence on protein expression in E. coli correlates with mRNA levels , 2016, Nature.

[18]  I. Pastan,et al.  Random recombination of antibody single chain Fv sequences after fragmentation with DNaseI in the presence of Mn2+. , 1995, Nucleic acids research.

[19]  Sasha F. Levy,et al.  Gene Architectures that Minimize Cost of Gene Expression. , 2017, Molecular cell.

[20]  C. Marx,et al.  Large-Effect Beneficial Synonymous Mutations Mediate Rapid and Parallel Adaptation in a Bacterium , 2016, Molecular biology and evolution.

[21]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[22]  J. Walker,et al.  Over-production of proteins in Escherichia coli: mutant hosts that allow synthesis of some membrane proteins and globular proteins at high levels. , 1996, Journal of molecular biology.

[23]  Diethard Tautz,et al.  Random sequences are an abundant source of bioactive RNAs or peptides , 2017, Nature Ecology &Evolution.

[24]  Reinhard Wolf,et al.  Coding-Sequence Determinants of Gene Expression in Escherichia coli , 2009 .

[25]  Joao C. Guimaraes,et al.  Massive Factorial Design Untangles Coding Sequences Determinants of Translation Efficacy , 2017, bioRxiv.

[26]  R. Vale,et al.  RNA Phase Transitions in Repeat Expansion Disorders , 2017, Nature.

[27]  Joakim Näsvall,et al.  Compensating the Fitness Costs of Synonymous Mutations , 2016, Molecular biology and evolution.

[28]  O. Kuipers,et al.  Live Cell Imaging of Bacillus subtilis and Streptococcus pneumoniae using Automated Time-lapse Microscopy , 2011, Journal of visualized experiments : JoVE.

[29]  Christopher A. Voigt,et al.  Automated design of synthetic ribosome binding sites to control protein expression , 2016 .

[30]  C. Kurland,et al.  Codon usage determines translation rate in Escherichia coli. , 1989, Journal of molecular biology.

[31]  L. Hurst,et al.  Hearing silence: non-neutral evolution at synonymous sites in mammals , 2006, Nature Reviews Genetics.

[32]  Tong Zhou,et al.  A Universal Trend of Reduced mRNA Stability near the Translation-Initiation Site in Prokaryotes and Eukaryotes , 2010, PLoS Comput. Biol..

[33]  Dae-Hee Lee,et al.  Comparative genomics and experimental evolution of Escherichia coli BL21(DE3) strains reveal the landscape of toxicity escape from membrane protein overproduction , 2015, Scientific Reports.

[34]  W. Stemmer DNA shuffling by random fragmentation and reassembly: in vitro recombination for molecular evolution. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[35]  P. Genevaux,et al.  De-convoluting the Genetic Adaptations of E. coli C41(DE3) in Real Time Reveals How Alleviating Protein Production Stress Improves Yields. , 2015, Cell reports.

[36]  C. Kurland,et al.  Codon preferences in free-living microorganisms. , 1990, Microbiological reviews.

[37]  David Tollervey,et al.  Coding-Sequence Determinants of Gene Expression in Escherichia coli , 2009, Science.