Codon influence on protein expression in E. coli correlates with mRNA levels

Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyse the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli.

[1]  Y. Pilpel,et al.  An Evolutionarily Conserved Mechanism for Controlling the Efficiency of Protein Translation , 2010, Cell.

[2]  D. Frishman,et al.  Protein abundance profiling of the Escherichia coli cytosol , 2008, BMC Genomics.

[3]  M. Dreyfus,et al.  Bacteriophage T7 RNA polymerase travels far ahead of ribosomes in vivo , 1992, Journal of bacteriology.

[4]  Yoshiyuki Kuchino,et al.  Codon and amino-acid specificities of a transfer RNA are both converted by a single post-transcriptional modification , 1988, Nature.

[5]  Claus O. Wilke,et al.  Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution , 2008, Cell.

[6]  M. Ehrenberg,et al.  tmRNA-induced release of messenger RNA from stalled ribosomes. , 2005, Journal of molecular biology.

[7]  N. Blüthgen,et al.  Molecular Systems Biology 9; Article number 675; doi:10.1038/msb.2013.32 Citation: Molecular Systems Biology 9:675 , 2022 .

[8]  Nathan Morris,et al.  Codon Optimality Is a Major Determinant of mRNA Stability , 2015, Cell.

[9]  E. Grayhack,et al.  Translation of CGA codon repeats in yeast involves quality control components and ribosomal protein L1 , 2013, RNA.

[10]  R. Ehrlich,et al.  Synonymous codon selection controls in vivo turnover and amount of mRNA in Escherichia coli bla and ompA genes , 1996, Journal of bacteriology.

[11]  Sriram Kosuri,et al.  Causes and Effects of N-Terminal Codon Bias in Bacterial Genes , 2013, Science.

[12]  Mark Gerstein,et al.  Robotic cloning and Protein Production Platform of the Northeast Structural Genomics Consortium. , 2005, Methods in enzymology.

[13]  Svetlana A. Shabalina,et al.  Differential Arginylation of Actin Isoforms Is Regulated by Coding Sequence–Dependent Degradation , 2010, Science.

[14]  Gaetano T Montelione,et al.  Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli , 2011, Microbial Informatics and Experimentation.

[15]  J. Elf,et al.  Selective Charging of tRNA Isoacceptors Explains Patterns of Codon Usage , 2003, Science.

[16]  Huiyi Chen,et al.  Genome-wide study of mRNA degradation and transcript elongation in Escherichia coli , 2015, Molecular systems biology.

[17]  S. Liebhaber,et al.  Influence of duplexes 3' to the mRNA initiation codon on the efficiency of monosome formation. , 1988, Biochemistry.

[18]  M. Springer,et al.  The relationship between translational control and mRNA degradation for the Escherichia coli threonyl-tRNA synthetase gene. , 2001, Journal of molecular biology.

[19]  Gaetano T Montelione,et al.  The high-throughput protein sample production platform of the Northeast Structural Genomics Consortium. , 2010, Journal of structural biology.

[20]  Reinhard Wolf,et al.  Coding-Sequence Determinants of Gene Expression in Escherichia coli , 2009 .

[21]  Gaohua Liu,et al.  Preparation of protein samples for NMR structure, function, and small-molecule screening studies. , 2011, Methods in enzymology.

[22]  Eytan Ruppin,et al.  Translation efficiency is determined by both codon bias and folding energy , 2010, Proceedings of the National Academy of Sciences.

[23]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[24]  Chava Kimchi-Sarfaty,et al.  Exposing synonymous mutations. , 2014, Trends in genetics : TIG.

[25]  H V Westerhoff,et al.  The use of lac-type promoters in control analysis. , 1993, European journal of biochemistry.

[26]  David H Burkhardt,et al.  Quantifying Absolute Protein Synthesis Rates Reveals Principles Underlying Allocation of Cellular Resources , 2014, Cell.

[27]  A. Kaji,et al.  Protein synthesis factors (RF1, RF2, RF3, RRF, and tmRNA) and peptidyl-tRNA hydrolase rescue stalled ribosomes at sense codons. , 2012, Journal of molecular biology.

[28]  Jamie Richards,et al.  Quality control of bacterial mRNA decoding and decay. , 2008, Biochimica et biophysica acta.

[29]  A. Novick,et al.  ENZYME INDUCTION AS AN ALL-OR-NONE PHENOMENON. , 1957, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Nicol N. Schraudolph,et al.  A Role for Codon Order in Translation Dynamics , 2010, Cell.

[31]  Y. Pilpel,et al.  Determinants of translation efficiency and accuracy , 2011, Molecular systems biology.

[32]  M Nirenberg,et al.  RNA codons and protein synthesis. 15. Dissimilar responses of mammalian and bacterial transfer RNA fractions to messenger RNA codons. , 1968, Journal of molecular biology.

[33]  M. Inouye,et al.  Role of the AGA/AGG codons, the rarest codons in global gene expression in Escherichia coli. , 1994, Genes & development.

[34]  Tamir Tuller,et al.  The effect of tRNA levels on decoding times of mRNA codons , 2014, Nucleic acids research.

[35]  M. Dreyfus,et al.  The stability of Escherichia coli lacZ mRNA depends upon the simultaneity of its synthesis and translation. , 1995, The EMBO journal.

[36]  Edoardo M Airoldi,et al.  Estimating selection on synonymous codon usage from noisy experimental data. , 2013, Molecular biology and evolution.

[37]  T. Ikemura Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system. , 1981, Journal of molecular biology.

[38]  G. Montelione,et al.  High-level production of uniformly 15N-and 13C-enriched fusion proteins in Escherichia coli , 1996 .

[39]  A. Krogh,et al.  Prediction of lipoprotein signal peptides in Gram‐negative bacteria , 2003, Protein science : a publication of the Protein Society.

[40]  J. J. Olivares-Trejo,et al.  Adenine-containing codons enhance protein synthesis by promoting mRNA binding to ribosomal 30S subunits provided that specific tRNAs are not exhausted. , 2012, Biochimie.

[41]  Lorenz Wernisch,et al.  Unexpected correlations between gene expression and codon usage bias from microarray data for the whole Escherichia coli K-12 genome. , 2003, Nucleic Acids Research.

[42]  Rotem Sorek,et al.  Differential translation tunes uneven production of operon-encoded proteins. , 2013, Cell reports.

[43]  E. Goldman,et al.  Low-usage codons in Escherichia coli, yeast, fruit fly and primates. , 1991, Gene.

[44]  Kirsten Jung,et al.  Translation Elongation Factor EF-P Alleviates Ribosome Stalling at Polyproline Stretches , 2013, Science.

[45]  H. Akaike A new look at the statistical model identification , 1974 .

[46]  J. Elf,et al.  Selective charging of tRNA isoacceptors induced by amino‐acid starvation , 2005, EMBO reports.

[47]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[48]  C. Kurland,et al.  Co-variation of tRNA abundance and codon usage in Escherichia coli at different growth rates. , 1996, Journal of molecular biology.

[49]  P. Spencer,et al.  Silent substitutions predictably alter translation elongation rates and protein folding efficiencies. , 2012, Journal of molecular biology.

[50]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[51]  R. Green,et al.  Dom34:Hbs1 Promotes Subunit Dissociation and Peptidyl-tRNA Drop-Off to Initiate No-Go Decay , 2010, Science.

[52]  Vicent Pelechano,et al.  Widespread Co-translational RNA Decay Reveals Ribosome Dynamics , 2015, Cell.

[53]  M. Bulmer The selection-mutation-drift theory of synonymous codon usage. , 1991, Genetics.

[54]  D. Belin,et al.  Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter , 1995, Journal of bacteriology.

[55]  Jacques Ninio Fine tuning of ribosomal accuracy , 1986, FEBS letters.

[56]  David H. Mathews,et al.  RNAstructure: software for RNA secondary structure prediction and analysis , 2010, BMC Bioinformatics.

[57]  T. Abo,et al.  Escherichia coli YaeJ protein mediates a novel ribosome‐rescue pathway distinct from SsrA‐ and ArfA‐mediated pathways , 2011, Molecular microbiology.

[58]  Peter D. Karp,et al.  EcoCyc: fusing model organism databases with systems biology , 2012, Nucleic Acids Res..

[59]  Jianli Lu,et al.  Electrostatics in the ribosomal tunnel modulate chain elongation rates. , 2008, Journal of molecular biology.

[60]  R. Micura,et al.  Escherichia coli Ribosomal Protein S1 Unfolds Structured mRNAs Onto the Ribosome for Active Translation Initiation , 2013, PLoS biology.

[61]  Gene-Wei Li,et al.  The anti-Shine-Dalgarno sequence drives translational pausing and codon choice in bacteria , 2012, Nature.