Protein functional features are reflected in the patterns of mRNA translation speed

Background: The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These “synonymous mRNAs” may differ substantially in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may yield different amounts of the translated polypeptide. These mRNA features also act locally, resulting in a non-uniform translation speed along the mRNA, which has previously been related to some protein structural features and also used to explain some dramatic effects of “silent” single-nucleotide polymorphisms (SNPs). In this work we perform the first large-scale analysis of the relationship between three experimental proxies of local mRNA translation efficiency and the local features of the corresponding encoded proteins.

Results: We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed show distinctive patterns around the mRNA regions coding for certain local protein features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein’s important structural and functional features.

Conclusions: These results support the idea that the genome codes the protein functional features not only as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence for the final polypeptide. They open the possibility of predicting a protein’s functional regions from a single genomic sequence, and have implications for heterologous protein expression and for fine-tuning protein function.
