Ribosome profiling reveals pervasive translation outside of annotated protein-coding genes.

Ribosome profiling suggests that ribosomes occupy many regions of the transcriptome thought to be noncoding, including 5' UTRs and long noncoding RNAs (lncRNAs). Apparent ribosome footprints outside of protein-coding regions raise the possibility of artifacts unrelated to translation, particularly when they occupy multiple, overlapping open reading frames (ORFs). Here, we show hallmarks of translation in these footprints: copurification with the large ribosomal subunit, response to drugs targeting elongation, trinucleotide periodicity, and initiation at early AUGs. We develop a metric for distinguishing between 80S footprints and nonribosomal sources using footprint size distributions, which validates the vast majority of footprints outside of coding regions. We present evidence for polypeptide production beyond annotated genes, including the induction of immune responses following human cytomegalovirus (HCMV) infection. Translation is pervasive on cytosolic transcripts outside of conserved reading frames, and direct detection of this expanded universe of translated products enables efforts at understanding how cells manage and exploit its consequences.

[1]  J. Lykke-Andersen,et al.  Execution of nonsense-mediated mRNA decay: what defines a substrate? , 2009, Current opinion in cell biology.

[2]  S. Salzberg,et al.  The Transcriptional Landscape of the Mammalian Genome , 2005, Science.

[3]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[4]  Nicholas T. Ingolia,et al.  Ribosome Profiling Provides Evidence that Large Noncoding RNAs Do Not Encode Proteins , 2013, Cell.

[5]  Phillip A Sharp,et al.  A triple helix stabilizes the 3' ends of long noncoding RNAs that lack poly(A) tails. , 2012, Genes & development.

[6]  Nicholas T. Ingolia,et al.  Ribosome Profiling of Mouse Embryonic Stem Cells Reveals the Complexity and Dynamics of Mammalian Proteomes , 2011, Cell.

[7]  Cole Trapnell,et al.  Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. , 2011, Genes & development.

[8]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[9]  A. Hinnebusch,et al.  Regulation of Translation Initiation in Eukaryotes: Mechanisms and Biological Targets , 2009, Cell.

[10]  Josephine A. Reinhardt,et al.  De Novo ORFs in Drosophila Are Important to Organismal Fitness and Evolved Rapidly from Previously Non-coding Sequences , 2013, PLoS genetics.

[11]  J. Pérez-Ortín,et al.  Eukaryotic mRNA decay: methodologies, pathways, and links to other stages of gene expression. , 2013, Journal of molecular biology.

[12]  D. Vignali,et al.  Design and construction of 2A peptide-linked multicistronic vectors. , 2012, Cold Spring Harbor protocols.

[13]  John W. Tukey,et al.  Exploratory Data Analysis. , 1979 .

[14]  J. Steitz,et al.  Classification of gas5 as a Multi-Small-Nucleolar-RNA (snoRNA) Host Gene and a Member of the 5′-Terminal Oligopyrimidine Gene Family Reveals Common Features of snoRNA Host Genes , 1998, Molecular and Cellular Biology.

[15]  Y. Tor,et al.  Antibiotics that target protein synthesis , 2011, Wiley interdisciplinary reviews. RNA.

[16]  J. Rinn,et al.  Ribosome profiling reveals resemblance between long non-coding RNAs and 5′ leaders of coding RNAs , 2013, Development.

[17]  Frances M. G. Pearl,et al.  Conserved Regulation of Cardiac Calcium Uptake by Peptides Encoded in Small Open Reading Frames , 2013, Science.

[18]  P. Brown,et al.  Distinct stages of the translation elongation cycle revealed by sequencing ribosome-protected mRNA fragments , 2014, eLife.

[19]  Claire Halpin,et al.  E unum pluribus: multiple proteins from a self-processing polyprotein. , 2006, Trends in biotechnology.

[20]  A. Otsuka,et al.  DNA-binding and enzymatic domains of the bifunctional biotin operon repressor (BirA) of Escherichia coli. , 1986, Gene.

[21]  Miguel A. Andrade-Navarro,et al.  uORFdb—a comprehensive literature database on eukaryotic uORF biology , 2013, Nucleic Acids Res..

[22]  Cole Trapnell,et al.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions , 2013, Genome Biology.

[23]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[24]  Michael F. Lin,et al.  Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals , 2009, Nature.

[25]  P. Philippsen,et al.  Additional modules for versatile and economical PCR‐based gene deletion and modification in Saccharomyces cerevisiae , 1998, Yeast.

[26]  Louis J. Picker,et al.  Broadly targeted human cytomegalovirus-specific CD4+ and CD8+ T cells dominate the memory compartments of exposed subjects , 2005, The Journal of experimental medicine.

[27]  S Kobayashi,et al.  Small Peptides Switch the Transcriptional Activity of Shavenbaby During Drosophila Embryogenesis , 2010, Science.

[28]  Nicholas T. Ingolia,et al.  High-Resolution View of the Yeast Meiotic Program Revealed by Ribosome Profiling , 2011, Science.

[29]  J. Yewdell,et al.  DRiPs solidify: progress in understanding endogenous MHC class I antigen processing. , 2011, Trends in immunology.

[30]  V. Mootha,et al.  Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans , 2009, Proceedings of the National Academy of Sciences.

[31]  David L. Spector,et al.  3′ End Processing of a Long Nuclear-Retained Noncoding RNA Yields a tRNA-like Cytoplasmic RNA , 2008, Cell.

[32]  Thomas E. Royce,et al.  Global Identification of Human Transcribed Sequences with Genome Tiling Arrays , 2004, Science.

[33]  D. Bartel,et al.  lincRNAs: Genomics, Evolution, and Mechanisms , 2013, Cell.

[34]  A. Hinnebusch,et al.  In vivo stabilization of preinitiation complexes by formaldehyde cross-linking. , 2007, Methods in enzymology.

[35]  T. Iwawaki,et al.  Cotranslational targeting of XBP1 protein to the membrane promotes cytoplasmic splicing of its own mRNA. , 2009, Molecular cell.

[36]  Jiao Ma,et al.  Toddler: An Embryonic Signal That Promotes Cell Movement via Apelin Receptors , 2014, Science.

[37]  N. Shastri,et al.  Leucine-tRNA Initiates at CUG Start Codons for Protein Synthesis and Presentation by MHC Class I , 2012, Science.

[38]  Jon R Lorsch,et al.  A mechanistic overview of translation initiation in eukaryotes , 2012, Nature Structural &Molecular Biology.

[39]  P. Walter,et al.  Ribosome pausing and stacking during translation of a eukaryotic mRNA. , 1988, The EMBO journal.

[40]  D Botstein,et al.  Many random sequences functionally replace the secretion signal sequence of yeast invertase. , 1987, Science.

[41]  George Coukos,et al.  Cancer immunotherapy comes of age , 2011, Nature.

[42]  Anna M. McGeachy,et al.  The ribosome profiling strategy for monitoring translation in vivo by deep sequencing of ribosome-protected mRNA fragments , 2012, Nature Protocols.

[43]  P. Greengard,et al.  A Translational Profiling Approach for the Molecular Characterization of CNS Cell Types , 2008, Cell.

[44]  Tilman Schneider-Poetsch,et al.  Inhibition of Eukaryotic Translation Elongation by Cycloheximide and Lactimidomycin , 2010, Nature chemical biology.

[45]  Manolis Kellis,et al.  PhyloCSF: a comparative genomics method to distinguish protein coding and non-coding regions , 2011, Bioinform..

[46]  Marco Y. Hein,et al.  Decoding Human Cytomegalovirus , 2012, Science.

[47]  Manolis Kellis,et al.  Performance and Scalability of Discriminative Metrics for Comparative Gene Identification in 12 Drosophila Genomes , 2008, PLoS Comput. Biol..

[48]  D. Beckett,et al.  A minimal peptide substrate in biotin holoenzyme synthetase‐catalyzed biotinylation , 2008, Protein science : a publication of the Protein Society.

[49]  Patrick Rodriguez,et al.  Efficient biotinylation and single-step purification of tagged transcription factors in mammalian cells and transgenic mice , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[50]  J. Steitz Polypeptide Chain Initiation: Nucleotide Sequences of the Three Ribosomal Binding Sites in Bacteriophage R17 RNA , 1969, Nature.

[51]  César A. Hidalgo,et al.  Proto-genes and de novo gene birth , 2012, Nature.

[52]  Nicholas T Ingolia,et al.  Genome-wide translational profiling by ribosome footprinting. , 2010, Methods in enzymology.

[53]  Rachel Green,et al.  Dom34 Rescues Ribosomes in 3′ Untranslated Regions , 2014, Cell.

[54]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.