uORF-Tools—Workflow for the determination of translation-regulatory upstream open reading frames

Ribosome profiling (ribo-seq) provides a means to analyze active translation by determining ribosome occupancy in a transcriptome-wide manner. The vast majority of ribosome protected fragments (RPFs) resides within the protein-coding sequence of mRNAs. However, commonly reads are also found within the transcript leader sequence (TLS) (aka 5’ untranslated region) preceding the main open reading frame (ORF), indicating the translation of regulatory upstream ORFs (uORFs). Here, we present a workflow for the identification of translation-regulatory uORFs. Specifically, uORF-Tools identifies uORFs within a given dataset and generates a uORF annotation file. In addition, a comprehensive human uORF annotation file, based on 35 ribo-seq files, is provided, which can serve as an alternative input file for the workflow. To assess the translation-regulatory activity of the uORFs, stimulus-induced changes in the ratio of the RPFs residing in the main ORFs relative to those found in the associated uORFs are determined. The resulting output file allows for the easy identification of candidate uORFs, which have translation-inhibitory effects on their associated main ORFs. uORF-Tools is available as a free and open Snakemake workflow at https://github.com/Biochemistry1-FFM/uORF-Tools. It is easily installed and all necessary tools are provided in a version-controlled manner, which also ensures lasting usability. uORF-Tools is designed for intuitive use and requires only limited computing times and resources.

[1]  Jun Hee Lee,et al.  TED-Seq Identifies the Dynamics of Poly(A) Length during ER Stress. , 2018, Cell reports.

[2]  Fabio Lauria,et al.  Active Ribosome Profiling with RiboLace. , 2018, Cell reports.

[3]  E. Jan,et al.  An Upstream Open Reading Frame Regulates Translation of GADD34 during Cellular Stresses That Induce eIF2α Phosphorylation* , 2009, Journal of Biological Chemistry.

[4]  Patrick B. F. O'Connor,et al.  Translation of 5′ leaders is pervasive in genes resistant to eIF2 repression , 2015, eLife.

[5]  D. Sabatini,et al.  A unifying model for mTORC1-mediated regulation of mRNA translation , 2012, Nature.

[6]  Binbin Shi,et al.  Ribosome elongating footprints denoised by wavelet transform comprehensively characterize dynamic cellular translation events , 2018, Nucleic acids research.

[7]  Sven Rahmann,et al.  Genome analysis , 2022 .

[8]  Tao Liu,et al.  Genome-wide identification and differential analysis of translational initiation , 2017, Nature Communications.

[9]  Beth Walters,et al.  Cap-Independent Translational Control of Carcinogenesis , 2016, Front. Oncol..

[10]  Sven Rahmann,et al.  Snakemake--a scalable bioinformatics workflow engine. , 2012, Bioinformatics.

[11]  Mila Ljujic,et al.  The integrated stress response , 2016, EMBO reports.

[12]  Jon R Lorsch,et al.  The mechanism of eukaryotic translation initiation: new insights and challenges. , 2012, Cold Spring Harbor perspectives in biology.

[13]  Gerben Menschaert,et al.  An update on sORFs.org: a repository of small ORFs identified by ribosome profiling , 2017, Nucleic Acids Res..

[14]  Zhe Ji,et al.  RibORF: Identifying Genome‐Wide Translated Open Reading Frames Using Ribosome Profiling , 2018, Current protocols in molecular biology.

[15]  V. Mootha,et al.  Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans , 2009, Proceedings of the National Academy of Sciences.

[16]  Klaus Wethmar,et al.  The regulatory potential of upstream open reading frames in eukaryotic gene expression , 2014, Wiley interdisciplinary reviews. RNA.

[17]  Anne E Willis,et al.  A perspective on mammalian upstream open reading frame function , 2013, The International Journal of Biochemistry & Cell Biology.

[18]  Xuerui Yang,et al.  Genome-wide assessment of differential translations with ribosome profiling data , 2016, Nature Communications.

[19]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[20]  Masato Miyake,et al.  Integrated stress response of vertebrates is regulated by four eIF2α kinases , 2016, Scientific Reports.

[21]  Russell L. Ault,et al.  A comprehensive catalog of predicted functional upstream open reading frames in humans , 2018, Nucleic acids research.

[22]  Nikolaus Rajewsky,et al.  Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation , 2014, The EMBO journal.

[23]  Xuerui Yang,et al.  De novo annotation and characterization of the translatome with ribosome profiling data , 2017, bioRxiv.

[24]  R. Wek,et al.  Upstream Open Reading Frames Differentially Regulate Gene-specific Translation in the Integrated Stress Response* , 2016, The Journal of Biological Chemistry.

[25]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[26]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[27]  Rafaela Lacerda,et al.  More than just scanning: the importance of cap-independent mRNA translation initiation for cellular stress response and cancer , 2016, Cellular and Molecular Life Sciences.

[28]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[29]  Hélène Touzet,et al.  SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data , 2012, Bioinform..

[30]  Renan Valieris,et al.  Bioconda: sustainable and comprehensive software distribution for the life sciences , 2018, Nature Methods.

[31]  Ralf Zimmer,et al.  Improved Ribo-seq enables identification of cryptic translation events , 2018, Nature Methods.

[32]  Uwe Ohler,et al.  Detecting actively translated open reading frames in ribosome profiling data , 2015, Nature Methods.

[33]  A. Hinnebusch,et al.  Translational control by 5′-untranslated regions of eukaryotic mRNAs , 2016, Science.