Bayesian prediction of RNA translation from ribosome profiling

Abstract Ribosome profiling via high-throughput sequencing (ribo-seq) is a promising new technique for characterizing the occupancy of ribosomes on messenger RNA (mRNA) at base-pair resolution. The ribosome is responsible for translating mRNA into proteins, so information about its occupancy offers a detailed view of ribosome density and position which could be used to discover new translated open reading frames (ORFs), among other things. In this work, we propose Rp-Bp, an unsupervised Bayesian approach to predict translated ORFs from ribosome profiles. We use state-of-the-art Markov chain Monte Carlo techniques to estimate posterior distributions of the likelihood of translation of each ORF. Hence, an important feature of Rp-Bp is its ability to incorporate and propagate uncertainty in the prediction process. A second novel contribution is automatic Bayesian selection of read lengths and ribosome P-site offsets (BPPS). We empirically demonstrate that our read length selection technique modestly improves sensitivity by identifying more canonical and non-canonical ORFs. Proteomics- and quantitative translation initiation sequencing-based validation verifies the high quality of all of the predictions. Experimental comparison shows that Rp-Bp results in more peptide identifications and proteomics-validated ORF predictions compared to another recent tool for translation prediction.

[1]  P. Baranov,et al.  Catch me if you can: trapping scanning ribosomes in their footsteps , 2016, Nature Structural &Molecular Biology.

[2]  Pascal Barbry,et al.  RiboProfiling: a Bioconductor package for standard Ribo-seq pipeline processing , 2016, F1000Research.

[3]  Tamir Tuller,et al.  Estimation of ribosome profiling performance and reproducibility at various levels of resolution , 2016, Biology Direct.

[4]  C. Desplan,et al.  Small peptides control heart activity , 2016, Science.

[5]  Uwe Ohler,et al.  Detecting actively translated open reading frames in ribosome profiling data , 2015, Nature Methods.

[6]  Aviv Regev,et al.  A Regression-Based Analysis of Ribosome-Profiling Data Reveals a Conserved Complexity to Mammalian Translation. , 2015, Molecular cell.

[7]  Yang I. Li,et al.  Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling , 2015, bioRxiv.

[8]  Jeffrey A. Hussmann,et al.  Understanding Biases in Ribosome Profiling Experiments Reveals Signatures of Translation Dynamics in Yeast , 2015, bioRxiv.

[9]  C. Dieterich,et al.  Transcriptome-wide measurement of ribosomal occupancy by ribosome profiling. , 2015, Methods.

[10]  Marcel J. T. Reinders,et al.  Unbiased Quantitative Models of Protein Translation Derived from Ribosome Profiling Data , 2015, PLoS Comput. Biol..

[11]  Edward L. Huttlin,et al.  Erratum: A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides , 2015, Nature Biotechnology.

[12]  Gunnar Rätsch,et al.  RiboDiff: detecting changes of mRNA translation efficiency from ribosome footprints , 2015, bioRxiv.

[13]  W. Van Criekinge,et al.  PROTEOFORMER: deep proteome coverage through ribosome profiling and MS integration , 2014, Nucleic acids research.

[14]  Shu-Bing Qian,et al.  Quantitative profiling of initiating ribosomes in vivo , 2014, Nature Methods.

[15]  Tamir Tuller,et al.  The effect of tRNA levels on decoding times of mRNA codons , 2014, Nucleic acids research.

[16]  Nikolaus Rajewsky,et al.  Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation , 2014, The EMBO journal.

[17]  C. Kosiol,et al.  Gaussian process test for high-throughput sequencing time series: application to experimental evolution , 2014, Bioinform..

[18]  Nicholas T. Ingolia Ribosome profiling: new views of translation, from single codons to genome scale , 2014, Nature Reviews Genetics.

[19]  Richard A. Olshen,et al.  Assessing gene-level translational control from ribosome profiling , 2013, Bioinform..

[20]  Nicholas T. Ingolia,et al.  Ribosome Profiling Provides Evidence that Large Noncoding RNAs Do Not Encode Proteins , 2013, Cell.

[21]  K. Gevaert,et al.  Deep Proteome Coverage Based on Ribosome Profiling Aids Mass Spectrometry-based Protein and Peptide Discovery and Provides Evidence of Alternative Translation Products and Near-cognate Translation Initiation Events* , 2013, Molecular & Cellular Proteomics.

[22]  Andrew Gelman,et al.  The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo , 2011, J. Mach. Learn. Res..

[23]  Neil D. Lawrence,et al.  A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression , 2011, BMC Bioinformatics.

[24]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[25]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[26]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[27]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[28]  Christian Sommer,et al.  IPG strip-based peptide fractionation for shotgun proteomics. , 2014, Methods in molecular biology.

[29]  A. Raftery,et al.  Bayes factors , 1995 .

[30]  Baris E. Suzek,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm098 Databases and ontologies UniRef: comprehensive and non-redundant UniProt reference , 2022 .