choros: correction of sequence-based biases for accurate quantification of ribosome profiling data

Ribosome profiling quantifies translation genome-wide by sequencing ribosome-protected fragments, or footprints. Its single-codon resolution allows identification of translation regulation, such as ribosome stalls or pauses, on individual genes. However, enzyme preferences during library preparation lead to pervasive sequence artifacts that obscure translation dynamics. Widespread over- and under-representation of ribosome footprints can dominate local footprint densities and skew estimates of elongation rates by up to five fold. To address these biases and uncover true patterns of translation, we present choros, a computational method that models ribosome footprint distributions to provide bias-corrected footprint counts. choros uses negative binomial regression to accurately estimate two sets of parameters: (i) biological contributions from codon-specific translation elongation rates; and (ii) technical contributions from nuclease digestion and ligation efficiencies. We use these parameter estimates to generate bias correction factors that eliminate sequence artifacts. Applying choros to multiple ribosome profiling datasets, we are able to accurately quantify and attenuate ligation biases to provide more faithful measurements of ribosome distribution. We show that a pattern interpreted as pervasive ribosome pausing near the beginning of coding regions is likely to arise from technical biases. Incorporating choros into standard analysis pipelines will improve biological discovery from measurements of translation.

[1]  Nicholas T. Ingolia,et al.  Streamlined and sensitive mono- and di-ribosome profiling in yeast and human cells , 2023, bioRxiv.

[2]  Nicholas T. Ingolia,et al.  Ribosome stalling during selenoprotein translation exposes a ferroptosis vulnerability in cancer , 2022, bioRxiv.

[3]  S. Tavazoie,et al.  Leucyl-tRNA synthetase is a tumour suppressor in breast cancer and regulates codon-dependent translation dynamics , 2022, Nature Cell Biology.

[4]  Wenfeng Qian,et al.  Disome-seq reveals widespread ribosome collisions that promote cotranslational protein folding , 2021, Genome biology.

[5]  J. Coller,et al.  Quantitative tRNA-sequencing uncovers metazoan tissue-specific tRNA regulation , 2020, Nature Communications.

[6]  M. Waldor,et al.  Comparative tRNA sequencing and RNA mass spectrometry for surveying tRNA modifications , 2020, Nature Chemical Biology.

[7]  N. Guydosh,et al.  Disome and Trisome Profiling Reveal Genome-wide Targets of Ribosome Quality Control. , 2020, Molecular cell.

[8]  J. Szavits-Nossan,et al.  Inferring efficiency of translation initiation and elongation from ribosome profiling , 2019, bioRxiv.

[9]  D. Gatfield,et al.  Transcriptome-wide sites of collided ribosomes reveal principles of translational pausing , 2019, bioRxiv.

[10]  Yuichiro Mishima,et al.  Genome-wide survey of ribosome collision , 2019, bioRxiv.

[11]  D. Sulzer,et al.  Widespread Alterations in Translation Elongation in the Brain of Juvenile Fmr1 Knockout Mice. , 2019, Cell reports.

[12]  R. Green,et al.  High-Resolution Ribosome Profiling Defines Discrete Ribosome Elongation States and Translational Regulation during Cellular Stress. , 2019, Molecular cell.

[13]  E. O’Shea,et al.  Translational Control through Differential Ribosome Pausing during Amino Acid Limitation in Mammalian Cells. , 2018, Molecular cell.

[14]  Dan D. Erdmann-Pham,et al.  The key parameters that govern translation efficiency , 2018, bioRxiv.

[15]  M. Garber,et al.  Transcriptome-wide Analysis of Roles for tRNA Modifications in Translational Regulation. , 2017, Molecular cell.

[16]  Nicholas J. McGlincy,et al.  Accurate design of translational output by a neural network model of ribosome distribution , 2017, Nature Structural & Molecular Biology.

[17]  Jianyang Zeng,et al.  Analysis of Ribosome Stalling and Translation Elongation Dynamics by Deep Learning. , 2017, Cell systems.

[18]  Nicholas T Ingolia,et al.  Transcriptome-wide measurement of translation by ribosome profiling. , 2017, Methods.

[19]  R. Green,et al.  eIF5A Functions Globally in Translation Elongation and Termination. , 2017, Molecular cell.

[20]  Felix Naef,et al.  Ribosome profiling and dynamic regulation of translation in mammals. , 2017, Current opinion in genetics & development.

[21]  Patrick B. F. O'Connor,et al.  Insights into the mechanisms of eukaryotic translation gained with ribosome profiling , 2016, Nucleic acids research.

[22]  Juan M. Vaquerizas,et al.  Dual randomization of oligonucleotides to reduce the bias in ribosome-profiling libraries , 2016, Methods.

[23]  P. Canoll,et al.  Ligation-free ribosome profiling of cell type-specific translation in the brain , 2016, Genome Biology.

[24]  Yun S. Song,et al.  Prediction of ribosome footprint profile shapes from transcript sequences , 2016, Bioinform..

[25]  A. Heger,et al.  UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy , 2016, bioRxiv.

[26]  Xuerui Yang,et al.  Genome-wide assessment of differential translations with ribosome profiling data , 2016, Nature Communications.

[27]  Reuven Agami,et al.  Tumour-specific proline vulnerability uncovered by differential ribosome codon reading , 2016, Nature.

[28]  Pavel V. Baranov,et al.  Comparative survey of the relative impact of mRNA features on local ribosome profiling read density , 2015, Nature Communications.

[29]  Gunnar Rätsch,et al.  RiboDiff: detecting changes of mRNA translation efficiency from ribosome footprints , 2015, bioRxiv.

[30]  Alexander Bartholomäus,et al.  Mapping the non-standardized biases of ribosome profiling , 2016, Biological chemistry.

[31]  J. Weissman,et al.  Ribosome profiling reveals the what, when, where and how of protein synthesis , 2015, Nature Reviews Molecular Cell Biology.

[32]  Jeffrey A. Hussmann,et al.  Understanding Biases in Ribosome Profiling Experiments Reveals Signatures of Translation Dynamics in Yeast , 2015, bioRxiv.

[33]  Jeffrey A. Hussmann,et al.  Improved ribosome-footprint and mRNA measurements provide insights into dynamics and regulation of yeast translation , 2015, bioRxiv.

[34]  Hunter B. Fraser,et al.  Accounting for biases in riboprofiling data indicates a major role for proline in stalling translation , 2014, Genome research.

[35]  Daphne Koller,et al.  Causal signals between codon bias, mRNA structure, and the efficiency of translation and elongation , 2014, Molecular systems biology.

[36]  Vadim N. Gladyshev,et al.  Translation inhibitors cause abnormalities in ribosome profiling experiments , 2014, Nucleic acids research.

[37]  Shu-Bing Qian,et al.  Ribosome profiling reveals sequence-independent post-initiation pausing as a signature of translation , 2014, Cell Research.

[38]  P. Brown,et al.  Distinct stages of the translation elongation cycle revealed by sequencing ribosome-protected mRNA fragments , 2014, eLife.

[39]  C. Thermes,et al.  Library preparation methods for next-generation sequencing: tone down the bias. , 2014, Experimental cell research.

[40]  Richard A. Olshen,et al.  Assessing gene-level translational control from ribosome profiling , 2013, Bioinform..

[41]  Ana Kozomara,et al.  Reducing ligation bias of small RNAs in libraries for next generation sequencing , 2012, Silence.

[42]  Ryan T Fuchs,et al.  Structural bias in T4 RNA ligase-mediated 3′-adapter ligation , 2012, Nucleic acids research.

[43]  A. Fire,et al.  Wobble base-pairing slows in vivo translation elongation in metazoans. , 2011, RNA.

[44]  R. Sachidanandam,et al.  Identification and remediation of biases in the activity of RNA ligases in small-RNA deep sequencing , 2011, Nucleic acids research.

[45]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[46]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[47]  K. Hansen,et al.  Biases in Illumina transcriptome sequencing caused by random hexamer priming , 2010, Nucleic acids research.

[48]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[49]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[50]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[51]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[52]  D. Botstein,et al.  Exploring the new world of the genome with DNA microarrays , 1999, Nature Genetics.