Riboexp: an interpretable reinforcement learning framework for ribosome density modeling

Translation elongation is a crucial phase during protein biosynthesis. In this study, we develop a novel deep reinforcement learning-based framework, named Riboexp, to model the determinants of the uneven distribution of ribosomes on mRNA transcripts during translation elongation. In particular, our model employs a policy network to perform a context-dependent feature selection in the setting of ribosome density prediction. Our extensive tests demonstrated that Riboexp can significantly outperform the state-of-the-art methods in predicting ribosome density by up to 5.9% in terms of per-gene Pearson correlation coefficient on the datasets from three species. In addition, Riboexp can indicate more informative sequence features for the prediction task than other commonly used attribution methods in deep learning. In-depth analyses also revealed the meaningful biological insights generated by the Riboexp framework. Moreover, the application of Riboexp in codon optimization resulted in an increase of protein production by around 31% over the previous state-of-the-art method that models ribosome density. These results have established Riboexp as a powerful and useful computational tool in the studies of translation dynamics and protein synthesis. Availability: The data and code of this study are available on GitHub: https://github.com/Liuxg16/Riboexp. Contact:  zengjy321@tsinghua.edu.cn; songsen@tsinghua.edu.cn.

[1]  Daphne Koller,et al.  Causal signals between codon bias, mRNA structure, and the efficiency of translation and elongation , 2014, Molecular systems biology.

[2]  Yun S. Song,et al.  The impact of ribosomal interference, codon usage, and exit tunnel interactions on translation elongation rate variation , 2017, bioRxiv.

[3]  A. Fire,et al.  Wobble base-pairing slows in vivo translation elongation in metazoans. , 2011, RNA.

[4]  Hunter B. Fraser,et al.  Accounting for biases in riboprofiling data indicates a major role for proline in stalling translation , 2014, Genome research.

[5]  Jeffrey A. Hussmann,et al.  Understanding Biases in Ribosome Profiling Experiments Reveals Signatures of Translation Dynamics in Yeast , 2015, bioRxiv.

[6]  R. Green,et al.  Translation Elongation and Recoding in Eukaryotes. , 2018, Cold Spring Harbor perspectives in biology.

[7]  Arshdeep Sekhon,et al.  Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin , 2017, bioRxiv.

[8]  Vadim N. Gladyshev,et al.  Ribonuclease selection for ribosome profiling , 2016, Nucleic acids research.

[9]  C. Dobson,et al.  In vivo translation rates can substantially delay the cotranslational folding of the Escherichia coli cytosolic proteome , 2012, Proceedings of the National Academy of Sciences.

[10]  R. Green,et al.  A systematically-revised ribosome profiling method for bacteria reveals pauses at single-codon resolution , 2019, eLife.

[11]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[12]  W. B. Cavnar,et al.  N-gram-based text categorization , 1994 .

[13]  Michal Ziv-Ukelson,et al.  Composite effects of gene determinants on the translation speed and density of ribosomes , 2011, Genome Biology.

[14]  M. C. Jones,et al.  A reliable data-based bandwidth selection method for kernel density estimation , 1991 .

[15]  Ralf Bundschuh,et al.  RiboProP: a probabilistic ribosome positioning algorithm for ribosome profiling , 2018, Bioinform..

[16]  Nicholas T. Ingolia Ribosome Footprint Profiling of Translation throughout the Genome , 2016, Cell.

[17]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  L. Hurst,et al.  Positively Charged Residues Are the Major Determinants of Ribosomal Velocity , 2013, PLoS biology.

[19]  D. Bartel,et al.  Poly(A)-tail profiling reveals an embryonic switch in translational control , 2014, Nature.

[20]  G. Brar Beyond the Triplet Code: Context Cues Transform Translation , 2016, Cell.

[21]  Jianyang Zeng,et al.  Analysis of Ribosome Stalling and Translation Elongation Dynamics by Deep Learning. , 2017, Cell systems.

[22]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[23]  Dumitru Erhan,et al.  A Benchmark for Interpretability Methods in Deep Neural Networks , 2018, NeurIPS.

[24]  Vadim N. Gladyshev,et al.  Translation inhibitors cause abnormalities in ribosome profiling experiments , 2014, Nucleic acids research.

[25]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[26]  Ronald J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[27]  Yun S. Song,et al.  Prediction of ribosome footprint profile shapes from transcript sequences , 2016, Bioinform..

[28]  Tamir Tuller,et al.  The effect of tRNA levels on decoding times of mRNA codons , 2014, Nucleic acids research.

[29]  Pavel V. Baranov,et al.  Comparative survey of the relative impact of mRNA features on local ribosome profiling read density , 2015, Nature Communications.

[30]  Tamir Tuller,et al.  Estimation of ribosome profiling performance and reproducibility at various levels of resolution , 2016, Biology Direct.

[31]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[32]  Regina Barzilay,et al.  Rationalizing Neural Predictions , 2016, EMNLP.

[33]  Carl Kingsford,et al.  Accurate Recovery of Ribosome Positions Reveals Slow Translation of Wobble-Pairing Codons in Yeast , 2016, RECOMB.

[34]  T. Ideker,et al.  Deciphering signaling specificity with interpretable deep neural networks , 2018, bioRxiv.

[35]  Jeffrey A. Hussmann,et al.  Ribosome Profiling: Global Views of Translation. , 2018, Cold Spring Harbor perspectives in biology.

[36]  Marc'Aurelio Ranzato,et al.  Sequence Level Training with Recurrent Neural Networks , 2015, ICLR.

[37]  Huihao Zhou,et al.  Ribosome stalling induced by mutation of a CNS-specific tRNA causes neurodegeneration , 2014, Science.

[38]  Fabio Lauria,et al.  riboWaltz: Optimization of ribosome P-site positioning in ribosome profiling data , 2017, bioRxiv.

[39]  Tao Jiang,et al.  DeepHINT: understanding HIV-1 integration via deep learning with attention , 2019, Bioinform..

[40]  Adam Siepel,et al.  Scikit-ribo Enables Accurate Estimation and Robust Modeling of Translation Dynamics at Codon Resolution. , 2018, Cell systems.

[41]  Nicholas J. McGlincy,et al.  Accurate design of translational output by a neural network model of ribosome distribution , 2017, Nature Structural & Molecular Biology.

[42]  S. Teichmann,et al.  Cotranslational protein assembly imposes evolutionary constraints on homomeric proteins , 2018, Nature Structural & Molecular Biology.

[43]  Karl Johan Åström,et al.  Optimal control of Markov processes with incomplete state information , 1965 .

[44]  Jeffrey A. Hussmann,et al.  Improved Ribosome-Footprint and mRNA Measurements Provide Insights into Dynamics and Regulation of Yeast Translation. , 2016, Cell reports.

[45]  Eytan Ruppin,et al.  Translation efficiency is determined by both codon bias and folding energy , 2010, Proceedings of the National Academy of Sciences.

[46]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[47]  Lippincott-Schwartz,et al.  Supporting Online Material Materials and Methods Som Text Figs. S1 to S8 Table S1 Movies S1 to S3 a " Silent " Polymorphism in the Mdr1 Gene Changes Substrate Specificity Corrected 30 November 2007; See Last Page , 2022 .

[48]  Tamir Tuller,et al.  Modelling the Efficiency of Codon–tRNA Interactions Based on Codon Usage Bias , 2014, DNA research : an international journal for rapid publication of reports on genes and genomes.

[49]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[50]  Nicholas T. Ingolia,et al.  Rocaglates convert DEAD-box protein eIF4A into a sequence-selective translational repressor , 2016, Nature.

[51]  Jianzhi Zhang,et al.  Balanced Codon Usage Optimizes Eukaryotic Translational Efficiency , 2012, PLoS genetics.

[52]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[53]  Paul M. Sharp,et al.  Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes , 1986, Nucleic Acids Res..