RNA Modification Level Estimation with pulseR

RNA modifications regulate the complex life of transcripts. An experimental approach called LAIC-seq was developed to characterize modification levels on a transcriptome-wide scale. In this method, the modified and unmodified molecules are separated using antibodies specific for a given RNA modification (e.g., m6A). In essence, the procedure of biochemical separation yields three fractions: Input, eluate, and supernatent, which are subjected to RNA-seq. In this work, we present a bioinformatics workflow, which starts from RNA-seq data to infer gene-specific modification levels by a statistical model on a transcriptome-wide scale. Our workflow centers around the pulseR package, which was originally developed for the analysis of metabolic labeling experiments. We demonstrate how to analyze data without external normalization (i.e., in the absence of spike-ins), given high efficiency of separation, and how, alternatively, scaling factors can be derived from unmodified spike-ins. Importantly, our workflow provides an estimate of uncertainty of modification levels in terms of confidence intervals for model parameters, such as gene expression and RNA modification levels. We also compare alternative model parametrizations, log-odds, or the proportion of the modified molecules and discuss the pros and cons of each representation. In summary, our workflow is a versatile approach to RNA modification level estimation, which is open to any read-count-based experimental approach.

[1]  Yu-Sheng Chen,et al.  Dynamic transcriptomic m6A decoration: writers, erasers, readers and functions in RNA metabolism , 2018, Cell Research.

[2]  Chuan He,et al.  Post-transcriptional gene regulation by mRNA modifications , 2016, Nature Reviews Molecular Cell Biology.

[3]  S. Salzberg,et al.  StringTie enables improved reconstruction of a transcriptome from RNA-seq reads , 2015, Nature Biotechnology.

[4]  M. Kupiec,et al.  Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq , 2012, Nature.

[5]  Schraga Schwartz,et al.  The m1A landscape on cytosolic and mitochondrial mRNA at single-base resolution , 2017, Nature.

[6]  Christoph Dieterich,et al.  pulseR: Versatile computational analysis of RNA turnover from metabolic labeling experiments , 2017, Bioinform..

[7]  Bing Ren,et al.  N6-methyladenosine-dependent regulation of messenger RNA stability , 2013 .

[8]  Chengqi Yi,et al.  Transcriptome-wide mapping reveals reversible and dynamic N(1)-methyladenosine methylome. , 2016, Nature chemical biology.

[9]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[10]  Knut Reinert,et al.  Flexbar 3.0 ‐ SIMD and multicore parallelization , 2017, Bioinform..

[11]  O. Elemento,et al.  Comprehensive Analysis of mRNA Methylation Reveals Enrichment in 3′ UTRs and near Stop Codons , 2012, Cell.

[12]  Yuri Motorin,et al.  Detecting RNA modifications in the epitranscriptome: predict and validate , 2017, Nature Reviews Genetics.

[13]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[14]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..