Correcting 4sU induced quantification bias in nucleotide conversion RNA-seq data

Nucleoside analogues like 4-thiouridine (4sU) are used to metabolically label newly synthesized RNA. Chemical conversion of 4sU before sequencing induces T-to-C mismatches in reads sequenced from labelled RNA, allowing to obtain total and labelled RNA expression profiles from a single sequencing library. Cytotoxicity due to extended periods of labelling or high 4sU concentrations has been described, but the effects of extensive 4sU labelling on expression estimates from nucleotide conversion RNA-seq have not been studied. Here, we performed nucleotide conversion RNA-seq with escalating doses of 4sU with short-term labelling (1h) and over a progressive time course (up to 2h) in different cell lines. With high concentrations or at later time points, expression estimates were biased in an RNA half-life dependent manner. We show that bias arose by a combination of reduced mappability of reads carrying multiple conversions, and a global, unspecific underrepresentation of labelled RNA due to impaired reverse transcription efficiency and potentially global reduction of RNA synthesis. We developed a computational tool to rescue unmappable reads, which performed favourably compared to previous read mappers, and a statistical method, which could fully remove remaining bias. All methods developed here are freely available as part of our GRAND-SLAM pipeline and grandR package.

[1]  I. Amit,et al.  Time-resolved single-cell RNA-seq using metabolic RNA labelling , 2022, Nature Reviews Methods Primers.

[2]  Florian Erhard,et al.  grandR: a comprehensive package for nucleotide conversion sequencing data analysis , 2022, bioRxiv.

[3]  L. S. Churchman,et al.  Genome-wide quantification of RNA flow across subcellular compartments reveals determinants of the mammalian transcript life cycle , 2022, bioRxiv.

[4]  Florian Erhard,et al.  Multi-omics reveals principles of gene regulation and pervasive non-productive transcription in the human cytomegalovirus genome , 2022, bioRxiv.

[5]  T. Geiger,et al.  Nascent Ribo-Seq measures ribosomal loading time and reveals kinetic impact on ribosome density , 2021, Nature Methods.

[6]  Florian Erhard,et al.  Targeted protein degradation reveals a direct role of SPT6 in RNAPII elongation and termination , 2021, Molecular cell.

[7]  Micah Thornton,et al.  Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N , 2021, Genome research.

[8]  I. Ulitsky,et al.  SARS-CoV-2 uses a multipronged strategy to impede host protein synthesis , 2021, Nature.

[9]  Carson C. Thoreen,et al.  Roadblock-qPCR: a simple and inexpensive strategy for targeted measurements of mRNA stability , 2020, RNA.

[10]  M. Landthaler,et al.  Integrative functional genomics decodes herpes simplex virus 1 , 2020, Nature Communications.

[11]  J. Shendure,et al.  Sci-fate characterizes the dynamics of gene expression in single cells , 2020, Nature Biotechnology.

[12]  J. Schug,et al.  Comparative evaluation of RNA-Seq library preparation methods for strand-specificity and low input , 2019, Scientific Reports.

[13]  Steven L Salzberg,et al.  Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype , 2019, Nature Biotechnology.

[14]  Veronika A. Herzog,et al.  Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets , 2019, BMC Bioinform..

[15]  M. Borad,et al.  Hepatocytes direct the formation of a pro-metastatic niche in the liver. , 2019, Nature.

[16]  Fabian J Theis,et al.  scSLAM-seq reveals core features of transcription dynamics in single cells , 2019, bioRxiv.

[17]  Florian Erhard,et al.  Dissecting newly transcribed and old RNA using GRAND-SLAM , 2018, Bioinform..

[18]  Meaghan C. Sullivan,et al.  TimeLapse-seq: Adding a temporal dimension to RNA sequencing through nucleoside recoding , 2018, Nature Methods.

[19]  Dietmar Rieder,et al.  Osmium-Mediated Transformation of 4-Thiouridine to Cytidine as Key To Study RNA Dynamics by Sequencing. , 2017, Angewandte Chemie.

[20]  Johannes Zuber,et al.  Thiol-linked alkylation of RNA to assess expression dynamics , 2017, Nature Methods.

[21]  Eun Ji Kim,et al.  Simulation-based comprehensive benchmarking of RNA-seq aligners , 2016, Nature Methods.

[22]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[23]  L. Dölken,et al.  4-thiouridine inhibits rRNA synthesis and causes a nucleolar stress response , 2013, RNA biology.

[24]  Felix Krueger,et al.  Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications , 2011, Bioinform..

[25]  R. Zimmer,et al.  High-resolution gene expression profiling for simultaneous kinetic parameter analysis of RNA synthesis and decay. , 2008, RNA.