Improving the study of RNA dynamics through advances in RNA-seq with metabolic labeling and nucleotide-recoding chemistry

RNA metabolic labeling using 4-thiouridine (s4U) captures the dynamics of RNA synthesis and decay. The power of this approach is dependent on appropriate quantification of labeled and unlabeled sequencing reads, which can be compromised by the apparent loss of s4U-labeled reads in a process we refer to as dropout. Here we show that s4U-containing transcripts can be selectively lost when RNA samples are handled under sub-optimal conditions, but that this loss can be minimized using an optimized protocol. We demonstrate a second cause of dropout in nucleotide recoding and RNA sequencing (NR-seq) experiments that is computational and downstream of library preparation. NR-seq experiments involve chemically converting s4U from a uridine analog to a cytidine analog and using the apparent T-to-C mutations to identify the populations of newly synthesized RNA. We show that high levels of T-to-C mutations can prevent read alignment with some computational pipelines, but that this bias can be overcome using improved alignment pipelines. Importantly, kinetic parameter estimates are affected by dropout independent of the NR chemistry employed, and all chemistries are practically indistinguishable in bulk, short-read RNA-seq experiments. Dropout is an avoidable problem that can be identified by including unlabeled controls, and mitigated through improved sample handing and read alignment that together improve the robustness and reproducibility of NR-seq experiments.

[1]  Florian Erhard,et al.  Correcting 4sU induced quantification bias in nucleotide conversion RNA-seq data , 2023, bioRxiv.

[2]  M. Simon,et al.  bakR: uncovering differential RNA synthesis and degradation kinetics transcriptome-wide with Bayesian hierarchical modeling , 2023, RNA.

[3]  L. S. Churchman,et al.  Genome-wide quantification of RNA flow across subcellular compartments reveals determinants of the mammalian transcript life cycle , 2022, bioRxiv.

[4]  Krishna Shankara Narayanan,et al.  Parsing the role of NSP1 in SARS-CoV-2 infection , 2022, bioRxiv.

[5]  J. Steitz,et al.  STL-seq reveals pause-release and termination kinetics for promoter-proximal paused RNA polymerase II transcripts. , 2021, Molecular cell.

[6]  Christoph Dieterich,et al.  A comparison of metabolic labeling and statistical methods to infer genome-wide dynamics of RNA turnover , 2021, Briefings Bioinform..

[7]  Florian Erhard,et al.  Targeted protein degradation reveals a direct role of SPT6 in RNAPII elongation and termination , 2021, Molecular cell.

[8]  Micah Thornton,et al.  Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N , 2021, Genome research.

[9]  Sven Rahmann,et al.  Sustainable data analysis with Snakemake , 2021, F1000Research.

[10]  Charles Y. Lin,et al.  Modulating Androgen Receptor-Driven Transcription in Prostate Cancer with Selective CDK9 Inhibitors. , 2020, Cell chemical biology.

[11]  E. Segal,et al.  Gene Architecture and Sequence Composition Underpin Selective Dependency of Nuclear Export of Long RNAs on NXF1 and the TREX Complex. , 2020, Molecular cell.

[12]  X. Weng,et al.  Acrylonitrile‐Mediated Nascent RNA Sequencing for Transcriptome‐Wide Profiling of Cellular RNA Dynamics , 2020, Advanced science.

[13]  Dietmar Rieder,et al.  Thioguanosine Conversion Enables mRNA‐Lifetime Evaluation by RNA Sequencing Using Double Metabolic Labeling (TUC‐seq DUAL) , 2020, Angewandte Chemie.

[14]  Steven L Salzberg,et al.  Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype , 2019, Nature Biotechnology.

[15]  J. Weissman,et al.  Mapping Transcriptomic Vector Fields of Single Cells , 2022, Cell.

[16]  Veronika A. Herzog,et al.  Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets , 2019, BMC Bioinform..

[17]  Jeremy A. Schofield,et al.  Gaining insight into transcriptome‐wide RNA population dynamics through the chemistry of 4‐thiouridine , 2018, Wiley interdisciplinary reviews. RNA.

[18]  Jeremy A. Schofield,et al.  Expanding the Nucleoside Recoding Toolkit: Revealing RNA Population Dynamics with 6-Thioguanosine. , 2018, Journal of the American Chemical Society.

[19]  T. Maniatis,et al.  Solid phase chemistry to covalently and reversibly capture thiolated RNA , 2018, Nucleic acids research.

[20]  Florian Erhard,et al.  Dissecting newly transcribed and old RNA using GRAND-SLAM , 2018, Bioinform..

[21]  Meaghan C. Sullivan,et al.  TimeLapse-seq: Adding a temporal dimension to RNA sequencing through nucleoside recoding , 2018, Nature Methods.

[22]  Dietmar Rieder,et al.  Osmium-Mediated Transformation of 4-Thiouridine to Cytidine as Key To Study RNA Dynamics by Sequencing. , 2017, Angewandte Chemie.

[23]  Johannes Zuber,et al.  Thiol-linked alkylation of RNA to assess expression dynamics , 2017, Nature Methods.

[24]  T. Takumi,et al.  Unusual semi‐extractability as a hallmark of nuclear body‐associated architectural noncoding RNAs , 2017, The EMBO journal.

[25]  J. Gagneur,et al.  TT-seq maps the human transient transcriptome , 2016, Science.

[26]  Mark B Gerstein,et al.  Tracking Distinct RNA Populations Using Efficient and Reversible Covalent Chemistry. , 2015, Molecular cell.

[27]  Steven L Salzberg,et al.  HISAT: a fast spliced aligner with low memory requirements , 2015, Nature Methods.

[28]  D. Gresham,et al.  Determination of in vivo RNA kinetics using RATE-seq , 2014, RNA.

[29]  Paul Theodor Pyl,et al.  HTSeq—a Python framework to work with high-throughput sequencing data , 2014, bioRxiv.

[30]  Arndt von Haeseler,et al.  NextGenMap: fast and accurate read mapping in highly polymorphic genomes , 2013, Bioinform..

[31]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[32]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[33]  Felix Krueger,et al.  Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications , 2011, Bioinform..

[34]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[35]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[36]  R. Zimmer,et al.  High-resolution gene expression profiling for simultaneous kinetic parameter analysis of RNA synthesis and decay. , 2008, RNA.

[37]  W. Schmid,et al.  Microarray analysis of newly synthesized RNA in cells and animals , 2007, Proceedings of the National Academy of Sciences.

[38]  J. Boothroyd,et al.  Biosynthetic labeling of RNA with uracil phosphoribosyltransferase allows cell-specific microarray analysis of mRNA synthesis and decay , 2005, Nature Biotechnology.

[39]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..