Detection and mitigation of spurious antisense expression with RoSA

Antisense transcription can affect sense gene expression in a range of ways, including (but not limited to) impeding transcription initiation, disrupting post-transcriptional processes, and enhancing, slowing, or even preventing transcription of the sense gene. Strand-specific RNA-Seq protocols preserve the strand information of the original RNA in the data, and so can be used to identify where antisense transcription may be implicated in regulating gene expression. However, our analysis of 199 strand-specific RNA-Seq experiments reveals that spurious antisense reads are often present in these datasets at levels greater than 1% of sense gene expression levels. Furthermore, these levels can vary substantially even between replicates in the same experiment, and can disrupt downstream analysis when incorrectly assigned antisense counts dominate the set of genes with high antisense transcription levels. Currently, no tools exist to detect or correct for this spurious antisense signal. Our tool, RoSA (Removal of Spurious Antisense), detects the presence of high levels of spurious antisense read alignments in strand-specific RNA-Seq datasets. It uses incorrectly spliced reads on the antisense strand and/or ERCC spike-ins (if present in the data) to calculate both global and gene-specific antisense correction factors. We demonstrate the utility of our tool by filtering out spurious antisense transcript counts in an Arabidopsis thaliana RNA-Seq experiment. Availability: RoSA is open source software available under the GPL licence via the Barton Group GitHub page https://github.com/bartongroup.
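The idea of a global correction factor derived from spike-ins can be illustrated with a minimal sketch. This is not RoSA's implementation or API. It assumes that ERCC spike-ins carry no genuine antisense transcription, so any antisense reads aligned to them are spurious, and that spurious antisense counts scale linearly with sense counts. The function names, the gene identifiers, and all counts below are hypothetical.

```python
from typing import Dict, Tuple

# Each entry maps a feature ID to (sense_count, antisense_count).
Counts = Dict[str, Tuple[int, int]]


def global_antisense_ratio(spikein_counts: Counts) -> float:
    """Pooled antisense:sense ratio over all spike-ins.

    Assumption: spike-ins have no real antisense signal, so this
    ratio estimates the spurious antisense rate of the library.
    """
    sense = sum(s for s, _ in spikein_counts.values())
    antisense = sum(a for _, a in spikein_counts.values())
    if sense == 0:
        raise ValueError("no sense reads on spike-ins; cannot estimate ratio")
    return antisense / sense


def correct_antisense(gene_counts: Counts, ratio: float) -> Dict[str, float]:
    """Subtract the expected spurious antisense count from each gene.

    Assumption: spurious antisense reads are proportional to the
    gene's sense count. Corrected values are floored at zero.
    """
    corrected = {}
    for gene, (sense, antisense) in gene_counts.items():
        expected_spurious = ratio * sense
        corrected[gene] = max(0.0, antisense - expected_spurious)
    return corrected


# Hypothetical counts for illustration only.
spikeins = {"ERCC-00002": (10000, 150), "ERCC-00004": (10000, 50)}
genes = {"geneA": (3000, 30), "geneB": (100, 60)}

ratio = global_antisense_ratio(spikeins)
corrected = correct_antisense(genes, ratio)
for gene, value in corrected.items():
    print(gene, round(value, 2))
```

In this toy example the antisense count of the highly expressed geneA is fully explained by the estimated spurious rate, while most of the antisense signal of geneB survives the correction. A gene-specific factor, such as one derived from incorrectly spliced antisense reads, would need per-gene evidence that this sketch does not model.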
