Mapping the RNA-Seq trash bin

Prokaryotic transcripts constitute almost always uninterrupted intervals when mapped back to the genome. Split reads, i.e., RNA-seq reads consisting of parts that only map to discontiguous loci, are thus disregarded in most analysis pipelines. There are, however, some well-known exceptions, in particular, tRNA splicing and circularized small RNAs in Archaea as well as self-splicing introns. Here, we reanalyze a series of published RNA-seq data sets, screening them specifically for non-contiguously mapping reads. We recover most of the known cases together with several novel archaeal ncRNAs associated with circularized products. In Eubacteria, only a handful of interesting candidates were obtained beyond a few previously described group I and group II introns. Most of the atypically mapping reads do not appear to correspond to well-defined, specifically processed products. Whether this diffuse background is, at least in part, an incidental by-product of prokaryotic RNA processing or whether it consists entirely of technical artifacts of reverse transcription or amplification remains unknown.

[1]  Andrea Tanzer,et al.  A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection , 2014, Genome Biology.

[2]  Sebastian D. Mackowiak,et al.  Circular RNAs are a large class of animal RNAs with regulatory potency , 2013, Nature.

[3]  J. Kjems,et al.  Natural RNA circles function as efficient microRNA sponges , 2013, Nature.

[4]  Michael K. Slevin,et al.  Circular RNAs are abundant, conserved, and associated with ALU repeats. , 2013, RNA.

[5]  N. Polacek,et al.  tRNA-Derived Fragments Target the Ribosome and Function as Regulatory Non-Coding RNA in Haloferax volcanii , 2012, Archaea.

[6]  I. Moll,et al.  Selective translation during stress in Escherichia coli. , 2012, Trends in biochemical sciences.

[7]  T. Pan,et al.  Genome-wide Identification and Quantitative Analysis of Cleaved tRNA Fragments Induced by Cellular Stress* , 2012, The Journal of Biological Chemistry.

[8]  L. Randau,et al.  RNA processing in the minimal organism Nanoarchaeum equitans , 2012, Genome Biology.

[9]  M. Tress,et al.  Chimeras taking shape: Potential functions of proteins encoded by chimeric RNA transcripts , 2012, Genome research.

[10]  A. Schleiffer,et al.  Diversity and roles of (t)RNA ligases , 2012, Cellular and Molecular Life Sciences.

[11]  Charles Gawad,et al.  Circular RNAs Are the Predominant Transcript Isoform from Hundreds of Human Genes in Diverse Cell Types , 2012, PloS one.

[12]  Schraga Schwartz,et al.  Transcriptome-wide discovery of circular RNAs in Archaea , 2011, Nucleic acids research.

[13]  P. Stadler,et al.  Genome-wide transcriptome analysis of the plant pathogen Xanthomonas identifies sRNAs with putative virulence functions , 2011, Nucleic acids research.

[14]  Li Wu,et al.  Database for bacterial group II introns , 2011, Nucleic Acids Res..

[15]  Isabella Moll,et al.  Selective Translation of Leaderless mRNAs by Specialized Ribosomes Generated by MazF in Escherichia coli , 2011, Cell.

[16]  S. Shuman,et al.  RtcB, a Novel RNA Ligase, Can Catalyze tRNA Splicing and HAC1 mRNA Splicing in Vivo* , 2011, The Journal of Biological Chemistry.

[17]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[18]  M. Belfort,et al.  Learning to live together: mutualism between self-splicing introns and their hosts , 2011, BMC Biology.

[19]  G. Tocchini-Valentini,et al.  Evolution of introns in the archaeal world , 2011, Proceedings of the National Academy of Sciences.

[20]  S. Shuman,et al.  RtcB Is the RNA Ligase Component of an Escherichia coli RNA Repair Operon* , 2011, The Journal of Biological Chemistry.

[21]  J. Yates,et al.  Archaeal 3′-phosphate RNA splicing ligase characterization identifies the missing component in tRNA maturation , 2011, Proceedings of the National Academy of Sciences.

[22]  Kristin Reiche,et al.  The primary transcriptome of the major human pathogen Helicobacter pylori , 2010, Nature.

[23]  David Tollervey,et al.  Apparent Non-Canonical Trans-Splicing Is Generated by Reverse Transcriptase In Vitro , 2010, PloS one.

[24]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[25]  Ilka U. Heinemann,et al.  Transfer RNA processing in archaea: Unusual pathways and enzymes , 2010, FEBS letters.

[26]  A. Malhotra,et al.  A novel class of small RNAs: tRNA-derived RNA fragments (tRFs). , 2009, Genes & development.

[27]  T. Gingeras Implications of chimaeric non-co-linear transcripts , 2009, Nature.

[28]  Peter F. Stadler,et al.  Fast Mapping of Short Sequences with Mismatches, Insertions and Deletions Using Index Structures , 2009, PLoS Comput. Biol..

[29]  H. Nielsen,et al.  Group I introns: Moving in new directions , 2009, RNA biology.

[30]  Takashi Itoh,et al.  Gain and loss of an intron in a protein-coding gene in Archaea: the case of an archaeal RNA pseudouridine synthase gene , 2009, BMC Evolutionary Biology.

[31]  R. Parker,et al.  Stressing Out over tRNA Cleavage , 2009, Cell.

[32]  Masaru Tomita,et al.  Tri-split tRNA is a transfer RNA made from 3 transcripts that provides insight into the evolution of fragmented tRNAs in archaea , 2009, Proceedings of the National Academy of Sciences.

[33]  Pamela J Green,et al.  tRNA cleavage is a conserved response to oxidative stress in eukaryotes. , 2008, RNA.

[34]  Hui Zhou,et al.  Stress-induced tRNA-derived RNAs: a novel class of small RNAs in the primitive eukaryote Giardia lamblia , 2008, Nucleic acids research.

[35]  D. Söll,et al.  Transfer RNA genes in pieces , 2008, EMBO reports.

[36]  A. Kolstø,et al.  Survey of group I and group II introns in 29 sequenced genomes of the Bacillus cereus group: insights into their spread and evolution , 2008, Nucleic acids research.

[37]  M. Irimia,et al.  When good transcripts go bad: artifactual RT-PCR 'splicing' and genome analysis. , 2008, BioEssays : news and reviews in molecular, cellular and developmental biology.

[38]  Peter F. Stadler,et al.  Small ncRNA transcriptome analysis from Aspergillus fumigatus suggests a novel mechanism for regulation of protein synthesis , 2008, Nucleic acids research.

[39]  Yu Zhou,et al.  GISSD: Group I Intron Sequence and Structure Database , 2007, Nucleic Acids Res..

[40]  S. Shuman,et al.  Reprogramming the tRNA-splicing activity of a bacterial RNA repair enzyme , 2007, Nucleic acids research.

[41]  M. Tomita,et al.  In silico screening of archaeal tRNA-encoding genes having multiple introns with bulge-helix-bulge splicing motifs. , 2007, RNA.

[42]  E. Delong,et al.  Archaeal pre-mRNA splicing: a connection to hetero-oligomeric splicing endonuclease. , 2006, Biochemical and biophysical research communications.

[43]  R. Veitia,et al.  Reverse transcriptase template switching and false alternative transcripts. , 2006, Genomics.

[44]  A. Kolstø,et al.  Unusual Group II Introns in Bacteria of the Bacillus cereus Group , 2005, Journal of bacteriology.

[45]  Dieter Jahn,et al.  Nanoarchaeum equitans creates functional tRNAs from separate genes for their 5′- and 3′-halves , 2005, Nature.

[46]  Sean R Eddy,et al.  Circular box C/D RNAs in Pyrococcus furiosus. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Henri Grosjean,et al.  Identification of BHB splicing motifs in intron-containing tRNAs from 18 archaea: evolutionary implications. , 2003, RNA.

[48]  Sanjay K. Singh,et al.  Two reactions of Haloferax volcanii RNA splicing enzymes: joining of exons and circularization of introns. , 2003, RNA.

[49]  A. Hüttenhofer,et al.  RNomics in Archaea reveals a further link between splicing of archaeal introns and rRNA processing. , 2002, Nucleic acids research.

[50]  Chankyu Park,et al.  Group I Self-Splicing Intron in the recA Gene of Bacillus anthracis , 2001, Journal of bacteriology.

[51]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[52]  D. Biniszkiewicz,et al.  Self‐splicing group I intron in cyanobacterial initiator methionine tRNA: evidence for lateral transfer of introns in bacteria. , 1994, The EMBO journal.

[53]  R. Garrett,et al.  Protein-coding introns from the 23S rRNA-encoding gene form stable circles in the hyperthermophilic archaeon Pyrobaculum organotrophum. , 1992, Gene.

[54]  R. Levitz,et al.  Bacteriophage T4 anticodon nuclease, polynucleotide kinase and RNA ligase reprocess the host lysine tRNA. , 1987, The EMBO journal.

[55]  M. David,et al.  Bacteriophage T4-induced anticodon-loop nuclease detected in a host strain restrictive to RNA ligase mutants. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[56]  W. Filipowicz,et al.  RNA ligation via 2'-phosphomonoester, 3'5'-phosphodiester linkage: requirement of 2',3'-cyclic phosphate termini and involvement of a 5'-hydroxyl polynucleotide kinase. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[57]  Christian Zwieb,et al.  tmRDB (tmRNA database) , 2000, Nucleic Acids Res..

[58]  T. Cech Self-splicing of group I introns. , 1990, Annual review of biochemistry.