Is an observed non-co-linear RNA product spliced in trans, in cis or just in vitro?

Global transcriptome investigations often result in the detection of an enormous number of transcripts composed of non-co-linear sequence fragments. Such ‘aberrant’ transcript products may arise from post-transcriptional events or genetic rearrangements, or may otherwise be false positives (sequencing/alignment errors or in vitro artifacts). Moreover, post-transcriptionally non-co-linear (‘PtNcl’) transcripts can arise from trans-splicing or back-splicing in cis (to generate so-called ‘circular RNA’). Here, we collected previously-predicted human non-co-linear RNA candidates, and designed a validation procedure integrating in silico filters with multiple experimental validation steps to examine their authenticity. We showed that >50% of the tested candidates were in vitro artifacts, even though some had been previously validated by RT-PCR. After excluding the possibility of genetic rearrangements, we distinguished between trans-spliced and circular RNAs, and confirmed that these two splicing forms can share the same non-co-linear junction. Importantly, the experimentally-confirmed PtNcl RNA events and their corresponding PtNcl splicing types (i.e. trans-splicing, circular RNA, or both sharing the same junction) were all expressed in rhesus macaque, and some were even expressed in mouse. Our study thus describes an essential procedure for confirming PtNcl transcripts, and provides further insight into the evolutionary role of PtNcl RNA events, opening up this important, but understudied, class of post-transcriptional events for comprehensive characterization.

[1]  T. Gingeras Implications of chimaeric non-co-linear transcripts , 2009, Nature.

[2]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[3]  Huanming Yang,et al.  Deep RNA sequencing at single base-pair resolution reveals high complexity of the rice transcriptome. , 2010, Genome research.

[4]  Kathleen R. Cho,et al.  Scrambled exons , 1991, Cell.

[5]  M. Tress,et al.  Chimeras taking shape: Potential functions of proteins encoded by chimeric RNA transcripts , 2012, Genome research.

[6]  M. Rubin,et al.  SLC45A3-ELK4 is a novel and frequent erythroblast transformation-specific fusion transcript in prostate cancer. , 2009, Cancer research.

[7]  Jian Gu,et al.  RNA-Seq Mapping and Detection of Gene Fusions with a Suffix Array Algorithm , 2012, PLoS Comput. Biol..

[8]  A. Reymond,et al.  Tandem chimerism as a means to increase protein complexity in the human genome. , 2005, Genome research.

[9]  J. Taylor,et al.  Template switching by reverse transcriptase during DNA synthesis , 1990, Journal of virology.

[10]  A. Eychène,et al.  A new MAFia in cancer , 2008, Nature Reviews Cancer.

[11]  Enrico Macii,et al.  Bellerophontes: an RNA-Seq data analysis framework for chimeric transcripts discovery based on accurate fusion model , 2012, Bioinform..

[12]  P. Sharp,et al.  Trans splicing of mrna precursors in vitro , 1985, Cell.

[13]  C Joel McManus,et al.  Global analysis of trans-splicing in Drosophila , 2010, Proceedings of the National Academy of Sciences.

[14]  David S. Wishart,et al.  An improved method to detect correct protein folds using partial clustering , 2013, BMC Bioinformatics.

[15]  Mark Wade,et al.  Post-transcriptional exon shuffling events in humans can be evolutionarily conserved and abundant. , 2011, Genome research.

[16]  N. Samani,et al.  A genome-wide survey demonstrates widespread non-linear mRNA in expressed sequences from multiple species , 2005, Nucleic acids research.

[17]  Sanghyuk Lee,et al.  ChimerDB 2.0—a knowledgebase for fusion genes updated , 2009, Nucleic Acids Res..

[18]  Marco Beccuti,et al.  State of art fusion-finder algorithms are suitable to detect transcription-induced chimeras in normal tissues? , 2013, BMC Bioinformatics.

[19]  J. Harrow,et al.  Identifying protein-coding genes in genomic sequences , 2009, Genome Biology.

[20]  M. Kimmel,et al.  Conflict of interest statement. None declared. , 2010 .

[21]  Jørgen Kjems,et al.  miRNA‐dependent gene silencing involving Ago2‐mediated cleavage of a circular antisense RNA , 2011, The EMBO journal.

[22]  Alfonso Valencia,et al.  ChiTaRS: a database of human, mouse and fruit fly chimeric transcripts and RNA-sequencing data , 2012, Nucleic Acids Res..

[23]  S. Salzberg,et al.  TopHat-Fusion: an algorithm for discovery of novel fusion transcripts , 2011, Genome Biology.

[24]  S. Mitalipov,et al.  Isolation and Characterization of Novel Rhesus Monkey Embryonic Stem Cell Lines , 2006, Stem cells.

[25]  Thomas D. Wu,et al.  Deep RNA sequencing analysis of readthrough gene fusions in human prostate adenocarcinoma and reference samples , 2011, BMC Medical Genomics.

[26]  S. Luo,et al.  Chimeric transcript discovery by paired-end transcriptome sequencing , 2009, Proceedings of the National Academy of Sciences.

[27]  M. Coca-Prados,et al.  Electron microscopic evidence for the circular form of RNA in the cytoplasm of eukaryotic cells , 1979, Nature.

[28]  Sebastian D. Mackowiak,et al.  Circular RNAs are a large class of animal RNAs with regulatory potency , 2013, Nature.

[29]  David Tollervey,et al.  Apparent Non-Canonical Trans-Splicing Is Generated by Reverse Transcriptase In Vitro , 2010, PloS one.

[30]  Andrea Tanzer,et al.  A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection , 2014, Genome Biology.

[31]  Xiang Shao,et al.  Bioinformatic analysis of exon repetition, exon scrambling and trans-splicing in humans , 2006, Bioinform..

[32]  J. Sklar,et al.  A Neoplastic Gene Fusion Mimics Trans-Splicing of RNAs in Normal Human Cells , 2008, Science.

[33]  Toshiro Aigaki,et al.  Alternative trans‐splicing: a novel mode of pre‐mRNA processing , 2006, Biology of the cell.

[34]  D. Solnick Trans splicing of mRNA precursors , 1985, Cell.

[35]  Alfonso Valencia,et al.  Novel domain combinations in proteins encoded by chimeric transcripts , 2012, Bioinform..

[36]  S. Donatelli,et al.  State-of-the-Art Fusion-Finder Algorithms Sensitivity and Specificity , 2013, BioMed research international.

[37]  Xin Li,et al.  Short Homologous Sequences Are Strongly Associated with the Generation of Chimeric RNAs in Eukaryotes , 2008, Journal of Molecular Evolution.

[38]  J. Kjems,et al.  Natural RNA circles function as efficient microRNA sponges , 2013, Nature.

[39]  Jonathan M. Mudge,et al.  Evidence for Transcript Networks Composed of Chimeric RNAs in Human Cells , 2012, PloS one.

[40]  P. Hall,et al.  Transcriptional consequences of genomic structural aberrations in breast cancer. , 2011, Genome research.

[41]  Lei Ma,et al.  Identification and analysis of pig chimeric mRNAs using RNA sequencing data , 2012, BMC Genomics.

[42]  A. Ashworth,et al.  Identification of gene fusion transcripts by transcriptome sequencing in BRCA1-mutated breast cancers and cell lines , 2011, BMC Medical Genomics.

[43]  Fatih Ozsolak,et al.  RNA sequencing: advances, challenges and opportunities , 2011, Nature Reviews Genetics.

[44]  Michel Eduardo Beleza Yamagishi,et al.  Detection of human interchromosomal trans-splicing in sequence databanks , 2010, Briefings Bioinform..

[45]  R. Dorn,et al.  The Modifier of mdg4 Locus in Drosophila: Functional Complexity is Resolved by trans Splicing , 2003, Genetica.

[46]  Lee T. Sam,et al.  Transcriptome Sequencing to Detect Gene Fusions in Cancer , 2009, Nature.

[47]  R. Sorek,et al.  Transcription-mediated gene fusion in the human genome. , 2005, Genome research.

[48]  Charles Gawad,et al.  Circular RNAs Are the Predominant Transcript Isoform from Hundreds of Human Genes in Diverse Cell Types , 2012, PloS one.

[49]  Süleyman Cenk Sahinalp,et al.  deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data , 2011, PLoS Comput. Biol..

[50]  P. Brown,et al.  Circular RNA Is Expressed across the Eukaryotic Tree of Life , 2014, PloS one.

[51]  Ieuan Clay,et al.  The transcriptional interactome: gene expression in 3D. , 2010, Current opinion in genetics & development.

[52]  R. Veitia,et al.  Reverse transcriptase template switching and false alternative transcripts. , 2006, Genomics.

[53]  E. Brody,et al.  Temperature-dependent template switching during in vitro cDNA synthesis by the AMV-reverse transcriptase. , 1992, Nucleic acids research.

[54]  S. Redaelli,et al.  FusionAnalyser: a new graphical, event-driven tool for fusion rearrangements discovery , 2012, Nucleic acids research.

[55]  Denise Anderson,et al.  FusionFinder: A Software Tool to Identify Expressed Gene Fusion Candidates from RNA-Seq Data , 2012, PloS one.

[56]  Trees-Juen Chuang,et al.  Integrative transcriptome sequencing identifies trans-splicing events with important roles in human embryonic stem cell pluripotency , 2014, Genome research.

[57]  Brian J. Stevenson,et al.  Transcriptome-guided characterization of genomic rearrangements in a breast cancer cell line , 2009, Proceedings of the National Academy of Sciences.

[58]  Toshiro Aigaki,et al.  Alternative splicing of lola generates 19 transcription factors controlling axon guidance in Drosophila , 2003, Nature Neuroscience.

[59]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[60]  Michael K. Slevin,et al.  Circular RNAs are abundant, conserved, and associated with ALU repeats. , 2013, RNA.