Circular RNA Detection from High-throughput Sequencing

Alternative splicing is the production of multiple mRNA isoforms from a single gene through alternative selection of exons or splice sites during pre-mRNA splicing. Canonical alternative splicing produces linear RNA by joining an upstream donor site (5' splice site) to a downstream acceptor site (3' splice site). A special form of alternative splicing, back-splicing, instead produces non-coding circular RNA by ligating a downstream donor site (5' splice site) to an upstream acceptor site (3' splice site). Over the past two decades, many studies have reported circular RNAs produced by back-splicing. Although the biogenesis and functions of circular RNAs have attracted considerable attention, these studies have focused on exonic circular RNAs (circRNAs: the donor and acceptor sites both lie on exon boundaries) and circular intronic RNAs (ciRNAs: the donor and acceptor sites both come from a single intron). This focus reflects a relative lack of methods for detecting a third group, circular complex RNAs (ccRNAs: the donor site or the acceptor site does not lie on an exon boundary), which contain at least one exon and one or more flanking introns. Studying ccRNAs is a significant first step toward filling this void. In this paper, we developed a new computational algorithm that detects all three types of circular RNA. We applied the algorithm to a set of RNA-seq data to examine the composition of circular RNAs in that dataset. Surprisingly, ccRNAs were the second most common type of circular RNA, while circRNAs were, as expected, the most common.
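The three definitions above can be illustrated with a minimal sketch. This is not the algorithm developed in the paper, only a toy classifier built from the definitions in the abstract. It assumes a single plus-strand gene with non-overlapping exons given as 1-based inclusive (start, end) pairs; the function name, the coordinate convention, and the "linear" and "unclassified" labels are hypothetical choices made for illustration.

```python
from typing import List, Tuple

Exon = Tuple[int, int]  # (start, end), 1-based inclusive, plus strand (assumed)


def classify_back_splice(acceptor: int, donor: int, exons: List[Exon]) -> str:
    """Label a junction using the circRNA / ciRNA / ccRNA definitions.

    acceptor: position of the 3' splice site (first base of the circle).
    donor:    position of the 5' splice site (last base of the circle).
    """
    # Back-splicing joins a downstream donor to an upstream acceptor.
    # If the donor is not downstream, the order is that of linear splicing.
    if donor <= acceptor:
        return "linear"

    exons = sorted(exons)
    acceptor_on_boundary = any(start == acceptor for start, _ in exons)
    donor_on_boundary = any(end == donor for _, end in exons)

    # circRNA: both sites lie on exon boundaries.
    if acceptor_on_boundary and donor_on_boundary:
        return "circRNA"

    # Introns are the gaps between consecutive exons.
    introns = [
        (exons[i][1] + 1, exons[i + 1][0] - 1) for i in range(len(exons) - 1)
    ]

    # ciRNA: both sites fall inside the same intron.
    if any(start <= acceptor and donor <= end for start, end in introns):
        return "ciRNA"

    # ccRNA: at least one site is off an exon boundary, and the circle
    # contains at least one whole exon plus flanking intronic sequence.
    if any(acceptor <= start and end <= donor for start, end in exons):
        return "ccRNA"

    return "unclassified"


if __name__ == "__main__":
    gene = [(100, 200), (300, 400), (500, 600)]
    for acc, don in [(300, 400), (210, 290), (250, 450), (400, 300)]:
        print(acc, don, classify_back_splice(acc, don, gene))
```

A real detector would first have to find back-splice junctions from split or chimeric read alignments and handle both strands and multiple transcripts per gene; the sketch only covers the final labeling step.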
