Comparison of circular RNA prediction tools

Circular RNAs (circRNAs) are novel members of the non-coding RNA family. CircRNAs have been known to exist for several decades; however, their widespread abundance has only recently become appreciated. Annotation of circRNAs depends on sequencing reads that span the backsplice junction and therefore map non-linearly to the genome. Several pipelines have been developed to identify these non-linear reads and thereby predict the landscape of circRNAs from deep sequencing datasets. Here, we use common RNA-seq datasets to scrutinize and compare the output of five different algorithms: circRNA_finder, find_circ, CIRCexplorer, CIRI, and MapSplice. We evaluate the levels of bona fide and false-positive circRNAs based on RNase R resistance. With this approach, we observe surprisingly dramatic differences between the algorithms, specifically for the highly expressed circRNAs and the circRNAs derived from proximal splice sites. Collectively, this study emphasizes that circRNA annotation should be handled with care and that several algorithms should ideally be combined to achieve reliable predictions.
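The two ideas in the abstract, combining the output of several algorithms and judging candidates by RNase R resistance, can be illustrated with a minimal sketch. This is not the procedure used in the study. The function names, the junction key `(chrom, start, end, strand)`, the `min_tools` cutoff, the fold threshold, and the pseudocount are all hypothetical choices made for illustration, and the sketch assumes that coordinates from the different tools have already been converted to one common convention.

```python
from collections import defaultdict


def consensus_circrnas(predictions, min_tools=2):
    """Keep backsplice junctions reported by at least `min_tools` tools.

    `predictions` maps a tool name to an iterable of junction keys
    (chrom, start, end, strand). Coordinates are assumed to be already
    harmonized across tools (an assumption, not handled here).
    """
    support = defaultdict(set)
    for tool, junctions in predictions.items():
        for junction in junctions:
            support[junction].add(tool)
    # Return each retained junction with the sorted list of supporting tools.
    return {
        junction: sorted(tools)
        for junction, tools in support.items()
        if len(tools) >= min_tools
    }


def is_rnase_r_resistant(untreated_reads, treated_reads,
                         min_fold=1.0, pseudocount=1.0):
    """Flag a candidate as RNase R resistant (hypothetical criterion).

    Compares backsplice read counts with and without RNase R treatment.
    Counts are assumed to be normalized for library size beforehand.
    The pseudocount avoids division by zero for unsupported candidates.
    """
    fold = (treated_reads + pseudocount) / (untreated_reads + pseudocount)
    return fold >= min_fold
```

A candidate that is reported by several tools and is not depleted after RNase R treatment would be the most trustworthy under these assumptions, whereas a junction called by a single tool and lost after treatment would be a likely false positive.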
