Improved circRNA Identification by Combining Prediction Algorithms

Non-coding RNA is an interesting class of gene regulators with diverse functionalities. One large subgroup of non-coding RNAs is the recently discovered class of circular RNAs (circRNAs). CircRNAs are conserved and expressed in a tissue and developmental specific manner, although for the vast majority, the functional relevance remains unclear. To identify and quantify circRNAs expression, several bioinformatic pipelines have been developed to assess the catalog of circRNAs in any given total RNA sequencing dataset. We recently compared five different algorithms for circRNA detection, but here this analysis is extended to 11 algorithms. By comparing the number of circRNAs discovered and their respective sensitivity to RNaseR digestion, the sensitivity and specificity of each algorithm are evaluated. Moreover, the ability to predict de novo circRNA, i.e., circRNAs not derived from annotated splice sites, is also determined as well as the effect of eliminating low quality and adaptor-containing reads prior to circRNA prediction. Finally, and most importantly, all possible pair-wise combinations of algorithms are tested and guidelines for algorithm complementarity are provided. Conclusively, the algorithms mostly agree on highly expressed circRNAs, however, in many cases, algorithm-specific false positives with high read counts are predicted, which is resolved by using the shared output from two (or more) algorithms.

[1]  Sebastian D. Mackowiak,et al.  Circular RNAs are a large class of animal RNAs with regulatory potency , 2013, Nature.

[2]  J. Kjems,et al.  Comparison of circular RNA prediction tools , 2015, Nucleic acids research.

[3]  Ling-Ling Chen,et al.  Complementary Sequence-Mediated Exon Circularization , 2014, Cell.

[4]  Petar Glažar,et al.  Circular RNAs in the Mammalian Brain Are Highly Abundant, Conserved, and Dynamically Expressed. , 2015, Molecular cell.

[5]  S. Chandrasegaran,et al.  Rewriting the blueprint of life by synthetic genomics and genome engineering , 2015, Genome Biology.

[6]  F. Zhao,et al.  CIRI: an efficient and unbiased algorithm for de novo circular RNA identification , 2015, Genome Biology.

[7]  Sol Shenker,et al.  Genome-wide analysis of drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation. , 2014, Cell reports.

[8]  Julia Salzman,et al.  Cell-Type Specific Features of Circular RNA Expression , 2013, PLoS genetics.

[9]  Derek Y. Chiang,et al.  MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery , 2010, Nucleic acids research.

[10]  Wilfried Haerty,et al.  Genome-wide discovery of human splicing branchpoints , 2015, Genome research.

[11]  Yang Zhang,et al.  Extensive translation of circular RNAs driven by N6-methyladenosine , 2017, Cell Research.

[12]  Carmen Birchmeier,et al.  Loss of a mammalian circular RNA locus causes miRNA deregulation and affects brain function , 2017, Science.

[13]  J. Kjems,et al.  Natural RNA circles function as efficient microRNA sponges , 2013, Nature.

[14]  E. Schuman,et al.  Neural circular RNAs are derived from synaptic genes and regulated by development and plasticity , 2015, Nature Neuroscience.

[15]  Jørgen Kjems,et al.  miRNA‐dependent gene silencing involving Ago2‐mediated cleavage of a circular antisense RNA , 2011, The EMBO journal.

[16]  Kai Wang,et al.  Circular RNA profile in gliomas revealed by identification tool UROBORUS , 2016, Nucleic acids research.

[17]  Michael K. Slevin,et al.  Circular RNAs are abundant, conserved, and associated with ALU repeats. , 2013, RNA.

[18]  N. Rajewsky,et al.  Circ-ZNF609 Is a Circular RNA that Can Be Translated and Functions in Myogenesis , 2017, Molecular cell.

[19]  Yuan Gao,et al.  Circular RNA identification based on multiple seed matching , 2018, Briefings Bioinform..

[20]  Wei Lin,et al.  A comprehensive overview and evaluation of circular RNA detection tools , 2017, PLoS Comput. Biol..

[21]  Jun Zhang,et al.  Diverse alternative back-splicing and alternative splicing landscape of circular RNAs , 2016, Genome research.

[22]  Linda Szabo,et al.  Statistically based splicing detection reveals neural enrichment and tissue-specific induction of circular RNA during human fetal development , 2015, Genome Biology.

[23]  N. Rajewsky,et al.  Translation of CircRNAs , 2017, Molecular cell.

[24]  Jun Cheng,et al.  Specific identification and quantification of circular RNAs from sequencing data , 2016, Bioinform..