circMeta: a unified computational framework for genomic feature annotation and differential expression analysis of circular RNAs

MOTIVATION Circular RNAs (circRNAs), a class of non-coding RNAs generated by non-canonical back-splicing events, have emerged as key players in many biological processes. Although numerous tools detect circRNAs from rRNA-depleted RNA-seq data using reads that span back-splicing junctions, computational tools to identify the genomic features that regulate circRNA biogenesis are still lacking. In addition, rigorous statistical methods for differential expression (DE) analysis of circRNAs remain underdeveloped.
RESULTS We present circMeta, a unified computational framework for circRNA analysis. circMeta has three primary functional modules: (i) a pipeline for comprehensive annotation of genomic features related to circRNA biogenesis, including the length of introns flanking circularized exons, repetitive elements such as Alu elements and SINEs, a competition score for circularization, and RNA editing in introns flanking back-splicing sites; (ii) a two-stage DE approach based on circular junction reads to quantitatively compare circRNA levels; and (iii) a Bayesian hierarchical model for DE analysis based on the ratio of circular reads to linear reads at back-splicing sites, to study spatial and temporal regulation of circRNA production. Both proposed DE methods, without and with consideration of host genes, outperform existing methods by achieving better control of the false discovery rate (FDR) with comparable statistical power. Moreover, the DE circRNAs identified by the two-stage approach show potential biological functions in Gene Ontology and circRNA-miRNA-mRNA networks that cannot be detected using existing mRNA DE methods. Furthermore, top DE circRNAs were validated by RT-qPCR using divergent primers spanning back-splicing junctions.
AVAILABILITY The software circMeta is freely available at https://github.com/lichen-lab/circMeta.
SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
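
To make two quantities named in the abstract concrete, the following is a minimal illustrative sketch, not circMeta's implementation. It computes a circular-read proportion at one back-splicing site and applies the standard Benjamini-Hochberg adjustment commonly used for FDR control. The function names and the choice of circular / (circular + linear) as the ratio are assumptions made for illustration; the abstract does not specify the exact form used.

```python
def circular_ratio(circ_reads, linear_reads):
    """Proportion of circular junction reads at one back-splicing site.

    Assumption: ratio defined as circ / (circ + linear); the source only
    says "ratio of circular reads to linear reads".
    """
    total = circ_reads + linear_reads
    return circ_reads / total if total > 0 else float("nan")


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    # Indices of p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


if __name__ == "__main__":
    # Hypothetical read counts and p-values, for illustration only.
    print(circular_ratio(30, 70))
    print(benjamini_hochberg([0.01, 0.04, 0.03, 0.20]))
```

A circRNA would then be called DE at a chosen FDR level (for example 0.05) when its adjusted p-value falls below that level; the per-circRNA p-values themselves would come from whichever DE test is used.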