CircCode: A Powerful Tool for Identifying circRNA Coding Ability

Circular RNAs (circRNAs), which play vital roles in many regulatory pathways, are widespread in many species. Although many circRNAs have been discovered in plants and animals, the functions of these RNAs have not been fully investigated. In addition to the function of circRNAs as microRNA (miRNA) decoys, the translation potential of circRNAs is important for the study of their functions; yet, few tools are available to identify their translation potential. With the development of high-throughput sequencing technology and the emergence of ribosome profiling technology, it is possible to identify the coding ability of circRNAs with high sensitivity. To evaluate the coding ability of circRNAs, we first developed the CircCode tool and then used CircCode to investigate the translation potential of circRNAs from humans and Arabidopsis thaliana. Based on the ribosome profile databases downloaded from NCBI, we found 3,610 and 1,569 translated circRNAs in humans and A. thaliana, respectively. Finally, we tested the performance of CircCode and found a low false discovery rate and high sensitivity for identifying circRNA coding ability. CircCode, a Python 3–based framework for identifying the coding ability of circRNAs, is also a simple and powerful command line-based tool. To investigate the translation potential of circRNAs, the user can simply fill in the given configuration file and run the Python 3 scripts. The tool is freely available at https://github.com/PSSUN/CircCode.

[1]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[2]  P. Willems,et al.  N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana , 2017, Molecular & Cellular Proteomics.

[3]  Li Yang,et al.  Regulation of circRNA biogenesis , 2015, RNA biology.

[4]  Qian-Hao Zhu,et al.  PlantcircBase: A Database for Plant Circular RNAs. , 2017, Molecular plant.

[5]  Peifeng Li,et al.  Biogenesis of circular RNAs and their roles in cardiovascular development and pathology , 2018, The FEBS journal.

[6]  Christoph Dieterich,et al.  circtools—a one-stop software solution for circular RNA research , 2018, Bioinform..

[7]  Ming Chen,et al.  CircPro: an integrated tool for the identification of circRNAs with protein‐coding potential , 2017, Bioinform..

[8]  P. Hsu,et al.  Small but Mighty: Functional Peptides Encoded by Small ORFs in Plants , 2018, Proteomics.

[9]  Yann Ponty,et al.  GenRGenS: software for generating random genomic sequences and structures , 2006, Bioinform..

[10]  N. Rajewsky,et al.  Translation of CircRNAs , 2017, Molecular cell.

[11]  Yong Zhang,et al.  CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine , 2007, Nucleic Acids Res..

[12]  Nicholas T. Ingolia,et al.  Ribosome Profiling Provides Evidence that Large Noncoding RNAs Do Not Encode Proteins , 2013, Cell.

[13]  Ling-Ling Chen,et al.  Genome-Wide Annotation of circRNAs and Their Alternative Back-Splicing/Splicing with CIRCexplorer Pipeline. , 2018, Methods in molecular biology.

[14]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[15]  F. Zhao,et al.  CIRI: an efficient and unbiased algorithm for de novo circular RNA identification , 2015, Genome Biology.

[16]  J. Wilusz,et al.  Non-AUG translation: a new start for protein synthesis in eukaryotes , 2017, Genes & development.

[17]  Guangchuang Yu,et al.  clusterProfiler: an R package for comparing biological themes among gene clusters. , 2012, Omics : a journal of integrative biology.

[18]  Li Yang,et al.  CIRCpedia v2: An Updated Database for Comprehensive Circular RNA Annotation and Expression Comparison , 2018, Genom. Proteom. Bioinform..

[19]  Haixu Tang,et al.  FragGeneScan: predicting genes in short and error-prone reads , 2010, Nucleic acids research.

[20]  J. Rinn,et al.  Peptidomic discovery of short open reading frame-encoded peptides in human cells , 2012, Nature chemical biology.

[21]  Ribosome profiling: a Hi‐Def monitor for protein synthesis at the genome‐wide scale , 2017, Wiley interdisciplinary reviews. RNA.

[22]  Audrey M. Michel,et al.  Ribosome profiling: a Hi-Def monitor for protein synthesis at the genome-wide scale , 2013, Wiley interdisciplinary reviews. RNA.

[23]  Armaghan W. Naik,et al.  Conserved non-AUG uORFs revealed by a novel regression analysis of ribosome profiling data , 2018, Genome research.

[24]  Nicholas T. Ingolia,et al.  Ribosome Profiling of Mouse Embryonic Stem Cells Reveals the Complexity and Dynamics of Mammalian Proteomes , 2011, Cell.

[25]  Yang Zhang,et al.  Extensive translation of circular RNAs driven by N6-methyladenosine , 2017, Cell Research.

[26]  Jianmin Wu,et al.  KOBAS server: a web-based platform for automated annotation and pathway identification , 2006, Nucleic Acids Res..

[27]  Fabricio M. Lopes,et al.  BASiNET—BiologicAl Sequences NETwork: a case study on coding and non-coding RNAs identification , 2018, Nucleic acids research.

[28]  Xiaoduan Li,et al.  Circular RNAs and their Emerging Roles as Diagnostic and Prognostic Biomarkers in Ovarian Cancer. , 2020, Cancer letters.

[29]  Jinrong Fu,et al.  Circular RNAs and Their Emerging Roles in Immune Regulation , 2018, Front. Immunol..

[30]  Christoph Dieterich,et al.  Computational approaches for circular RNA analysis , 2019, Wiley interdisciplinary reviews. RNA.

[31]  J. Weissman,et al.  Ribosome profiling reveals the what, when, where and how of protein synthesis , 2015, Nature Reviews Molecular Cell Biology.

[32]  Uwe Ohler,et al.  Super-resolution ribosome profiling reveals unannotated translation events in Arabidopsis , 2016, Proceedings of the National Academy of Sciences.

[33]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.