CirRNAPL: A web server for the identification of circRNA based on extreme learning machine

Circular RNA (circRNA) plays an important role in the development of diseases, and it provides a novel idea for drug development. Accurate identification of circRNAs is important for a deeper understanding of their functions. In this study, we developed a new classifier, CirRNAPL, which extracts the features of nucleic acid composition and structure of the circRNA sequence and optimizes the extreme learning machine based on the particle swarm optimization algorithm. We compared CirRNAPL with existing methods, including blast, on three datasets and found CirRNAPL significantly improved the identification accuracy for the three datasets, with accuracies of 0.815, 0.802, and 0.782, respectively. Additionally, we performed sequence alignment on 564 sequences of the independent detection set of the third data set and analyzed the expression level of circRNAs. Results showed the expression level of the sequence is positively correlated with the abundance. A user-friendly CirRNAPL web server is freely available at http://server.malab.cn/CirRNAPL/.

[1]  Xing Gao,et al.  An Improved Protein Structural Classes Prediction Method by Incorporating Both Sequence and Structure Information , 2015, IEEE Transactions on NanoBioscience.

[2]  Schraga Schwartz,et al.  Transcriptome-wide discovery of circular RNAs in Archaea , 2011, Nucleic acids research.

[3]  Xiaoyong Pan,et al.  Discriminating cirRNAs from other lncRNAs using a hierarchical extreme learning machine (H-ELM) algorithm with feature selection , 2017, Zeitschrift für Induktive Abstammungs- und Vererbungslehre.

[4]  Shuigeng Zhou,et al.  A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation , 2009, Bioinform..

[5]  B. Xiao,et al.  Reduced expression of circRNA hsa_circ_0003159 in gastric cancer and its clinical significance , 2018, Journal of clinical laboratory analysis.

[6]  Jan Gorodkin,et al.  WebCircRNA: Classifying the Circular RNA Potential of Coding and Noncoding RNA , 2018, Genes.

[7]  Jun Cheng,et al.  Specific identification and quantification of circular RNAs from sequencing data , 2016, Bioinform..

[8]  T. O. Diener Potato spindle tuber "virus". IV. A replicating, low molecular weight RNA. , 1971, Virology.

[9]  Xiaoyong Pan,et al.  PredcircRNA: computational classification of circular RNA from other long non-coding RNA using hybrid features. , 2015, Molecular bioSystems.

[10]  Mohamed Chaabane,et al.  circDeep: deep learning approach for circular RNA classification from other long non-coding RNA , 2019, Bioinform..

[11]  Nan Liu,et al.  Voting based extreme learning machine , 2012, Inf. Sci..

[12]  Dianhui Wang,et al.  Protein sequence classification using extreme learning machine , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[13]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[14]  Liujuan Cao,et al.  A novel features ranking metric with application to scalable visual and bioinformatics data classification , 2016, Neurocomputing.

[15]  Kai Wang,et al.  Circular RNA profile in gliomas revealed by identification tool UROBORUS , 2016, Nucleic acids research.

[16]  Wei Chen,et al.  iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition , 2013, Nucleic acids research.

[17]  Zhu-Hong You,et al.  Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis , 2013, BMC Bioinformatics.

[18]  Junjie Chen,et al.  Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences , 2015, Nucleic Acids Res..

[19]  Qinghai Bai,et al.  Analysis of Particle Swarm Optimization Algorithm , 2010, Comput. Inf. Sci..

[20]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[21]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[22]  Jifeng Guo,et al.  An Inductive Logic Programming Algorithm Based on Artificial Bee Colony , 2019, J. Inf. Technol. Res..

[23]  Yan Li,et al.  circRNADb: A comprehensive database for human circular RNAs with protein-coding annotations , 2016, Scientific Reports.

[24]  Swetlana Nikolajewa,et al.  DiProDB: a database for dinucleotide properties , 2008, Nucleic Acids Res..

[25]  B. Xiao,et al.  Circular RNA 0068669 as a new biomarker for hepatocellular carcinoma metastasis , 2018, Journal of clinical laboratory analysis.

[26]  Hong-Wei Li,et al.  Identification of circular RNAs and their altered expression under poly(I:C) challenge in key antiviral immune pathways in amphioxus , 2019, Fish & shellfish immunology.

[27]  Ling-Ling Chen,et al.  Complementary Sequence-Mediated Exon Circularization , 2014, Cell.

[28]  Jijun Tang,et al.  PhosPred-RF: A Novel Sequence-Based Predictor for Phosphorylation Sites Using Sequential Information Only , 2017, IEEE Transactions on NanoBioscience.

[29]  Tina R. Patil,et al.  Performance Analysis of Naive Bayes and J 48 Classification Algorithm for Data Classification , 2013 .

[30]  Duc Truong Pham,et al.  Intelligent Optimisation Techniques: Genetic Algorithms, Tabu Search, Simulated Annealing and Neural Networks , 2011 .

[31]  F. Zhao,et al.  Expanded Expression Landscape and Prioritization of Circular RNAs in Mammals. , 2019, Cell reports.

[32]  Wei Chen,et al.  PseKNC-General: a cross-platform package for generating various modes of pseudo nucleotide compositions , 2015, Bioinform..

[33]  Noureddine Zerhouni,et al.  SW-ELM: A summation wavelet extreme learning machine algorithm with a priori parameter initialization , 2014, Neurocomputing.

[34]  T. Conrad,et al.  Acfs: accurate circRNA identification and quantification from RNA-Seq data , 2016, Scientific Reports.

[35]  R. Ji,et al.  Improved and Promising Identification of Human MicroRNAs by Incorporating a High-Quality Negative Set , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[36]  Jiuwen Cao,et al.  Protein Sequence Classification with Improved Extreme Learning Machine Algorithms , 2014, BioMed research international.

[37]  Hsien-Da Huang,et al.  CircNet: a database of circular RNAs derived from transcriptome sequencing data , 2015, Nucleic Acids Res..

[38]  S. Dhanasekaran,et al.  The Landscape of Circular RNA in Cancer , 2019, Cell.

[39]  Tongbao Lin,et al.  Identification and characterization of CircRNAs involved in the regulation of wheat root length , 2019, Biological Research.

[40]  Gang Tian,et al.  Accurate Prediction of Transposon-Derived piRNAs by Integrating Various Sequential and Physicochemical Features , 2016, PloS one.

[41]  Li Yang,et al.  CIRCpedia v2: An Updated Database for Comprehensive Circular RNA Annotation and Expression Comparison , 2018, Genom. Proteom. Bioinform..

[42]  Feilong Cao,et al.  A study on effectiveness of extreme learning machine , 2011, Neurocomputing.

[43]  F. Hubé,et al.  A new method for the identification of thousands of circular RNAs , 2018 .

[44]  B. Liu,et al.  Identification of Real MicroRNA Precursors with a Pseudo Structure Status Composition Approach , 2015, PloS one.

[45]  Qiwei Yang,et al.  Identification of Key Genes and Circular RNAs in Human Gastric Cancer , 2019, Medical science monitor : international medical journal of experimental and clinical research.

[46]  Limin Jiang,et al.  BP Neural Network Could Help Improve Pre-miRNA Identification in Various Species , 2016, BioMed research international.

[47]  Yanqing Niu,et al.  Predicting flexible length linear B-cell epitopes using pairwise sequence similarity , 2010, 2010 3rd International Conference on Biomedical Engineering and Informatics.

[48]  Jian-ning Chen,et al.  Identification of virus-encoded circular RNA. , 2019, Virology.

[49]  Thomas B. Hansen,et al.  Improved circRNA Identification by Combining Prediction Algorithms , 2018, Front. Cell Dev. Biol..

[50]  Geoffrey J. Barton,et al.  Jalview Version 2—a multiple sequence alignment editor and analysis workbench , 2009, Bioinform..

[51]  Petar Glažar,et al.  circBase: a database for circular RNAs , 2014, RNA.

[52]  Antonino Fiannaca,et al.  nRC: non-coding RNA Classifier based on structural features , 2017, BioData Mining.

[53]  Jun Wang,et al.  Comprehensive characterization of tissue-specific circular RNAs in the human and mouse genomes , 2016, Briefings Bioinform..

[54]  Leng Han,et al.  Circular RNAs as promising biomarkers in cancer: detection, function, and beyond , 2019, Genome Medicine.

[55]  Chunyu Wang,et al.  RFAmyloid: A Web Server for Predicting Amyloid Proteins , 2018, International journal of molecular sciences.

[56]  Dawood B. Dudekula,et al.  CircInteractome: A web tool for exploring circular RNAs and their interacting proteins and microRNAs , 2016, RNA biology.

[57]  A. Dahl,et al.  Sequence and expression levels of circular RNAs in progenitor cell types during mouse corticogenesis , 2019, Life Science Alliance.

[58]  N. Sharpless,et al.  Detecting and characterizing circular RNAs , 2014, Nature Biotechnology.

[59]  Sebastian D. Mackowiak,et al.  Circular RNAs are a large class of animal RNAs with regulatory potency , 2013, Nature.

[60]  Li Chen,et al.  PcircRNA_finder: a software for circRNA prediction in plants , 2016, Bioinform..

[61]  C. Zeng,et al.  Role of circular RNAs in cardiovascular diseases , 2019, Experimental biology and medicine.

[62]  F. Zhao,et al.  CIRI: an efficient and unbiased algorithm for de novo circular RNA identification , 2015, Genome Biology.

[63]  Qian-Hao Zhu,et al.  Widespread noncoding circular RNAs in plants. , 2015, The New phytologist.

[64]  B. Ni,et al.  RNA-seq of circular RNAs identified circPTPN22 as a potential new activity indicator in systemic lupus erythematosus , 2019, Lupus.