iCDA-CGR: Identification of circRNA-disease associations based on Chaos Game Representation

Found in recent research, tumor cell invasion, proliferation, or other biological processes are controlled by circular RNA. Understanding the association between circRNAs and diseases is an important way to explore the pathogenesis of complex diseases and promote disease-targeted therapy. Most methods, such as k-mer and PSSM, based on the analysis of high-throughput expression data have the tendency to think functionally similar nucleic acid lack direct linear homology regardless of positional information and only quantify nonlinear sequence relationships. However, in many complex diseases, the sequence nonlinear relationship between the pathogenic nucleic acid and ordinary nucleic acid is not much different. Therefore, the analysis of positional information expression can help to predict the complex associations between circRNA and disease. To fill up this gap, we propose a new method, named iCDA-CGR, to predict the circRNA-disease associations. In particular, we introduce circRNA sequence information and quantifies the sequence nonlinear relationship of circRNA by Chaos Game Representation (CGR) technology based on the biological sequence position information for the first time in the circRNA-disease prediction model. In the cross-validation experiment, our method achieved 0.8533 AUC, which was significantly higher than other existing methods. In the validation of independent data sets including circ2Disease, circRNADisease and CRDD, the prediction accuracy of iCDA-CGR reached 95.18%, 90.64% and 95.89%. Moreover, in the case studies, 19 of the top 30 circRNA-disease associations predicted by iCDA-CGR on circRDisease dataset were confirmed by newly published literature. These results demonstrated that iCDA-CGR has outstanding robustness and stability, and can provide highly credible candidates for biological experiments.

[1]  Petar Glažar,et al.  circBase: a database for circular RNAs , 2014, RNA.

[2]  H. J. Jeffrey Chaos game representation of gene structure. , 1990, Nucleic acids research.

[3]  Zhu-Hong You,et al.  CGMDA: An Approach to Predict and Validate MicroRNA-Disease Associations by Utilizing Chaos Game Representation and LightGBM , 2019, IEEE Access.

[4]  H. J. Jeffrey Chaos game representation of gene structure. , 1990, Nucleic acids research.

[5]  Hsien-Da Huang,et al.  CircNet: a database of circular RNAs derived from transcriptome sequencing data , 2015, Nucleic Acids Res..

[6]  Matthew J. Higgins,et al.  Inhibition of RNA lariat debranching enzyme suppresses TDP-43 toxicity in ALS disease models , 2012, Nature Genetics.

[7]  Shanshan Zhu,et al.  Circular intronic long noncoding RNAs. , 2013, Molecular cell.

[8]  Philip S. Yu,et al.  A new method to measure the semantic similarity of GO terms , 2007, Bioinform..

[9]  Yan-Bin Wang,et al.  MISSIM: Improved miRNA-Disease Association Prediction Model Based on Chaos Game Representation and Broad Learning System , 2019, ICIC.

[10]  Yan Li,et al.  circRNADb: A comprehensive database for human circular RNAs with protein-coding annotations , 2016, Scientific Reports.

[11]  Fang-Xiang Wu,et al.  Prediction of CircRNA-Disease Associations Using KATZ Model Based on Heterogeneous Networks , 2018, International journal of biological sciences.

[12]  Elena Marchiori,et al.  Gaussian interaction profile kernels for predicting drug-target interaction , 2011, Bioinform..

[13]  Zhu-Hong You,et al.  Constructing prediction models from expression profiles for large scale lncRNA–miRNA interaction profiling , 2017, Bioinform..

[14]  Xiaoyan Mo,et al.  Using circular RNA as a novel type of biomarker in the screening of gastric cancer. , 2015, Clinica chimica acta; international journal of clinical chemistry.

[15]  Xiujuan Lei,et al.  CircR2Disease: a manually curated database for experimentally supported circular RNAs associated with various diseases , 2018, Database J. Biol. Databases Curation.

[16]  Xiangxiang Zeng,et al.  Inferring MicroRNA-Disease Associations by Random Walk on a Heterogeneous Network with Multiple Data Sources , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  Bing Zhou,et al.  A novel identified circular RNA, circRNA_010567, promotes myocardial fibrosis via suppressing miR-141 by targeting TGF-β1. , 2017, Biochemical and biophysical research communications.

[18]  Lingqing Wang,et al.  Predicting Protein-Protein Interactions from Matrix-Based Protein Sequence Using Convolution Neural Network and Feature-Selective Rotation Forest , 2019, Scientific Reports.

[19]  N. Rajewsky,et al.  circRNA biogenesis competes with pre-mRNA splicing. , 2014, Molecular cell.

[20]  Li Yang,et al.  CIRCpedia v2: An Updated Database for Comprehensive Circular RNA Annotation and Expression Comparison , 2018, Genom. Proteom. Bioinform..

[21]  Yan Lu,et al.  Circ2Disease: a manually curated database of experimentally validated circRNAs in human disease , 2018, Scientific Reports.

[22]  Jie Wu,et al.  deepBase v2.0: identification, expression, evolution and function of small RNAs, LncRNAs and circular RNAs from deep-sequencing data , 2015, Nucleic Acids Res..

[23]  Yifeng Zhou,et al.  Circular RNA ITCH has inhibitory effect on ESCC by suppressing the Wnt/β-catenin pathway , 2015, Oncotarget.

[24]  Hai-Feng Liang,et al.  Circular RNA circ-ABCB10 promotes breast cancer proliferation and progression through sponging miR-1271. , 2017, American journal of cancer research.

[25]  Faryal Mehwish Awan,et al.  Induction of tumor apoptosis through a circular RNA enhancing Foxo3 activity , 2016, Cell Death and Differentiation.

[26]  Xiujuan Lei,et al.  PWCDA: Path Weighted Method for Predicting circRNA-Disease Associations , 2018, International journal of molecular sciences.

[27]  G. Shan,et al.  Exon-intron circular RNAs regulate transcription in the nucleus , 2015, Nature Structural &Molecular Biology.

[28]  Wei Li,et al.  The circular RNA Cdr1as, via miR-7 and its targets, regulates insulin transcription and secretion in islet cells , 2015, Scientific Reports.

[29]  Hui Zhou,et al.  deepBase: a database for deeply annotating and mining deep sequencing data , 2009, Nucleic Acids Res..

[30]  Yong Li,et al.  Circular RNAs function as ceRNAs to regulate and control human cancer progression , 2018, Molecular Cancer.

[31]  Yufei Huang,et al.  Prediction of microRNAs Associated with Human Diseases Based on Weighted k Most Similar Neighbors , 2013, PloS one.

[32]  Zhu-Hong You,et al.  DBMDA: A Unified Embedding for Sequence-Based miRNA Similarity Measure with Applications to Predict and Validate miRNA-Disease Associations , 2019, Molecular therapy. Nucleic acids.

[33]  Lei Wang,et al.  Prediction of RNA-protein interactions by combining deep convolutional neural network with feature selection ensemble method. , 2019, Journal of theoretical biology.

[34]  Ming Chen,et al.  CircFunBase: a database for functional circular RNAs , 2019, Database J. Biol. Databases Curation.

[35]  Carole A. Goble,et al.  Investigating Semantic Similarity Measures Across the Gene Ontology: The Relationship Between Sequence and Annotation , 2003, Bioinform..

[36]  Tao Jiang,et al.  circRNA disease: a manually curated database of experimentally supported circRNA-disease associations , 2018, Cell Death & Disease.

[37]  Xing Chen,et al.  LMTRDA: Using logistic model tree to predict MiRNA-disease associations by fusing multi-source information of sequences and similarities , 2019, PLoS Comput. Biol..

[38]  Jianhua Dai,et al.  Computational Prediction of Human Disease- Associated circRNAs Based on Manifold Regularization Learning Framework , 2019, IEEE Journal of Biomedical and Health Informatics.

[39]  Li-Ping Li,et al.  MLMDA: a machine learning approach to predict and validate MicroRNA–disease associations by integrating of heterogenous information sources , 2019, Journal of Translational Medicine.

[40]  Michitaka Fujiwara,et al.  Identification of a serum-based miRNA signature for response of esophageal squamous cell carcinoma to neoadjuvant chemotherapy , 2019, Journal of Translational Medicine.