A novel algorithm based on bi-random walks to identify disease-related lncRNAs

There is evidence to suggest that lncRNAs are associated with distinct and diverse biological processes. The dysfunction or mutation of lncRNAs are implicated in a wide range of diseases. An accurate computational model can benefit the diagnosis of diseases and help us to gain a better understanding of the molecular mechanism. Although many related algorithms have been proposed, there is still much room to improve the accuracy of the algorithm. We developed a novel algorithm, BiWalkLDA, to predict disease-related lncRNAs in three real datasets, which have 528 lncRNAs, 545 diseases and 1216 interactions in total. To compare performance with other algorithms, the leave-one-out validation test was performed for BiWalkLDA and three other existing algorithms, SIMCLDA, LDAP and LRLSLDA. Additional tests were carefully designed to analyze the parameter effects such as α, β, l and r, which could help user to select the best choice of these parameters in their own application. In a case study of prostate cancer, eight out of the top-ten disease-related lncRNAs reported by BiWalkLDA were previously confirmed in literatures. In this paper, we develop an algorithm, BiWalkLDA, to predict lncRNA-disease association by using bi-random walks. It constructs a lncRNA-disease network by integrating interaction profile and gene ontology information. Solving cold-start problem by using neighbors’ interaction profile information. Then, bi-random walks was applied to three real biological datasets. Results show that our method outperforms other algorithms in predicting lncRNA-disease association in terms of both accuracy and specificity. https://github.com/screamer/BiwalkLDA

[1]  J. Hou,et al.  Long noncoding RNA MALAT-1 is a new potential therapeutic target for castration resistant prostate cancer. , 2013, The Journal of urology.

[2]  Xing Chen,et al.  LncRNADisease: a database for long-non-coding RNA-associated diseases , 2012, Nucleic Acids Res..

[3]  Zhu-Hong You,et al.  A novel approach based on KATZ measure to predict associations of human microbiota with non‐infectious diseases , 2016, Bioinform..

[4]  Laura Inés Furlong,et al.  DisGeNET: a Cytoscape plugin to visualize, integrate, search and analyze gene-disease networks , 2010, Bioinform..

[5]  Eugenia G. Giannopoulou,et al.  The oestrogen receptor alpha-regulated lncRNA NEAT1 is a critical modulator of prostate cancer , 2014, Nature Communications.

[6]  Xing Chen,et al.  Novel human lncRNA-disease association inference based on lncRNA expression profiles , 2013, Bioinform..

[7]  Jie Liu,et al.  MD-SVM: a novel SVM-based algorithm for the motif discovery of transcription factor binding sites , 2019, BMC Bioinformatics.

[8]  HENRY R. PROCTER The Spectrum of the Aurora , 1870, Nature.

[9]  Xuequn Shang,et al.  MiteFinderII: a novel tool to identify miniature inverted-repeat transposable elements hidden in eukaryotic genomes , 2018, BMC Medical Genomics.

[10]  Kerstin B. Meyer,et al.  A Functional Variant at a Prostate Cancer Predisposition Locus at 8q24 Is Associated with PVT1 Expression , 2011, PLoS genetics.

[11]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[12]  Guangyuan Fu,et al.  Matrix factorization-based data fusion for the prediction of lncRNA–disease associations , 2018, Bioinform..

[13]  Sarah C. Ayling,et al.  The Ensembl gene annotation system , 2016, Database J. Biol. Databases Curation.

[14]  Yan Zheng,et al.  WebNetCoffee: a web-based application to identify functionally conserved proteins from Multiple PPI networks , 2018, BMC Bioinformatics.

[15]  Xuequn Shang,et al.  Detection of Network Motif Based on a Novel Graph Canonization Algorithm from Transcriptional Regulation Networks , 2017, Molecules.

[16]  Xin-Yu Na,et al.  Long non-coding RNA UCA1 contributes to the progression of prostate cancer and regulates proliferation through KLF4-KRT6/13 signaling pathway. , 2015, International journal of clinical and experimental medicine.

[17]  Michael F. Lin,et al.  Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals , 2009, Nature.

[18]  Yi Pan,et al.  LDAP: a web server for lncRNA‐disease association prediction , 2016, Bioinform..

[19]  Lin Liu,et al.  Inferring novel lncRNA-disease associations based on a random walk model of a lncRNA functional similarity network. , 2014, Molecular bioSystems.

[20]  Jindan Yu,et al.  LncRNA HOTAIR Enhances the Androgen-Receptor-Mediated Transcriptional Program and Drives Castration-Resistant Prostate Cancer , 2015, Cell reports.

[21]  Qin Chen,et al.  lncRNA H19/miR‐675 axis represses prostate cancer metastasis by targeting TGFBI , 2014, The FEBS journal.

[22]  Yi Pan,et al.  Prediction of lncRNA–disease associations based on inductive matrix completion , 2018, Bioinform..

[23]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[24]  Xing Chen KATZLDA: KATZ measure for the lncRNA-disease association prediction , 2015, Scientific Reports.

[25]  Xing Chen,et al.  IRWRLDA: improved random walk with restart for lncRNA-disease association prediction , 2016, Oncotarget.

[26]  Yan Zheng,et al.  KF-finder: identification of key factors from host-microbial networks in cervical cancer , 2018, BMC Syst. Biol..

[27]  Jiajie Peng,et al.  Predicting Parkinson's Disease Genes Based on Node2vec and Autoencoder , 2019, Front. Genet..

[28]  Guosong Jiang,et al.  Long Non-Coding RNA MEG3 Inhibits Cell Proliferation and Induces Apoptosis in Prostate Cancer , 2015, Cellular Physiology and Biochemistry.

[29]  Nadav S. Bar,et al.  Landscape of transcription in human cells , 2012, Nature.

[30]  M. Mourtada-Maarabouni,et al.  Long non-coding RNA GAS5 regulates apoptosis in prostate cancer cell lines. , 2013, Biochimica et biophysica acta.