A Novel Approach for Potential Human LncRNA-Disease Association Prediction Based on Local Random Walk

In recent years, lncRNAs (long non-coding RNAs) have been proved to be closely related to many diseases that are seriously harmful to human health. Although researches on clarifying the relationships between lncRNAs and diseases are developing rapidly, associations between the lncRNAs and diseases are still remaining largely unknown. In this manuscript, a novel Local Random Walk based prediction model called LRWHLDA is proposed for inferring potential associations between human lncRNAs and diseases. In LRWHLDA, a new heterogeneous network is established first, which allows that LRWHLDA can be implemented in the case of lacking known lncRNA-disease associations. And then, an improved local random walk method is designed for prediction of novel lncRNA-disease associations, which can help LRWHLDA achieve high prediction accuracy but with low time complexity. Finally, in order to evaluate the prediction performance of LRWHLDA, different frameworks such as LOOCV, 2-folds CV, and 5-folds CV have been implemented, simulation results indicate that LRWHLDA can achieve reliable AUCs of 0.8037, 0.8354, and 0.8556 under the frameworks of 2-fold CV, 5-fold CV, and LOOCV, respectively. Hence, it is easy to know that LRWHLDA contains the potential to be a representative of emerging methods in the field of research on potential lncRNA-disease associations prediction.

[1]  Xing Chen,et al.  LncRNADisease: a database for long-non-coding RNA-associated diseases , 2012, Nucleic Acids Res..

[2]  Lei Wang,et al.  Prediction of microRNA-disease associations based on distance correlation set , 2018, BMC Bioinformatics.

[3]  Bo Liao,et al.  Global network random walk for predicting potential human lncRNA-disease associations , 2017, Scientific Reports.

[4]  Xing Chen,et al.  PBMDA: A novel and effective path-based computational model for miRNA-disease association prediction , 2017, PLoS Comput. Biol..

[5]  S. Brenner,et al.  General Nature of the Genetic Code for Proteins , 1961, Nature.

[6]  H. Willard,et al.  Characterization of a murine gene expressed from the inactive X chromosome , 1991, Nature.

[7]  Lei Wang,et al.  BNPMDA: Bipartite Network Projection for MiRNA–Disease Association prediction , 2018, Bioinform..

[8]  Q. Cui,et al.  An Analysis of Human MicroRNA and Disease Associations , 2008, PloS one.

[9]  Xing Chen,et al.  LRSSLMDA: Laplacian Regularized Sparse Subspace Learning for MiRNA-Disease Association prediction , 2017, PLoS Comput. Biol..

[10]  Na-Na Guan,et al.  Predicting miRNA‐disease association based on inductive matrix completion , 2018, Bioinform..

[11]  A. Giuliano,et al.  Pathophysiological basis of human papillomavirus in penile cancer: Key to prevention and delivery of more effective therapies , 2016, CA: a cancer journal for clinicians.

[12]  Lin Liu,et al.  Inferring novel lncRNA-disease associations based on a random walk model of a lncRNA functional similarity network. , 2014, Molecular bioSystems.

[13]  Xing Chen,et al.  Novel human lncRNA-disease association inference based on lncRNA expression profiles , 2013, Bioinform..

[14]  John S. Mattick,et al.  lncRNAdb: a reference database for long noncoding RNAs , 2010, Nucleic Acids Res..

[15]  Thomas E. Royce,et al.  Global Identification of Human Transcribed Sequences with Genome Tiling Arrays , 2004, Science.

[16]  Qionghai Dai,et al.  Constructing lncRNA functional similarity network based on lncRNA-disease associations and disease semantic similarity , 2015, Scientific Reports.

[17]  Linyuan Lu,et al.  Link prediction based on local random walk , 2010, 1001.2467.

[18]  J. Rinn,et al.  Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs , 2010, Nature Biotechnology.

[19]  Christos Faloutsos,et al.  Fast Random Walk with Restart and Its Applications , 2006, Sixth International Conference on Data Mining (ICDM'06).

[20]  Dapeng Hao,et al.  Prioritizing candidate disease-related long non-coding RNAs by walking on the heterogeneous lncRNA and disease network. , 2015, Molecular bioSystems.

[21]  Yi-Cheng Zhang,et al.  Effect of initial configuration on network-based recommendation , 2007, 0711.2506.

[22]  Xing Chen,et al.  MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction , 2018, PLoS Comput. Biol..

[23]  Martin S. Taylor,et al.  Genome-wide analysis of mammalian promoter architecture and evolution , 2006, Nature Genetics.

[24]  Tim R. Mercer,et al.  NRED: a database of long noncoding RNA expression , 2008, Nucleic Acids Res..

[25]  Elena Marchiori,et al.  Gaussian interaction profile kernels for predicting drug-target interaction , 2011, Bioinform..

[26]  Lei Wang,et al.  A Probabilistic Matrix Factorization Method for Identifying lncRNA-Disease Associations , 2019, Genes.

[27]  Thomas D. Schmittgen,et al.  Ultraconserved regions encoding ncRNAs are altered in human leukemias and carcinomas. , 2007, Cancer cell.

[28]  C. Yanofsky Establishing the Triplet Nature of the Genetic Code , 2007, Cell.

[29]  Xing Chen,et al.  IRWRLDA: improved random walk with restart for lncRNA-disease association prediction , 2016, Oncotarget.

[30]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[31]  R. Sharan,et al.  Protein networks in disease. , 2008, Genome research.

[32]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[33]  Dominic P. Norris,et al.  The product of the mouse Xist gene is a 15 kb inactive X-specific transcript containing no conserved ORF and located in the nucleus , 1992, Cell.

[34]  L. Kiemeney,et al.  DD3(PCA3), a very sensitive and specific marker to detect prostate tumors. , 2002, Cancer research.

[35]  Xing Chen KATZLDA: KATZ measure for the lncRNA-disease association prediction , 2015, Scientific Reports.

[36]  Roded Sharan,et al.  Associating Genes and Protein Complexes with Disease via Network Propagation , 2010, PLoS Comput. Biol..

[37]  Christopher R. Cabanski,et al.  Transcriptome sequencing reveals altered long intergenic non-coding RNAs in lung cancer , 2014, Genome Biology.

[38]  Xing Chen,et al.  Long non-coding RNAs and complex diseases: from experimental results to computational models , 2016, Briefings Bioinform..

[39]  Dong Wang,et al.  Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases , 2010, Bioinform..

[40]  Howard Y. Chang,et al.  Long noncoding RNA HOTAIR reprograms chromatin state to promote cancer metastasis , 2010, Nature.

[41]  J. Claverie Fewer Genes, More Noncoding RNA , 2005, Science.

[42]  Hui Xiao,et al.  NONCODE v3.0: integrative annotation of long noncoding RNAs , 2011, Nucleic Acids Res..

[43]  C. Ponting,et al.  Evolution and Functions of Long Noncoding RNAs , 2009, Cell.

[44]  Michael F. Lin,et al.  Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals , 2009, Nature.

[45]  Ankur Omer,et al.  microRNAs: role in leukemia and their computational perspective , 2015, Wiley interdisciplinary reviews. RNA.

[46]  E. Dees,et al.  The product of the H19 gene may function as an RNA , 1990, Molecular and cellular biology.

[47]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[48]  Lei Wang,et al.  A Novel Method for LncRNA-Disease Association Prediction Based on an lncRNA-Disease Association Network , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.