Novel human lncRNA-disease association inference based on lncRNA expression profiles

MOTIVATION: Growing evidence indicates that long non-coding RNAs (lncRNAs) play critical roles in many important biological processes. Mutations and dysregulation of these lncRNAs would therefore contribute to the development of various complex diseases. Powerful computational models for identifying potential disease-related lncRNAs would benefit biomarker identification and drug discovery for the diagnosis, treatment, prognosis and prevention of human disease. RESULTS: In this article, we proposed the assumption that similar diseases tend to be associated with functionally similar lncRNAs. Based on this assumption, we developed Laplacian Regularized Least Squares for LncRNA-Disease Association (LRLSLDA), a method in the semi-supervised learning framework. Although known disease-lncRNA associations in the database are rare, LRLSLDA obtained an AUC of 0.7760 in leave-one-out cross-validation, significantly improving on the performance of previous methods. We also showed that the performance of LRLSLDA is robust to parameter selection and that it performs reliably in all test classes. Many potential disease-lncRNA associations were publicly released, and some of them have been confirmed by recent biological experiments. We anticipate that LRLSLDA could be an effective and important tool for biomedical research. AVAILABILITY: The code of LRLSLDA is freely available at http://asdcd.amss.ac.cn/Software/Details/2.
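
To make the idea of Laplacian regularized least squares concrete, the following is a minimal sketch of one common closed form from that family, applied to a toy lncRNA similarity matrix. It is not taken from the LRLSLDA code: the Gaussian similarity on expression profiles, the normalized graph Laplacian, the closed form `F = S (S + eta * L * S)^+ Y`, the function names, the parameter `eta`, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def gaussian_similarity(X, gamma=1.0):
    """Gaussian (RBF) similarity between rows of X (assumed kernel choice)."""
    sq = np.sum(X ** 2, axis=1)
    # Squared Euclidean distances, clipped at 0 to absorb rounding error.
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    return np.exp(-gamma * d2)

def normalized_laplacian(S):
    """Symmetric normalized Laplacian I - D^{-1/2} S D^{-1/2}."""
    d = S.sum(axis=1)
    inv_sqrt = np.zeros_like(d, dtype=float)
    inv_sqrt[d > 0] = 1.0 / np.sqrt(d[d > 0])
    return np.eye(S.shape[0]) - inv_sqrt[:, None] * S * inv_sqrt[None, :]

def laprls_scores(S, Y, eta=1.0):
    """Score matrix F = S (S + eta * L S)^+ Y (assumed closed form).

    S   : (n, n) similarity between lncRNAs
    Y   : (n, m) known lncRNA-disease associations (1 = known, 0 = unknown)
    eta : weight of the Laplacian smoothness penalty (hypothetical name)
    """
    L = normalized_laplacian(S)
    M = S + eta * (L @ S)
    # The pseudo-inverse keeps the sketch usable if M is ill-conditioned.
    return S @ np.linalg.pinv(M) @ Y

# Toy data: 5 lncRNAs with 4 expression measurements, 3 diseases.
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 4))
Y = np.array([[1, 0, 0],
              [0, 1, 0],
              [0, 0, 0],
              [1, 0, 1],
              [0, 0, 0]], dtype=float)

S = gaussian_similarity(X)
F = laprls_scores(S, Y, eta=0.5)
print(F.round(3))  # higher entries suggest more plausible associations
```

In this sketch, unlabeled lncRNA-disease pairs receive scores through the similarity graph, which is the semi-supervised aspect: entries of `Y` that are 0 are treated as unknown rather than as confirmed negatives. Setting `eta` to 0 removes the smoothing and, for an invertible `S`, returns `Y` unchanged.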
