Predicting the interaction biomolecule types for lncRNA: an ensemble deep learning approach

Long noncoding RNAs (lncRNAs) play significant roles in various physiological and pathological processes through their interactions with biomolecules such as DNA, RNA and protein. Existing in silico methods for predicting lncRNA function mainly rely on calculating the similarity between lncRNAs or on testing whether an lncRNA interacts with a specific biomolecule or is associated with a specific disease. In this work, we explore lncRNA function from a different perspective: we present a tool that predicts the type of biomolecule with which a given lncRNA interacts. To this end, we first investigated the main molecular mechanisms of lncRNA-RNA, lncRNA-protein and lncRNA-DNA interactions. We then developed an ensemble deep learning model, lncIBTP (lncRNA Interaction Biomolecule Type Prediction), which predicts the interactions between lncRNAs and different types of biomolecules. In 5-fold cross-validation, lncIBTP achieved an average accuracy of 0.7042 and macro-average areas under the receiver operating characteristic curve and the precision-recall curve of 0.7903 and 0.6421, respectively, which indicates that the model is effective. In addition, based on our analysis of the collected published data and the prediction results, we hypothesize that the characteristics of lncRNAs that interact with DNA may differ from those of lncRNAs that interact only with RNA.
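For readers unfamiliar with the macro-averaged metrics reported above, the following is a minimal sketch of how a macro-average area under the ROC curve and under the precision-recall curve can be computed from per-class scores. It is not the lncIBTP evaluation code. The function names, the indicator-matrix input layout, and the use of average precision as the precision-recall area are assumptions made for illustration only.

```python
from typing import Callable, List, Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Binary AUROC: probability that a positive outscores a negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Step-wise area under the precision-recall curve (tied scores keep input order)."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits, total = 0, 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i] == 1:
            hits += 1
            total += hits / rank  # precision at each recalled positive
    return total / hits


def macro_average(
    metric: Callable[[Sequence[int], Sequence[float]], float],
    y_true: List[List[int]],     # hypothetical layout: y_true[i][k] is 1 if sample i has class k
    y_score: List[List[float]],  # hypothetical layout: y_score[i][k] is the score for class k
) -> float:
    """Compute the metric one-vs-rest for each class, then take the unweighted mean."""
    n_classes = len(y_true[0])
    per_class = [
        metric([row[k] for row in y_true], [row[k] for row in y_score])
        for k in range(n_classes)
    ]
    return sum(per_class) / n_classes
```

Under a 5-fold protocol, values such as `macro_average(auroc, y_true, y_score)` would be computed on each held-out fold and then averaged across the five folds. Each class needs at least one positive and one negative sample in a fold for the per-class metrics to be defined.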
