Predicting lncRNA-miRNA Interaction via Graph Convolution Auto-Encoder

Interactions between miRNAs and lncRNAs are known to be important for gene regulation. However, the number of known lncRNA-miRNA interactions is still very small, and few computational tools are available for predicting new ones. Because lncRNAs and miRNAs share internal patterns in how they pair with each other, unknown lncRNA-miRNA interactions can be predicted from the known ones, which can be framed as a semi-supervised learning problem. The attributes of lncRNAs and miRNAs have been shown to be closely related to the interactions between them, and effective use of this side information can improve performance, especially when training samples are limited. In view of this, we propose an end-to-end prediction model called GCLMI (Graph Convolution for novel lncRNA-miRNA Interactions) that combines graph convolution with an auto-encoder. Without any preprocessing of the feature information, our method incorporates the raw node attributes together with the topology of the interaction network. On a real dataset collected from a public database, experiments under k-fold cross-validation illustrate the robustness and effectiveness of the proposed model. We show that the graph convolution layer designed in the proposed model can effectively integrate the input data by filtering the graph with node features. The proposed model is expected to identify high-confidence candidate lncRNA-miRNA interactions when users provide different types of numerical features describing lncRNAs or miRNAs, and thus to serve as a useful computational tool.
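To make the idea of combining graph convolution with an auto-encoder more concrete, the following is a minimal sketch of a forward pass, assuming a standard two-layer graph convolutional encoder with symmetric adjacency normalization and an inner-product decoder. It is not the GCLMI implementation: the function names, layer sizes, random weights, and the toy interaction matrix are all hypothetical, and the sketch assumes lncRNA and miRNA nodes share one feature matrix with a common dimension.

```python
import numpy as np

def bipartite_adjacency(interactions):
    """Embed an (n_lnc x n_mi) interaction matrix in a square symmetric adjacency."""
    n_lnc, n_mi = interactions.shape
    adj = np.zeros((n_lnc + n_mi, n_lnc + n_mi))
    adj[:n_lnc, n_lnc:] = interactions
    adj[n_lnc:, :n_lnc] = interactions.T
    return adj

def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2}; self-loops keep every degree >= 1."""
    a_tilde = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_tilde.sum(axis=1))
    return a_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def gcn_autoencoder_forward(adj, features, w0, w1):
    """Encode nodes with two graph convolutions, decode with an inner product."""
    a_hat = normalize_adjacency(adj)
    hidden = np.maximum(a_hat @ features @ w0, 0.0)  # graph convolution + ReLU
    z = a_hat @ hidden @ w1                          # node embeddings
    scores = 1.0 / (1.0 + np.exp(-(z @ z.T)))        # sigmoid of pairwise inner products
    return z, scores

# Toy data (hypothetical): 3 lncRNAs, 2 miRNAs, 4 raw features per node.
rng = np.random.default_rng(0)
interactions = np.array([[1, 0], [1, 1], [0, 1]], dtype=float)
adj = bipartite_adjacency(interactions)
features = rng.normal(size=(5, 4))
w0 = rng.normal(scale=0.1, size=(4, 8))  # untrained weights, for illustration only
w1 = rng.normal(scale=0.1, size=(8, 2))

z, scores = gcn_autoencoder_forward(adj, features, w0, w1)
lnc_mi_scores = scores[:3, 3:]  # predicted lncRNA-miRNA block
```

In a trained model of this kind, the weights would be fitted so that the decoded scores reconstruct the known interactions, and high scores on unobserved lncRNA-miRNA pairs would be read as candidate interactions.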
