MiRGOFS: a GO‐based functional similarity measurement for miRNAs, with applications to the prediction of miRNA subcellular localization and miRNA‐disease association

Motivation: Benefiting from high‐throughput experimental technologies, whole‐genome analysis of microRNAs (miRNAs) has been more and more common to uncover important regulatory roles of miRNAs and identify miRNA biomarkers for disease diagnosis. As a complementary information to the high‐throughput experimental data, domain knowledge like the Gene Ontology and KEGG pathway is usually used to guide gene function analysis. However, functional annotation for miRNAs is scarce in the public databases. Till now, only a few methods have been proposed for measuring the functional similarity between miRNAs based on public annotation data, and these methods cover a very limited number of miRNAs, which are not applicable to large‐scale miRNA analysis. Results: In this paper, we propose a new method to measure the functional similarity for miRNAs, called miRGOFS, which has two notable features: (i) it adopts a new GO semantic similarity metric which considers both common ancestors and descendants of GO terms; (i) it computes similarity between GO sets in an asymmetric manner, and weights each GO term by its statistical significance. The miRGOFS‐based predictor achieves an F1 of 61.2% on a benchmark dataset of miRNA localization, and AUC values of 87.7 and 81.1% on two benchmark sets of miRNA‐disease association, respectively. Compared with the existing functional similarity measurements of miRNAs, miRGOFS has the advantages of higher accuracy and larger coverage of human miRNAs (over 1000 miRNAs). Availability and implementation: http://www.csbio.sjtu.edu.cn/bioinf/MiRGOFS/ Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Haixuan Yang,et al.  Improving GO semantic similarity measures by exploring the ontology beneath the terms and modelling uncertainty , 2012, Bioinform..

[2]  H. Berg Cold Spring Harbor Symposia on Quantitative Biology.: Vol. LII. Evolution of Catalytic Functions. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1987, ISBN 0-87969-054-2, xix + 955 pp., US $150.00. , 1989 .

[3]  Chunyu Wang,et al.  A novel insight into Gene Ontology semantic similarity. , 2013, Genomics.

[4]  Anton J. Enright,et al.  Human MicroRNA Targets , 2004, PLoS biology.

[5]  Wei Pan,et al.  Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data , 2006, Bioinform..

[6]  A. Leung,et al.  The Whereabouts of microRNA Actions: Cytoplasm and Beyond. , 2015, Trends in cell biology.

[7]  Xiangxiang Zeng,et al.  Integrative approaches for predicting microRNA function and prioritizing disease-related microRNA using biological interaction networks , 2016, Briefings Bioinform..

[8]  Oliver Kohlbacher,et al.  YLoc—an interpretable web server for predicting subcellular localization , 2010, Nucleic Acids Res..

[9]  Ivo Grosse,et al.  Functional microRNA targets in protein coding sequences , 2012, Bioinform..

[10]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[11]  Hailin Chen,et al.  Similarity-based methods for potential human microRNA-disease association prediction , 2013, BMC Medical Genomics.

[12]  Xiaowei Wang,et al.  miRDB: an online resource for microRNA target prediction and functional annotations , 2014, Nucleic Acids Res..

[13]  K. Gunsalus,et al.  Combinatorial microRNA target predictions , 2005, Nature Genetics.

[14]  R. Aharonov,et al.  Identification of hundreds of conserved and nonconserved human microRNAs , 2005, Nature Genetics.

[15]  Xing Chen,et al.  HGIMDA: Heterogeneous graph inference for miRNA-disease association prediction , 2016, Oncotarget.

[16]  Xiaoyan Liu,et al.  Measuring gene functional similarity based on group-wise comparison of GO terms , 2013, Bioinform..

[17]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[18]  C. Perou,et al.  A custom microarray platform for analysis of microRNA gene expression , 2004, Nature Methods.

[19]  F. Slack,et al.  Oncomirs — microRNAs with a role in cancer , 2006, Nature Reviews Cancer.

[20]  Stijn van Dongen,et al.  miRBase: microRNA sequences, targets and gene nomenclature , 2005, Nucleic Acids Res..

[21]  Doron Betel,et al.  The microRNA.org resource: targets and expression , 2007, Nucleic Acids Res..

[22]  H. Horvitz,et al.  MicroRNA expression profiles classify human cancers , 2005, Nature.

[23]  Catia Pesquita,et al.  Metrics for GO based protein semantic similarity: a systematic evaluation , 2008, BMC Bioinformatics.

[24]  Xing Chen,et al.  PBMDA: A novel and effective path-based computational model for miRNA-disease association prediction , 2017, PLoS Comput. Biol..

[25]  Qing-Yu He,et al.  A new method for measuring functional similarity of microRNAs , 2011 .

[26]  Mário J. Silva,et al.  Measuring semantic similarity between Gene Ontology terms , 2007, Data Knowl. Eng..

[27]  Xing Chen,et al.  DRMDA: deep representations‐based miRNA–disease association prediction , 2017, Journal of cellular and molecular medicine.

[28]  Olivier Bodenreider,et al.  Ontology-driven similarity approaches to supporting gene func- tional assessment , 2005 .

[29]  C. Burge,et al.  Prediction of Mammalian MicroRNA Targets , 2003, Cell.

[30]  Sam Griffiths-Jones,et al.  Bias in microRNA functional enrichment analysis , 2015, Bioinform..

[31]  G. Vriend,et al.  A text-mining analysis of the human phenome , 2006, European Journal of Human Genetics.

[32]  Hong-Bin Shen,et al.  Hum‐mPLoc 3.0: prediction enhancement of human protein subcellular localization through modeling the hidden correlations of gene ontology and functional domain features , 2016, Bioinform..

[33]  D. Bartel,et al.  Predicting effective microRNA target sites in mammalian mRNAs , 2015, eLife.

[34]  Steven Skiena,et al.  Lowest common ancestors in trees and directed acyclic graphs , 2005, J. Algorithms.

[35]  Michael Kertesz,et al.  The role of site accessibility in microRNA target recognition , 2007, Nature Genetics.

[36]  Carole A. Goble,et al.  Semantic Similarity Measures as Tools for Exploring the Gene Ontology , 2002, Pacific Symposium on Biocomputing.

[37]  Dong Wang,et al.  Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases , 2010, Bioinform..

[38]  Jeffrey A. Thompson,et al.  Common features of microRNA target prediction tools , 2014, Front. Genet..

[39]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[40]  Xing Chen,et al.  RKNNMDA: Ranking-based KNN for MiRNA-Disease Association prediction , 2017, RNA biology.

[41]  Thomas Lengauer,et al.  A new measure for functional similarity of gene products based on Gene Ontology , 2006, BMC Bioinformatics.

[42]  Yibo Wu,et al.  GOSemSim: an R package for measuring semantic similarity among GO terms and gene products , 2010, Bioinform..

[43]  Yadong Wang,et al.  miR2Disease: a manually curated database for microRNA deregulation in human disease , 2008, Nucleic Acids Res..

[44]  Xing Chen,et al.  MicroRNAs and complex diseases: from experimental results to computational models , 2019, Briefings Bioinform..

[45]  Jinyan Li,et al.  Grouping miRNAs of similar functions via weighted information content of gene ontology , 2016, BMC Bioinformatics.

[46]  Yang Li,et al.  HMDD v2.0: a database for experimentally supported human microRNA and disease associations , 2013, Nucleic Acids Res..

[47]  Xing Chen,et al.  RWRMDA: predicting novel human microRNA-disease associations. , 2012, Molecular bioSystems.

[48]  Yan Huang,et al.  RNALocate: a resource for RNA subcellular localizations , 2016, Nucleic Acids Res..

[49]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[50]  Yen-Han Lin,et al.  False positive reduction in protein-protein interaction predictions using gene ontology annotations , 2007, BMC Bioinformatics.

[51]  Yuriy Gusev,et al.  Computational analysis of biological functions and pathways collectively targeted by co-expressed microRNAs in cancer , 2007, BMC Bioinformatics.

[52]  Ying Xu,et al.  Prediction of functional modules based on comparative genome analysis and Gene Ontology application , 2005, Nucleic acids research.

[53]  Qionghai Dai,et al.  WBSMDA: Within and Between Score for MiRNA-Disease Association prediction , 2016, Scientific Reports.

[54]  Lukasz A. Kurgan,et al.  Comprehensive overview and assessment of computational prediction of microRNA targets in animals , 2015, Briefings Bioinform..

[55]  Zhu Yangyong,et al.  A measure of semantic similarity between gene ontology terms based on semantic pathway covering , 2006 .

[56]  Martin Reczko,et al.  DIANA miRPath v.2.0: investigating the combinatorial effect of microRNAs in pathways , 2012, Nucleic Acids Res..

[57]  Philip S. Yu,et al.  A new method to measure the semantic similarity of GO terms , 2007, Bioinform..

[58]  Xing Chen,et al.  Semi-supervised learning for potential human microRNA-disease associations inference , 2014, Scientific Reports.

[59]  Yadong Wang,et al.  Prioritization of disease microRNAs through a human phenome-microRNAome network , 2010, BMC Systems Biology.

[60]  Thomas Lengauer,et al.  Improving disease gene prioritization using the semantic similarity of Gene Ontology terms , 2010, Bioinform..

[61]  Xing Chen,et al.  HAMDA: Hybrid Approach for MiRNA-Disease Association prediction , 2017, J. Biomed. Informatics.

[62]  Athanasios Fevgas,et al.  DIANA-TarBase v7.0: indexing more than half a million experimentally supported miRNA:mRNA interactions , 2014, Nucleic Acids Res..

[63]  Lin He,et al.  MicroRNAs: small RNAs with a big role in gene regulation , 2004, Nature reviews genetics.

[64]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[65]  P. Sharp,et al.  Function and localization of microRNAs in mammalian cells. , 2006, Cold Spring Harbor symposia on quantitative biology.

[66]  Jacek Niklinski,et al.  MicroRNAs as novel targets and tools in cancer therapy. , 2017, Cancer letters.

[67]  V. Kim,et al.  MicroRNA maturation: stepwise processing and subcellular localization , 2002, The EMBO journal.