Large-scale prediction of microRNA-disease associations by combinatorial prioritization algorithm

Identification of the associations between microRNA molecules and human diseases from large-scale heterogeneous biological data is an important step for understanding the pathogenesis of diseases in microRNA level. However, experimental verification of microRNA-disease associations is expensive and time-consuming. To overcome the drawbacks of conventional experimental methods, we presented a combinatorial prioritization algorithm to predict the microRNA-disease associations. Importantly, our method can be used to predict microRNAs (diseases) associated with the diseases (microRNAs) without the known associated microRNAs (diseases). The predictive performance of our proposed approach was evaluated and verified by the internal cross-validations and external independent validations based on standard association datasets. The results demonstrate that our proposed method achieves the impressive performance for predicting the microRNA-disease association with the Area Under receiver operation characteristic Curve (AUC), 86.93%, which is indeed outperform the previous prediction methods. Particularly, we observed that the ensemble-based method by integrating the predictions of multiple algorithms can give more reliable and robust prediction than the single algorithm, with the AUC score improved to 92.26%. We applied our combinatorial prioritization algorithm to lung neoplasms and breast neoplasms, and revealed their top 30 microRNA candidates, which are in consistent with the published literatures and databases.

[1]  Konstantin Khrapko,et al.  A microRNA array reveals extensive regulation of microRNAs during brain development. , 2003, RNA.

[2]  C. Croce,et al.  MicroRNA gene expression deregulation in human breast cancer. , 2005, Cancer research.

[3]  Eckart Meese,et al.  MicroRNAs – Important Molecules in Lung Cancer Research , 2011, Front. Gene..

[4]  Tyler E. Miller,et al.  MicroRNA-221/222 Confers Tamoxifen Resistance in Breast Cancer by Targeting p27Kip1*♦ , 2008, Journal of Biological Chemistry.

[5]  N. Rajewsky,et al.  Deep conservation of microRNA-target relationships and 3'UTR motifs in vertebrates, flies, and nematodes. , 2006, Cold Spring Harbor symposia on quantitative biology.

[6]  C. Croce,et al.  MicroRNA signatures in human cancers , 2006, Nature Reviews Cancer.

[7]  Andrew V. Goldberg,et al.  Beyond the flow decomposition barrier , 1998, JACM.

[8]  Zhaolei Zhang,et al.  Evidence for Positive Selection on a Number of MicroRNA Regulatory Interactions during Recent Human Evolution , 2012, PLoS genetics.

[9]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[10]  Michael Kertesz,et al.  The role of site accessibility in microRNA target recognition , 2007, Nature Genetics.

[11]  Xing-Ming Zhao,et al.  Prediction of Drug Combinations by Integrating Molecular and Pharmacological Data , 2011, PLoS Comput. Biol..

[12]  Yue Zhao,et al.  RAID v2.0: an updated resource of RNA-associated interactions across organisms , 2016, Nucleic Acids Res..

[13]  C. Burge,et al.  The Widespread Impact of Mammalian MicroRNAs on mRNA Repression and Evolution , 2005, Science.

[14]  C. Burge,et al.  Prediction of Mammalian MicroRNA Targets , 2003, Cell.

[15]  Lucas C. Parra,et al.  On the Maximization of Information Flow Between Spiking Neurons , 2009, Neural Computation.

[16]  Carla Oliveira,et al.  Retraction: A TARBP2 mutation in human cancer impairs microRNA processing and DICER1 function , 2016, Nature Genetics.

[17]  Kwong-Sak Leung,et al.  ViRBase: a resource for virus–host ncRNA-associated interactions , 2014, Nucleic Acids Res..

[18]  C. Wijmenga,et al.  Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes. , 2006, American journal of human genetics.

[19]  Vassilis Georgoulias,et al.  Prognostic value of mature microRNA-21 and microRNA-205 overexpression in non-small cell lung cancer by quantitative real-time RT-PCR. , 2008, Clinical chemistry.

[20]  C. Croce,et al.  A microRNA expression signature of human solid tumors defines cancer gene targets , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Yadong Wang,et al.  Predicting human microRNA-disease associations based on support vector machine , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[22]  Xiangxiang Zeng,et al.  Prediction and validation of association between microRNAs and diseases by multipath methods. , 2016, Biochimica et biophysica acta.

[23]  Tao Jiang,et al.  Uncover disease genes by maximizing information flow in the phenome–interactome network , 2011, Bioinform..

[24]  R. Sharan,et al.  PREDICT: a method for inferring novel drug indications with application to personalized medicine , 2011, Molecular systems biology.

[25]  Ujjwal Maulik,et al.  Development of the human cancer microRNA network , 2010 .

[26]  Xia Li,et al.  Prioritizing cancer-related key miRNA–target interactions by integrative genomics , 2012, Nucleic acids research.

[27]  P. Robinson,et al.  Walking the interactome for prioritization of candidate disease genes. , 2008, American journal of human genetics.

[28]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[29]  Ranit Aharonov,et al.  MicroRNA expression detected by oligonucleotide microarrays: system establishment and expression profiling in human tissues. , 2004, Genome research.

[30]  J Philip McCoy,et al.  Hematopoietic-specific microRNA expression in human cells. , 2006, Leukemia research.

[31]  R. Sharan,et al.  INDI: a computational framework for inferring drug interactions and their associated recommendations , 2012, Molecular systems biology.

[32]  G. Vriend,et al.  A text-mining analysis of the human phenome , 2006, European Journal of Human Genetics.

[33]  Muller Fabbri,et al.  A MicroRNA signature associated with prognosis and progression in chronic lymphocytic leukemia. , 2005, The New England journal of medicine.

[34]  Yun Xiao,et al.  Prioritizing candidate disease miRNAs by integrating phenotype associations of multiple diseases with matched miRNA and mRNA expression profiles. , 2014, Molecular bioSystems.

[35]  Qionghai Dai,et al.  WBSMDA: Within and Between Score for MiRNA-Disease Association prediction , 2016, Scientific Reports.

[36]  Q. Cui,et al.  An Analysis of Human MicroRNA and Disease Associations , 2008, PloS one.

[37]  Mohammed Al-Shalalfa,et al.  Protein network-based Lasso regression model for the construction of disease-miRNA functional interactions , 2013, EURASIP J. Bioinform. Syst. Biol..

[38]  Yufei Huang,et al.  Prediction of microRNAs Associated with Human Diseases Based on Weighted k Most Similar Neighbors , 2013, PloS one.

[39]  Xiangxiang Zeng,et al.  Integrative approaches for predicting microRNA function and prioritizing disease-related microRNA using biological interaction networks , 2016, Briefings Bioinform..

[40]  Zissimos Mourelatos,et al.  Microarray-based, high-throughput gene expression profiling of microRNAs , 2004, Nature Methods.

[41]  Yun Xiao,et al.  Prioritizing Candidate Disease miRNAs by Topological Features in the miRNA Target–Dysregulated Network: Case Study of Prostate Cancer , 2011, Molecular Cancer Therapeutics.

[42]  Paula K Shireman,et al.  Reproducibility of quantitative RT-PCR array in miRNA expression profiling and comparison with microarray analysis , 2009 .

[43]  Hailin Chen,et al.  Prediction of Associations between OMIM Diseases and MicroRNAs by Random Walk on OMIM Disease Similarity Network , 2013, TheScientificWorldJournal.

[44]  Guohua Wang,et al.  An approach for prioritizing disease-related microRNAs based on genomic data integration , 2010, 2010 3rd International Conference on Biomedical Engineering and Informatics.

[45]  Sam Griffiths-Jones,et al.  The microRNA Registry , 2004, Nucleic Acids Res..

[46]  Andrew D. Johnson,et al.  Bmc Medical Genetics an Open Access Database of Genome-wide Association Results , 2009 .

[47]  Anjali J. Koppal,et al.  Supplementary data: Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites , 2010 .

[48]  Wei Gao,et al.  [Prediction of the microRNAs related to cardiovascular diseases by bioinformatics]. , 2009, Beijing da xue xue bao. Yi xue ban = Journal of Peking University. Health sciences.

[49]  Shangwei Ning,et al.  Prioritizing human cancer microRNAs based on genes’ functional consistency between microRNA and cancer , 2011, Nucleic acids research.

[50]  K. Gunsalus,et al.  Combinatorial microRNA target predictions , 2005, Nature Genetics.

[51]  Xing Chen,et al.  HGIMDA: Heterogeneous graph inference for miRNA-disease association prediction , 2016, Oncotarget.

[52]  Anton J. Enright,et al.  spongeScan: A web for detecting microRNA binding elements in lncRNA sequences , 2016, Nucleic Acids Res..

[53]  Lan Jin,et al.  RNAi induced hepatotoxicity results from loss of the first synthesized isoform of miR-122 in mice , 2016, Nature Medicine.

[54]  Edwin Wang,et al.  Aberrant allele frequencies of the SNPs located in microRNA target sites are potentially associated with human cancers , 2007, Nucleic acids research.

[55]  P. Bork,et al.  Association of genes to genetically inherited diseases using data mining , 2002, Nature Genetics.

[56]  P. Radivojac,et al.  An integrated approach to inferring gene–disease associations in humans , 2008, Proteins.

[57]  Q. Zou,et al.  Similarity computation strategies in the microRNA-disease network: a survey. , 2015, Briefings in functional genomics.

[58]  R. Bernards,et al.  Enabling personalized cancer medicine through analysis of gene-expression patterns , 2008, Nature.

[59]  Q. Cui,et al.  Principles of microRNA regulation of a human cellular signaling network , 2006, Molecular systems biology.

[60]  Keqin Li,et al.  Network Consistency Projection for Human miRNA-Disease Associations Inference , 2016, Scientific Reports.

[61]  Xing Chen,et al.  Semi-supervised learning for potential human microRNA-disease associations inference , 2014, Scientific Reports.

[62]  Yadong Wang,et al.  Prioritization of disease microRNAs through a human phenome-microRNAome network , 2010, BMC Systems Biology.

[63]  Kesheng Liu,et al.  Information Flow Analysis of Interactome Networks , 2009, PLoS Comput. Biol..

[64]  Yadong Wang,et al.  Weighted Network-Based Inference of Human MicroRNA-Disease Associations , 2010, 2010 Fifth International Conference on Frontier of Computer Science and Technology.

[65]  F. Slack,et al.  Oncomirs — microRNAs with a role in cancer , 2006, Nature Reviews Cancer.

[66]  Krishna R. Kalari,et al.  Genetic association with overall survival of taxane-treated lung cancer patients - a genome-wide association study in human lymphoblastoid cell lines followed by a clinical association study , 2012, BMC Cancer.

[67]  Yan Zeng,et al.  Alteration of microRNA expression in vinyl carbamate-induced mouse lung tumors and modulation by the chemopreventive agent indole-3-carbinol. , 2010, Carcinogenesis.

[68]  Xiangxiang Zeng,et al.  Inferring MicroRNA-Disease Associations by Random Walk on a Heterogeneous Network with Multiple Data Sources , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[69]  H. Horvitz,et al.  MicroRNA expression profiles classify human cancers , 2005, Nature.

[70]  Xia Li,et al.  ncRDeathDB: A comprehensive bioinformatics resource for deciphering network organization of the ncRNA-mediated cell death system , 2015, Autophagy.

[71]  C. Croce,et al.  Human microRNA genes are frequently located at fragile sites and genomic regions involved in cancers. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[72]  Yi Pan,et al.  Predicting MicroRNA-Disease Associations Based on Improved MicroRNA and Disease Similarities , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[73]  Y. Wang,et al.  Mammalian ncRNA-disease repository: a global view of ncRNA-mediated disease network , 2013, Cell Death and Disease.

[74]  Hailin Chen,et al.  Similarity-based methods for potential human microRNA-disease association prediction , 2013, BMC Medical Genomics.

[75]  Ivan Merelli,et al.  A multilevel data integration resource for breast cancer study , 2010, BMC Systems Biology.

[76]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[77]  Eckart Meese,et al.  Stable serum miRNA profiles as potential tool for non-invasive lung cancer diagnosis , 2011, RNA biology.

[78]  Qionghai Dai,et al.  RBMMMDA: predicting multiple types of disease-microRNA associations , 2015, Scientific Reports.

[79]  Xia Li,et al.  Walking the interactome to identify human miRNA-disease associations through the functional link between miRNA targets and disease genes , 2013, BMC Systems Biology.

[80]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2004, Nucleic Acids Res..

[81]  Xiangxiang Zeng,et al.  Prediction and Validation of Disease Genes Using HeteSim Scores , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[82]  Dennis Shasha,et al.  miRò: a miRNA knowledge base , 2009, Database J. Biol. Databases Curation.

[83]  Jan Gorodkin,et al.  Protein-driven inference of miRNA–disease associations , 2013, Bioinform..

[84]  Xiaobo Zhou,et al.  An information-flow-based model with dissipation, saturation and direction for active pathway inference , 2009, BMC Systems Biology.

[85]  Changning Liu,et al.  dbDEMC: a database of differentially expressed miRNAs in human cancers , 2010, BMC Genomics.

[86]  Q. Zou,et al.  Prediction of MicroRNA-Disease Associations Based on Social Network Analysis Methods , 2015, BioMed Research International.

[87]  Dong Wang,et al.  Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases , 2010, Bioinform..

[88]  Xing Chen,et al.  RWRMDA: predicting novel human microRNA-disease associations. , 2012, Molecular bioSystems.

[89]  Yan Huang,et al.  RNALocate: a resource for RNA subcellular localizations , 2016, Nucleic Acids Res..

[90]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[91]  Yadong Wang,et al.  miR2Disease: a manually curated database for microRNA deregulation in human disease , 2008, Nucleic Acids Res..

[92]  R. Place,et al.  miR-449a targets HDAC-1 and induces growth arrest in prostate cancer , 2009, Oncogene.

[93]  Pall I. Olason,et al.  A human phenome-interactome network of protein complexes implicated in genetic disorders , 2007, Nature Biotechnology.