Predicting human microRNA-disease associations based on support vector machine

The identification of disease-related microRNAs is vital for understanding the pathogenesis of disease at the molecular level and may lead to the design of specific molecular tools for diagnosis, treatment and prevention. Experimental identification of disease-related microRNAs poses difficulties. Computational prediction of microRNA-disease associations is one of the complementary means. However, one major issue in microRNA studies is the lack of bioinformatics programs to accurately predict microRNA-disease associations. Herein, we present a machine-learning-based approach for distinguishing positive microRNA-disease associations from negative microRNA-disease associations. A set of features was extracted for each positive and negative microRNA-disease association, and a Support Vector Machine SVM classifier was trained, which achieved the area under the ROC curve of up to 0.8884 in 10-fold cross-validation procedure, indicating that the SVM-based approach described here can be used to predict potential microRNA-disease associations and formulate testable hypotheses to guide future biological experiments.

[1]  M. Wood,et al.  Short non-coding RNA biology and neurodegenerative disorders: novel disease targets and therapeutics. , 2009, Human molecular genetics.

[2]  Byoung-Tak Zhang,et al.  miTarget: microRNA target gene prediction using a support vector machine , 2006, BMC Bioinformatics.

[3]  Doheon Lee,et al.  Predicting disease phenotypes based on the molecular networks with Condition-Responsive Correlation , 2011, Int. J. Data Min. Bioinform..

[4]  Alexander Gammerman,et al.  Sequence alignment kernel for recognition of promoter regions , 2003, Bioinform..

[5]  Xiaoyan Zhu,et al.  Discovering breast cancer drug candidates from biomedical literature , 2010, Int. J. Data Min. Bioinform..

[6]  Vladimir Cherkassky,et al.  The Nature Of Statistical Learning Theory , 1997, IEEE Trans. Neural Networks.

[7]  Michael Q. Zhang,et al.  Network-based global inference of human disease genes , 2008, Molecular systems biology.

[8]  Yadong Wang,et al.  miR2Disease: a manually curated database for microRNA deregulation in human disease , 2008, Nucleic Acids Res..

[9]  Pall I. Olason,et al.  A human phenome-interactome network of protein complexes implicated in genetic disorders , 2007, Nature Biotechnology.

[10]  Fei Li,et al.  Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine , 2005, BMC Bioinformatics.

[11]  C. Croce,et al.  MicroRNA signatures in human cancers , 2006, Nature Reviews Cancer.

[12]  Michael Kertesz,et al.  The role of site accessibility in microRNA target recognition , 2007, Nature Genetics.

[13]  Tyler E. Miller,et al.  MicroRNA-221/222 Confers Tamoxifen Resistance in Breast Cancer by Targeting p27Kip1*♦ , 2008, Journal of Biological Chemistry.

[14]  Robert A. Weinberg,et al.  Tumour invasion and metastasis initiated by microRNA-10b in breast cancer (Nature (2007) 449, (682-688)) , 2008 .

[15]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[16]  F. Slack,et al.  MicroRNA in cancer prognosis. , 2008, The New England journal of medicine.

[17]  Rosario M. Piro,et al.  Prediction of Human Disease Genes by Human-Mouse Conserved Coexpression Analysis , 2008, PLoS Comput. Biol..

[18]  G. Vriend,et al.  A text-mining analysis of the human phenome , 2006, European Journal of Human Genetics.

[19]  Nitai P. Bhattacharyya,et al.  MicroRNA Interaction network in human: implications of clustered microRNA in biological pathways and genetic diseases , 2009, 0901.4211.

[20]  Yi Xing,et al.  The Bifunctional microRNA miR-9/miR-9* Regulates REST and CoREST and Is Downregulated in Huntington's Disease , 2008, The Journal of Neuroscience.

[21]  Vassilis Georgoulias,et al.  Prognostic value of mature microRNA-21 and microRNA-205 overexpression in non-small cell lung cancer by quantitative real-time RT-PCR. , 2008, Clinical chemistry.

[22]  Q. Cui,et al.  An Analysis of Human MicroRNA and Disease Associations , 2008, PloS one.

[23]  William Stafford Noble,et al.  Kernel methods for predicting protein-protein interactions , 2005, ISMB.

[24]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[25]  C. Burge,et al.  Prediction of Mammalian MicroRNA Targets , 2003, Cell.

[26]  Ziv Bar-Joseph,et al.  Evaluation of different biological data and computational classification methods for use in protein interaction prediction , 2006, Proteins.

[27]  E. Miska,et al.  How microRNAs control cell division, differentiation and death. , 2005, Current opinion in genetics & development.

[28]  R. Duisters,et al.  MIRNA-133 AND MIRNA-30 REGULATE CONNECTIVE TISSUE GROWTH FACTOR: IMPLICATIONS FOR A ROLE OF MIRNAS IN MYOCARDIAL MATRIX REMODELING , 2013 .

[29]  Yadong Wang,et al.  Prioritization of disease microRNAs through a human phenome-microRNAome network , 2010, BMC Systems Biology.

[30]  Aimee L Jackson,et al.  Coordinated regulation of cell cycle transcripts by p53-Inducible microRNAs, miR-192 and miR-215. , 2008, Cancer research.