A graph regularized non-negative matrix factorization method for identifying microRNA-disease associations

Motivation MicroRNAs (miRNAs) play crucial roles in post-transcriptional regulations and various cellular processes. The identification of disease-related miRNAs provides great insights into the underlying pathogenesis of diseases at a system level. However, most existing computational approaches are biased towards known miRNA-disease associations, which is inappropriate for those new diseases or miRNAs without any known association information. Results In this study, we propose a new method with graph regularized non-negative matrix factorization in heterogeneous omics data, called GRNMF, to discover potential associations between miRNAs and diseases, especially for new diseases and miRNAs or those diseases and miRNAs with sparse known associations. First, we integrate the disease semantic information and miRNA functional information to estimate disease similarity and miRNA similarity, respectively. Considering that there is no available interaction observed for new diseases or miRNAs, a preprocessing step is developed to construct the interaction score profiles that will assist in prediction. Next, a graph regularized non-negative matrix factorization framework is utilized to simultaneously identify potential associations for all diseases. The results indicated that our proposed method can effectively prioritize disease-associated miRNAs with higher accuracy compared with other recent approaches. Moreover, case studies also demonstrated the effectiveness of GRNMF to infer unknown miRNA-disease associations for those novel diseases and miRNAs. Availability The code of GRNMF is freely available at https://github.com/XIAO-HN/GRNMF/. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  E. Marcotte,et al.  Prioritizing candidate disease genes by network-based boosting of genome-wide association data. , 2011, Genome research.

[2]  Tapio Pahikkala,et al.  Toward more realistic drug^target interaction predictions , 2014 .

[3]  Cheng Liang,et al.  Mirsynergy: detecting synergistic miRNA regulatory modules by overlapping neighbourhood expansion , 2014, Bioinform..

[4]  Zhigang Luo,et al.  Manifold Regularized Discriminative Nonnegative Matrix Factorization With Fast Gradient Descent , 2011, IEEE Transactions on Image Processing.

[5]  P. Sarnow,et al.  Modulation of Hepatitis C Virus RNA Abundance by a Liver-Specific MicroRNA , 2005, Science.

[6]  MengChu Zhou,et al.  A Nonnegative Latent Factor Model for Large-Scale Sparse Matrices in Recommender Systems via Alternating Direction Method , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Philip S. Yu,et al.  A new method to measure the semantic similarity of GO terms , 2007, Bioinform..

[8]  Jan Gorodkin,et al.  Protein-driven inference of miRNA–disease associations , 2013, Bioinform..

[9]  Hsien-Da Huang,et al.  miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database , 2015, Nucleic Acids Res..

[10]  Cheng Liang,et al.  A Novel Method to Detect Functional microRNA Regulatory Modules by Bicliques Merging , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[11]  Zhu-Hong You,et al.  Using manifold embedding for assessing and predicting protein interactions from high-throughput experimental data , 2010, Bioinform..

[12]  Nectarios Koziris,et al.  TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support , 2011, Nucleic Acids Res..

[13]  Haiyuan Yu,et al.  Detecting overlapping protein complexes in protein-protein interaction networks , 2012, Nature Methods.

[14]  Shangwei Ning,et al.  Prioritizing human cancer microRNAs based on genes’ functional consistency between microRNA and cancer , 2011, Nucleic acids research.

[15]  Heiko Wersing,et al.  A Model for Learning Topographically Organized Parts-Based Representations of Objects in Visual Cortex: Topographic Nonnegative Matrix Factorization , 2009, Neural Computation.

[16]  Xiaobo Zhou,et al.  Nonconvex Penalty Based Low-Rank Representation and Sparse Regression for eQTL Mapping , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  Juan Xu,et al.  Prioritizing Candidate Disease miRNAs by Topological Features in the miRNA Target–Dysregulated Network: Case Study of Prostate Cancer , 2011, Molecular Cancer Therapeutics.

[18]  Xuelong Li,et al.  Graph Regularized Non-Negative Low-Rank Matrix Factorization for Image Clustering , 2017, IEEE Transactions on Cybernetics.

[19]  Qionghai Dai,et al.  WBSMDA: Within and Between Score for MiRNA-Disease Association prediction , 2016, Scientific Reports.

[20]  Tongbin Li,et al.  miRecords: an integrated resource for microRNA–target interactions , 2008, Nucleic Acids Res..

[21]  Xing Chen,et al.  Semi-supervised learning for potential human microRNA-disease associations inference , 2014, Scientific Reports.

[22]  Jiawei Luo,et al.  A novel approach for predicting microRNA-disease associations by unbalanced bi-random walk on heterogeneous network , 2017, J. Biomed. Informatics.

[23]  Cheng Liang,et al.  Collective Prediction of Disease-Associated miRNAs Based on Transduction Learning , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[24]  Xiangxiang Zeng,et al.  Integrative approaches for predicting microRNA function and prioritizing disease-related microRNA using biological interaction networks , 2016, Briefings Bioinform..

[25]  Lei Zhang,et al.  Tumor Clustering Using Nonnegative Matrix Factorization With Gene Selection , 2009, IEEE Transactions on Information Technology in Biomedicine.

[26]  Jiawei Luo,et al.  A path-based measurement for human miRNA functional similarities using miRNA-disease associations , 2016, Scientific Reports.

[27]  Yang Li,et al.  HMDD v2.0: a database for experimentally supported human microRNA and disease associations , 2013, Nucleic Acids Res..

[28]  Wen Gao,et al.  Progressive Image Denoising Through Hybrid Graph Laplacian Regularization: A Unified Framework , 2014, IEEE Transactions on Image Processing.

[29]  W. Ritchie,et al.  Predicting microRNA targets and functions: traps for the unwary , 2009, Nature Methods.

[30]  Wei Tang,et al.  dbDEMC 2.0: updated database of differentially expressed miRNAs in human cancers , 2016, Nucleic Acids Res..

[31]  Xin Gao,et al.  Adaptive graph regularized Nonnegative Matrix Factorization via feature selection , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[32]  Xing Chen,et al.  RWRMDA: predicting novel human microRNA-disease associations. , 2012, Molecular bioSystems.

[33]  Jiuyong Li,et al.  Identifying miRNAs, targets and functions , 2012, Briefings Bioinform..

[34]  Dong Wang,et al.  Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases , 2010, Bioinform..

[35]  Yun Xiao,et al.  MiRNA–miRNA synergistic network: construction via co-regulating functional modules and disease miRNA topological features , 2010, Nucleic acids research.

[36]  Di Wu,et al.  miRCancer: a microRNA-cancer association database constructed by text mining on literature , 2013, Bioinform..

[37]  Cheng Liang,et al.  Predicting MicroRNA-Disease Associations Using Kronecker Regularized Least Squares Based on Heterogeneous Omics Data , 2017, IEEE Access.

[38]  De-Shuang Huang,et al.  A Two-Stage Geometric Method for Pruning Unreliable Links in Protein-Protein Networks , 2015, IEEE Transactions on NanoBioscience.

[39]  Francisco Facchinei,et al.  Solving quasi-variational inequalities via their KKT conditions , 2014, Math. Program..

[40]  Xia Li,et al.  Prediction of potential disease-associated microRNAs based on random walk , 2015, Bioinform..

[41]  De-Shuang Huang,et al.  Independent component analysis-based penalized discriminant method for tumor classification using gene expression data , 2006, Bioinform..

[42]  Yun Xiao,et al.  Prioritizing Candidate Disease miRNAs by Topological Features in the miRNA Target–Dysregulated Network: Case Study of Prostate Cancer , 2011, Molecular Cancer Therapeutics.

[43]  Eckart Meese,et al.  Stable serum miRNA profiles as potential tool for non-invasive lung cancer diagnosis , 2011, RNA biology.

[44]  Xiaojun Wu,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Hongyu Zhao,et al.  Joint analysis of expression profiles from multiple cancers improves the identification of microRNA-gene interactions , 2013, Bioinform..

[46]  Xing-Ming Zhao,et al.  Identifying cancer-related microRNAs based on gene expression data , 2015, Bioinform..

[47]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[48]  Christos Sotiriou,et al.  MicroRNAs regulate KDM5 histone demethylases in breast cancer cells. , 2016, Molecular bioSystems.

[49]  Fernando Ortega,et al.  A non negative matrix factorization for collaborative filtering recommender systems based on a Bayesian probabilistic model , 2016, Knowl. Based Syst..

[50]  Jing Li,et al.  dbDEPC 2.0: updated database of differentially expressed proteins in human cancers , 2011, Nucleic Acids Res..

[51]  Cheng Liang,et al.  A novel motif-discovery algorithm to identify co-regulatory motifs in large transcription factor and microRNA co-regulatory networks in human , 2015, Bioinform..