Metrical Consistency NMF for Predicting Gene–Phenotype Associations

Discovering gene–phenotype associations is significant to understand the disease mechanisms. Nonnegative matrix factorization (NMF) has been widely used in computational biology for its good performance and interpretability. In this paper, we proposed a novel metrical consistency NMF (MCNMF) method for candidate gene prioritization. The MCNMF method assume that phenotype similarities, calculated from various independent ways, should be consistent in case that the associations between genes and phenotypes are completely known. Experiment results show that our method can recover the gene–phenotype associations effectively and outperform the comparative methods.

[1]  Jim Jing-Yan Wang,et al.  Max-min distance nonnegative matrix factorization , 2013, Neural Networks.

[2]  John O. Woods,et al.  Prediction and Validation of Gene-Disease Associations Using Methods Inspired by Social Network Analyses , 2013, PloS one.

[3]  Fei Wang,et al.  Community discovery using nonnegative matrix factorization , 2011, Data Mining and Knowledge Discovery.

[4]  Jim Jing-Yan Wang,et al.  Multiple graph regularized nonnegative matrix factorization , 2013, Pattern Recognit..

[5]  Zonghua Li,et al.  Nuclear norm regularized convolutional Max Pos@Top machine , 2016, Neural Computing and Applications.

[6]  Thomas S. Huang,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[7]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[8]  Damian Smedley,et al.  The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data , 2014, Nucleic Acids Res..

[9]  Jagdish Chandra Patra,et al.  Genome-wide inferring gene-phenotype relationship by walking on the heterogeneous network , 2010, Bioinform..

[10]  Yi Gu,et al.  Optimizing top precision performance measure of content-based image retrieval by learning similarity function , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[11]  Michael Q. Zhang,et al.  Network-based global inference of human disease genes , 2008, Molecular systems biology.

[12]  Xiaolei Wang,et al.  Non-negative matrix factorization by maximizing correntropy for cancer clustering , 2013, BMC Bioinformatics.

[13]  Roded Sharan,et al.  Associating Genes and Protein Complexes with Disease via Network Propagation , 2010, PLoS Comput. Biol..

[14]  Christie S. Chang,et al.  The BioGRID interaction database: 2013 update , 2012, Nucleic Acids Res..

[15]  Yan Wang,et al.  Predicting disease genes based on normalized protein modules and phenotype ontology , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[16]  D. Botstein,et al.  Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease , 2003, Nature Genetics.

[17]  Jim Jing-Yan Wang,et al.  Feature selection and multi-kernel learning for adaptive graph regularized nonnegative matrix factorization , 2015, Expert Syst. Appl..

[18]  Xiaojun Wu,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Tao Jiang,et al.  Uncover disease genes by maximizing information flow in the phenome–interactome network , 2011, Bioinform..

[20]  Jim Jing-Yan Wang,et al.  Beyond cross-domain learning: Multiple-domain nonnegative matrix factorization , 2014, Eng. Appl. Artif. Intell..

[21]  Xin Gao,et al.  Adaptive graph regularized Nonnegative Matrix Factorization via feature selection , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[22]  Nagarajan Natarajan,et al.  Inductive matrix completion for predicting gene–disease associations , 2014, Bioinform..