Robust non-negative matrix factorization for link prediction in complex networks using manifold regularization and sparse learning

Abstract The aim of link prediction is to disclose the underlying evolution mechanism of networks, which could be utilized to predict missing links or eliminate spurious links. However, real-world networks data usually encounters challenges,such as missing links, spurious links and random noise, which seriously hamper the prediction accuracy of existing link prediction methods. Therefore, in this paper, we propose a novel Robust Non-negative Matrix Factorization via jointly Manifold regularization and Sparse learning (MS-RNMF) method in link prediction that solves the problems. Compared to existing methods, MS-RNMF has three-fold advantages: First of all, the MS-RNMF employ manifold regularization and k-medoids algorithm jointly to preserve the network local and global topology information. Besides, the MS-RNMF adopts l 2 , 1 -norm to constrain loss function and regularization term, random noise and spurious links could be effectively remove. Finally, we employ multiplicative updating rules to learn the model parameter and prove the convergence of the algorithm. Extensive experiments results performed on eleven real-world networks demonstrate that the MS-RNMF outperforms the state-of-the-arts methods in predicting missing links , identifying spurious links and eliminating random noise.

[1]  Yu Wang,et al.  Graph regularized nonnegative matrix factorization for temporal link prediction in dynamic networks , 2018 .

[2]  Pengfei Jiao,et al.  A perturbation-based framework for link prediction via non-negative matrix factorization , 2016, Scientific Reports.

[3]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[4]  Tao Zhou,et al.  Predicting missing links and identifying spurious links via likelihood analysis , 2016, Scientific Reports.

[5]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[6]  Thomas L. Griffiths,et al.  Nonparametric Latent Feature Models for Link Prediction , 2009, NIPS.

[7]  Thomas S. Huang,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[8]  Linyuan Lü,et al.  Similarity index based on local paths for link prediction of complex networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Hisashi Kashima,et al.  A Parameterized Probabilistic Model of Network Evolution for Supervised Link Prediction , 2006, Sixth International Conference on Data Mining (ICDM'06).

[10]  Xiaoke Ma,et al.  Nonnegative matrix factorization algorithms for link prediction in temporal networks using graph communicability , 2017, Pattern Recognit..

[11]  M. Newman,et al.  Vertex similarity in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Chris H. Q. Ding,et al.  Robust nonnegative matrix factorization using L21-norm , 2011, CIKM '11.

[13]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[14]  Ludovic Denoyer,et al.  Temporal link prediction by integrating content and structure information , 2011, CIKM '11.

[15]  Pengfei Jiao,et al.  Link predication based on matrix factorization by fusion of multi class organizations of the network , 2017, Scientific Reports.

[16]  Wei Yu,et al.  Kernel framework based on non-negative matrix factorization for networks reconstruction and link prediction , 2017, Knowl. Based Syst..

[17]  Chris H. Q. Ding,et al.  R1-PCA: rotational invariant L1-norm principal component analysis for robust subspace factorization , 2006, ICML.

[18]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[19]  Jun Zhu,et al.  Max-Margin Nonparametric Latent Feature Models for Link Prediction , 2012, ICML.

[20]  Jiye Liang,et al.  A fusion probability matrix factorization framework for link prediction , 2018, Knowl. Based Syst..

[21]  Chuang Liu,et al.  Multi-linear interactive matrix factorization , 2014, Knowl. Based Syst..

[22]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[23]  Roger Guimerà,et al.  Missing and spurious interactions and the reconstruction of complex networks , 2009, Proceedings of the National Academy of Sciences.

[24]  Fanghua Ye,et al.  Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection , 2018, CIKM.

[25]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Bin Li,et al.  Link prediction in multi-relational networks based on relational similarity , 2017, Inf. Sci..

[27]  Fuguo Zhang,et al.  Improving information filtering via network manipulation , 2012, ArXiv.

[28]  Pasquale De Meo,et al.  Mixing local and global information for community detection in large networks , 2013, J. Comput. Syst. Sci..

[29]  Wei Chu,et al.  Stochastic Relational Models for Discriminative Link Prediction , 2006, NIPS.

[30]  Bin Li,et al.  DeepEye: Link prediction in dynamic networks based on non-negative matrix factorization , 2018, Big Data Min. Anal..

[31]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[32]  Charles Elkan,et al.  Link Prediction via Matrix Factorization , 2011, ECML/PKDD.

[33]  Zhongfei Zhang,et al.  Dropout Training of Matrix Factorization and Autoencoder for Link Prediction in Sparse Graphs , 2015, SDM.

[34]  Nicola Parolini,et al.  Link Prediction in Criminal Networks: A Tool for Criminal Intelligence Analysis , 2016, PloS one.

[35]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[36]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[37]  Futian Wang,et al.  Measuring the robustness of link prediction algorithms under noisy environment , 2016, Scientific Reports.

[38]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[39]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.