Evaluating Link Prediction Accuracy in Dynamic Networks with Added and Removed Edges

The task of predicting future relationships in a social network, known as link prediction, has been studied extensively in the literature. Many link prediction methods have been proposed, ranging from common neighbors to probabilistic models. Recent work by Yang et al. has highlighted several challenges in evaluating link prediction accuracy. In dynamic networks where edges are both added and removed over time, the link prediction problem is more complex and involves predicting both newly added and newly removed edges. This results in new challenges in the evaluation of dynamic link prediction methods, and the recommendations provided by Yang et al. are no longer applicable, because they do not address edge removal. In this paper, we investigate several metrics currently used for evaluating accuracies of dynamic link prediction methods and demonstrate why they can be misleading in many cases. We provide several recommendations on evaluating dynamic link prediction accuracy, including separation into two categories of evaluation. Finally we propose a unified metric to characterize link prediction accuracy effectively using a single number.

[1]  Zan Huang,et al.  The Time-Series Link Prediction Problem with Applications in Communication Surveillance , 2009, INFORMS J. Comput..

[2]  Stan Matwin,et al.  Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[3]  Corinna Cortes,et al.  Computational Methods for Dynamic Graphs , 2003 .

[4]  Zehra Cataltepe,et al.  Link prediction using time series of neighborhood-based node similarity scores , 2015, Data Mining and Knowledge Discovery.

[5]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[6]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[7]  AcarEvrim,et al.  Temporal Link Prediction Using Matrix and Tensor Factorizations , 2011 .

[8]  Krishna P. Gummadi,et al.  On the evolution of user interaction in Facebook , 2009, WOSN '09.

[9]  Tamara G. Kolda,et al.  Temporal Link Prediction Using Matrix and Tensor Factorizations , 2010, TKDD.

[10]  Srikanta J. Bedathur,et al.  Towards time-aware link prediction in evolving social networks , 2009, SNA-KDD '09.

[11]  Zoubin Ghahramani,et al.  Dynamic Probabilistic Models for Latent Feature Propagation in Social Networks , 2013, ICML.

[12]  Nitesh V. Chawla,et al.  Evaluating link prediction methods , 2014, Knowledge and Information Systems.

[13]  Alfred O. Hero,et al.  Dynamic Stochastic Blockmodels for Time-Evolving Social Networks , 2014, IEEE Journal of Selected Topics in Signal Processing.

[14]  TishbyNaftali,et al.  Euclidean Embedding of Co-occurrence Data , 2007 .

[15]  Yihong Gong,et al.  Detecting communities and their evolutions in dynamic social networks—a Bayesian approach , 2011, Machine Learning.

[16]  Gal Chechik,et al.  Euclidean Embedding of Co-occurrence Data , 2004, J. Mach. Learn. Res..

[17]  Alfred O. Hero,et al.  A shrinkage approach to tracking dynamic networks , 2011, 2011 IEEE Statistical Signal Processing Workshop (SSP).

[18]  Mohammad Al Hasan,et al.  A Survey of Link Prediction in Social Networks , 2011, Social Network Data Analytics.

[19]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[20]  Kevin S. Xu Stochastic Block Transition Models for Dynamic Networks , 2014, AISTATS.

[21]  James R. Foulds,et al.  A Dynamic Relational Infinite Feature Model for Longitudinal Social Networks , 2011, AISTATS.

[22]  Jure Leskovec,et al.  Nonparametric Multi-group Membership Model for Dynamic Networks , 2013, NIPS.