Biased random walk with restart for link prediction with graph embedding method

Abstract Link prediction is an important problem in topics of complex networks, which can be applied to many practical scenarios such as information retrieval and marketing analysis. Strategies based on random walk are commonly used to address this problem. In common practice of a random walk, a link predictor may move from one node to one of its neighbors with uniform transferring probability regardless of the characteristics of the local structure around that node, which, however, may contain useful information for a successful prediction. In this paper, we propose a refined random walk approach which incorporates graph embedding method. This approach may provide biased transferring probabilities to perform random walk so as to further exploit topological properties embedded in the network structure. The performance of proposed method is examined by comparing with other commonly used indexes. Results show that our method outperforms all these indexes reflected by better prediction accuracy.

[1]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Edwin R. Hancock,et al.  net4Lap: Neural Laplacian Regularization for Ranking and Re-Ranking , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[3]  Jianquan Liu,et al.  Link prediction: the power of maximal entropy random walk , 2011, CIKM '11.

[4]  David F. Gleich,et al.  Neighborhood and PageRank methods for pairwise link prediction , 2020, Social Network Analysis and Mining.

[5]  Lyle H. Ungar,et al.  Statistical Relational Learning for Link Prediction , 2003 .

[6]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[7]  Ramesh R. Sarukkai,et al.  Link prediction and path analysis using Markov chains , 2000, Comput. Networks.

[8]  Christos Faloutsos,et al.  Fast Random Walk with Restart and Its Applications , 2006, Sixth International Conference on Data Mining (ICDM'06).

[9]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[10]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[11]  Alan Bundy,et al.  Dynamic Time Warping , 1984 .

[12]  Daniel R. Figueiredo,et al.  struc2vec: Learning Node Representations from Structural Identity , 2017, KDD.

[13]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[14]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[15]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[16]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[17]  Woo-Sung Jung,et al.  Accuracy Test for Link Prediction in terms of Similarity Index: The Case of WS and BA Models , 2015, ArXiv.

[18]  T. Zhou,et al.  Effective and Efficient Similarity Index for Link Prediction of Complex Networks , 2009, 0905.3558.

[19]  Linyuan Lu,et al.  SIMILARITY-BASED CLASSIFICATION IN PARTIALLY LABELED NETWORKS , 2010 .

[20]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[21]  Linyuan Lu,et al.  Link prediction based on local random walk , 2010, 1001.2467.

[22]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[23]  Michael J. Brusco,et al.  A note on using the adjusted Rand index for link prediction in networks , 2015, Soc. Networks.

[24]  Manuel Curado,et al.  Return random walks for link prediction , 2020, Inf. Sci..

[25]  M. Randic,et al.  Resistance distance , 1993 .

[26]  Jinhong Jung,et al.  Supervised and extended restart in random walks for ranking and link prediction in networks , 2017, PloS one.

[27]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.