Link prediction in complex networks: a clustering perspective

Abstract Link prediction is an open problem in the complex network, which attracts much research interest currently. However, little attention has been paid to the relation between network structure and the performance of prediction methods. In order to fill this vital gap, we try to understand how the network structure affects the performance of link prediction methods in the view of clustering. Our experiments on both synthetic and real-world networks show that as the clustering grows, the accuracy of these methods could be improved remarkably, while for the sparse and weakly clustered network, they perform poorly. We explain this through the distinguishment caused by increased clustering between the score distribution of positive and negative instances. Our finding also sheds light on the problem of how to select appropriate approaches for different networks with various densities and clusterings.

[1]  Padhraic Smyth,et al.  Prediction and ranking algorithms for event-based network data , 2005, SKDD.

[2]  David Lo,et al.  Mining interesting link formation rules in social networks , 2010, CIKM.

[3]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[4]  Jie Tang,et al.  Link Prediction of Social Networks Based on Weighted Proximity Measures , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[5]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[6]  Jon M. Kleinberg,et al.  The Directed Closure Process in Hybrid Social-Information Networks, with an Analysis of Link Formation on Twitter , 2010, ICWSM.

[7]  Beom Jun Kim Performance of networks of artificial neurons: the role of clustering. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Ying-Cheng Lai,et al.  Emergence of loop structure in scale-free networks and dynamical consequences. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[11]  Roger Guimerà,et al.  Missing and spurious interactions and the reconstruction of complex networks , 2009, Proceedings of the National Academy of Sciences.

[12]  Robert Ackland,et al.  Mapping the U.S. Political Blogosphere: Are Conservative Bloggers More Prominent? , 2005 .

[13]  Lise Getoor,et al.  Combining Collective Classification and Link Prediction , 2007, Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007).

[14]  Lise Getoor,et al.  Link mining: a survey , 2005, SKDD.

[15]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[16]  Lawrence B. Holder,et al.  Discovering Structural Anomalies in Graph-Based Data , 2007 .

[17]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[18]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[19]  Yin Zhang,et al.  Scalable proximity estimation and link prediction in online social networks , 2009, IMC '09.

[20]  David D. Jensen,et al.  The case for anomalous link discovery , 2005, SKDD.

[21]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[22]  Linyuan Lu,et al.  Link prediction based on local random walk , 2010, 1001.2467.

[23]  Mohammad Al Hasan,et al.  Link prediction using supervised learning , 2006 .