A Clustering-based Link Prediction Method in Social Networks

Link prediction is an important task in social network analysis, which also has applications in other domains like, recommender systems, molecular biology and criminal investigations. The classical methods of link prediction are based on graph topology structure and path features but few consider clustering information. The cluster in graphs is densely connected group of vertices and sparsely connected to other groups. Actually, the clustering results contain the essential information for link prediction, and these vertices common neighbors may play different roles depending on if they belong to the same cluster. Based on this assumption and characteristics of the common social networks, in this paper, we propose a link prediction method based on clustering and global information. Our experiments on both synthetic and real-world networks show that this method can improve link prediction accuracy as the number of cluster grows.

[1]  Ke Xu,et al.  Link prediction in complex networks: a clustering perspective , 2011, The European Physical Journal B.

[2]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[3]  M. Newman,et al.  Vertex similarity in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[5]  Hsinchun Chen,et al.  Link prediction approach to collaborative filtering , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[6]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  Mohammad Al Hasan,et al.  Link prediction using supervised learning , 2006 .

[8]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[9]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[10]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[11]  Ben Taskar,et al.  Link Prediction in Relational Data , 2003, NIPS.

[12]  Roger Guimerà,et al.  Missing and spurious interactions and the reconstruction of complex networks , 2009, Proceedings of the National Academy of Sciences.

[13]  Robert Ackland,et al.  Mapping the U.S. Political Blogosphere: Are Conservative Bloggers More Prominent? , 2005 .

[14]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[15]  Seymour Geisser,et al.  8. Predictive Inference: An Introduction , 1995 .

[16]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[17]  Zhen Liu,et al.  Link prediction in complex networks: A local naïve Bayes model , 2011, ArXiv.

[18]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Lise Getoor,et al.  Link mining: a survey , 2005, SKDD.

[20]  Linyuan Lu,et al.  Link prediction based on local random walk , 2010, 1001.2467.

[21]  Lise Getoor,et al.  Link mining: a new data mining challenge , 2003, SKDD.

[22]  Pablo M. Gleiser,et al.  Community Structure in Jazz , 2003, Adv. Complex Syst..

[23]  A. Barabasi,et al.  Evolution of the social network of scientific collaborations , 2001, cond-mat/0104162.

[24]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[25]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[26]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Padhraic Smyth,et al.  Prediction and ranking algorithms for event-based network data , 2005, SKDD.

[28]  Bao-qun Yin,et al.  Power-law strength-degree correlation from resource-allocation dynamics on weighted networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  John E. Hopcroft,et al.  Using community information to improve the precision of link prediction methods , 2012, WWW.