CLP-ID: Community-based link prediction using information diffusion

Abstract In recent years, most link prediction algorithms have focused on node similarity owing to the associated low computational complexity and promising accuracy. In addition to the classical CN-based indexes, some methods are based on network features, such as community structure, information dissemination, and intermediary influence probability, which are used for link prediction. Although these methods provide new insight into the problem and achieve improvements in certain respects, they also have some limitations. For example, it is difficult to predict target links if the number of interconnections between communities is small. However, most studies aim at achieving higher link prediction accuracy even though a network obtained by these methods is not optimized for information spread. Therefore, we propose a community-based link prediction method using an information diffusion algorithm (CLP-ID) to predict the missing links. First, we present a community detection algorithm that divides the network into clusters. Then, a novel algorithm based on information diffusion and community structure is proposed to predict target links. Finally, we conduct experiments on real-world networks to validate the performance of the proposed algorithm and compare it with state-of-the-art algorithms. Statistical tests demonstrate that the proposed method significantly differs from state-of-the-art algorithms.

[1]  Deying Li,et al.  Minimum cost seed set for competitive social influence , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[2]  Dong Li,et al.  Exploiting Information Diffusion Feature for Link Prediction in Sina Weibo , 2016, Scientific reports.

[3]  G. Labianca,et al.  Group Social Capital and Group Effectiveness: The Role of Informal Socializing Ties , 2004 .

[4]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[5]  Mohammad Al Hasan,et al.  Link prediction using supervised learning , 2006 .

[6]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[7]  Le Song,et al.  Co-evolutionary Dynamics of Information Diffusion and Network Structure , 2015, WWW.

[8]  Fang Liu,et al.  Prediction of missing links based on community relevance and ruler inference , 2016, Knowl. Based Syst..

[9]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Jing Wang,et al.  Link prediction with node clustering coefficient , 2015, 1510.07819.

[11]  Jon Kleinberg,et al.  The link prediction problem for social networks , 2003, CIKM '03.

[12]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[13]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[14]  Linyuan Lü,et al.  Toward link predictability of complex networks , 2015, Proceedings of the National Academy of Sciences.

[15]  Bernhard Schölkopf,et al.  Influence Maximization in Continuous Time Diffusion Networks , 2012, ICML.

[16]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[17]  Yu Wang,et al.  Influence Maximization on Large-Scale Mobile Social Network: A Divide-and-Conquer Method , 2015, IEEE Transactions on Parallel and Distributed Systems.

[18]  Zhen Liu,et al.  Link prediction in complex networks: A local naïve Bayes model , 2011, ArXiv.

[19]  Licheng Jiao,et al.  Prediction of missing links based on multi-resolution community division , 2015 .

[20]  Bhaskar Biswas,et al.  Community-based link prediction , 2017, Multimedia Tools and Applications.

[21]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[22]  Kuldeep Singh,et al.  C2IM: Community based context-aware influence maximization in social networks , 2019, Physica A: Statistical Mechanics and its Applications.

[23]  Rajeev Rastogi,et al.  Recommendations to boost content spread in social networks , 2012, WWW.

[24]  Bhaskar Biswas,et al.  Analyzing evolutionary optimization and community detection algorithms using regression line dominance , 2017, Inf. Sci..

[25]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[27]  Kuldeep Singh,et al.  CoIM: Community-Based Influence Maximization in Social Networks , 2018 .

[28]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[29]  Zan Huang Link Prediction Based on Graph Topology: The Predictive Value of Generalized Clustering Coefficient , 2010 .

[30]  Timothy Ravasi,et al.  From link-prediction in brain connectomes and protein interactomes to the local-community-paradigm in complex networks , 2013, Scientific Reports.

[31]  Kuldeep Singh,et al.  MIM2: Multiple influence maximization across multiple social networks , 2019, Physica A: Statistical Mechanics and its Applications.

[32]  Xin Wang,et al.  Friend recommendation with content spread enhancement in social networks , 2015, Inf. Sci..

[33]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[34]  Kristina Lerman,et al.  A Visibility-based Model for Link Prediction in Social Media , 2014 .

[35]  Roger Guimerà,et al.  Missing and spurious interactions and the reconstruction of complex networks , 2009, Proceedings of the National Academy of Sciences.

[36]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[37]  Steve Gregory,et al.  Finding missing edges in networks based on their community structure , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  Kuldeep Singh,et al.  Level-2 node clustering coefficient-based link prediction , 2019, Applied Intelligence.

[39]  Bhaskar Biswas,et al.  Investigating community structure in perspective of ego network , 2015, Expert Syst. Appl..

[40]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[41]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[42]  Huaiyu Wan,et al.  Predicting top-L missing links with node and link clustering information in large-scale networks , 2016 .

[43]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[44]  Kuldeep Singh,et al.  LAPSO-IM: A learning-based influence maximization approach for social networks , 2019, Appl. Soft Comput..

[45]  A. Barabasi,et al.  Evolution of the social network of scientific collaborations , 2001, cond-mat/0104162.

[46]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[47]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[48]  Bhaskar Biswas,et al.  FuzAg: Fuzzy Agglomerative Community Detection by Exploring the Notion of Self-Membership , 2018, IEEE Transactions on Fuzzy Systems.