iBridge: Inferring bridge links that diffuse information across communities

Abstract While the accuracy of link prediction has been improved continuously, the utility of the inferred new links is rarely concerned especially when it comes to information diffusion. This paper defines the utility of links based on average shortest distance and more importantly defines a special type of links named bridge links based on community structure (overlapping or not) of the network. In sociology, bridge links are usually regarded as weak ties and play a more crucial role in information diffusion. Considering that the accuracy of previous link prediction methods is high in predicting strong ties but not much high in predicting weak ties, we propose a new link prediction method named iBridge, which aims to infer new bridge links using biased structural metrics in a PU (positive and unlabeled) learning framework. The experimental results in 3 real online social networks show that iBridge outperforms several comparative link prediction methods (based on supervised learning or PU learning) in inferring the bridge links and meantime, the overall performance of inferring bridge links and non-bridge links is not compromised, thus verifying its robustness in inferring all new links.

[1]  Zhen Liu,et al.  Link prediction in complex networks: A local naïve Bayes model , 2011, ArXiv.

[2]  Tao Zhou,et al.  Link prediction in weighted networks: The role of weak ties , 2010 .

[3]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[4]  Purnamrita Sarkar,et al.  Nonparametric Link Prediction in Dynamic Networks , 2012, ICML.

[5]  Hui Li,et al.  A Deep Learning Approach to Link Prediction in Dynamic Networks , 2014, SDM.

[6]  Pasquale De Meo,et al.  On Facebook, most ties are weak , 2012, Commun. ACM.

[7]  Ramesh R. Sarukkai,et al.  Link prediction and path analysis using Markov chains , 2000, Comput. Networks.

[8]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[9]  Mong-Li Lee,et al.  Mining Brokers in Dynamic Social Networks , 2015, CIKM.

[10]  Philip S. Yu,et al.  Meta-path based multi-network collective link prediction , 2014, KDD.

[11]  Philip S. Yu,et al.  A Survey of Heterogeneous Information Network Analysis , 2015, IEEE Transactions on Knowledge and Data Engineering.

[12]  Terry Hui-Ye Chiu,et al.  Propagating online social networks via different kinds of weak ties , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[13]  Junjie Wu,et al.  Weak ties: subtle role of information diffusion in online social networks. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Phi Vu Tran,et al.  Learning to Make Predictions on Graphs with Autoencoders , 2018, 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA).

[15]  Lise Getoor,et al.  Active Learning for Networked Data , 2010, ICML.

[16]  Nitesh V. Chawla,et al.  Link Prediction and Recommendation across Heterogeneous Social Networks , 2012, 2012 IEEE 12th International Conference on Data Mining.

[17]  See-Kiong Ng,et al.  Negative Training Data Can be Harmful to Text Classification , 2010, EMNLP.

[18]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[20]  Srinivasan Parthasarathy,et al.  Local Probabilistic Models for Link Prediction , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[21]  Evaggelia Pitoura,et al.  Selecting Shortcuts for a Smaller World , 2015, SDM.

[22]  Linyuan Lu,et al.  Role of weak ties in link prediction of complex networks , 2009, CIKM-CNIKM.

[23]  Florence d'Alché-Buc,et al.  Semi-supervised Penalized Output Kernel Regression for Link Prediction , 2011, ICML.

[24]  Jianhua Ruan,et al.  A novel link prediction algorithm for reconstructing protein-protein interaction networks by topological similarity , 2013, Bioinform..

[25]  Boleslaw K. Szymanski,et al.  Towards Linear Time Overlapping Community Detection in Social Networks , 2012, PAKDD.

[26]  Lada A. Adamic,et al.  The role of social networks in information diffusion , 2012, WWW.

[27]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[28]  R. Burt Structural Holes and Good Ideas1 , 2004, American Journal of Sociology.

[29]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[30]  Yin Zhang,et al.  Scalable proximity estimation and link prediction in online social networks , 2009, IMC '09.

[31]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[32]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[33]  Daniel Dajun Zeng,et al.  A Link Prediction Approach to Anomalous Email Detection , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[34]  Linyuan Lü,et al.  Toward link predictability of complex networks , 2015, Proceedings of the National Academy of Sciences.

[35]  Hui Tian,et al.  Hidden link prediction based on node centrality and weak ties , 2013 .

[36]  Philip S. Yu,et al.  Positive and Unlabeled Learning for Graph Classification , 2011, 2011 IEEE 11th International Conference on Data Mining.

[37]  Philip S. Yu,et al.  A framework for dynamic link prediction in heterogeneous networks , 2014, Stat. Anal. Data Min..

[38]  Yoshihiro Yamanishi,et al.  propagation: A fast semisupervised learning algorithm for link prediction , 2009 .

[39]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.