Reliable Link Inference for Network Data With Community Structures

Complex systems are often characterized by complex networks with links and entities. However, in many complex systems such as protein–protein interaction networks, recommender systems, and online communities, their links are hard to reveal directly, but they can be inaccurately observed by multiple data collection platforms or by a data collection platform at different times. Then, the links of the systems are inferred by the integration of the collected observations. As those data collection platforms are usually distributed over a large area and in different fields, their observations are unreliable and sensitive to the potential structures of the systems. In this paper, we consider the link inference problem in network data with community structures, in which the reliability of data collection platforms is unknown a priori and the link errors and reliability of platforms’ observations are heterogeneous to the underlying community structures of the systems. We propose an expectation maximization algorithm for link inference in a network system with community structures (EMLIC). The EMLIC algorithm is also used to infer the link errors and reliability of platforms’ observations in different communities. Experimental results on both synthetic data and eight real-world network data demonstrate that our algorithm is able to achieve lower link errors than the existing reliable link inference algorithms when the network data have community structures.

[1]  Pablo M. Gleiser,et al.  Community Structure in Jazz , 2003, Adv. Complex Syst..

[2]  Jie Liu,et al.  Multi-level learning based memetic algorithm for community detection , 2014, Appl. Soft Comput..

[3]  Gesine Reinert,et al.  Estimating the number of communities in a network , 2016, Physical review letters.

[4]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Mark E. J. Newman,et al.  Structural inference for uncertain networks , 2015, Physical review. E.

[6]  Santo Fortunato,et al.  Community detection in networks: A user guide , 2016, ArXiv.

[7]  Santo Fortunato,et al.  Network structure, metadata and the prediction of missing nodes , 2016, ArXiv.

[8]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Xiang Li,et al.  Spectral Analysis of Epidemic Thresholds of Temporal Networks , 2020, IEEE Transactions on Cybernetics.

[10]  Jianwu Dang,et al.  Robust Detection of Link Communities in Large Social Networks by Exploiting Link Semantics , 2018, AAAI.

[11]  Leto Peel,et al.  The ground truth about metadata and community detection in networks , 2016, Science Advances.

[12]  Jing Liu,et al.  A Multiobjective Evolutionary Algorithm Based on Similarity for Community Detection From Signed Social Networks , 2014, IEEE Transactions on Cybernetics.

[13]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[14]  Maoguo Gong,et al.  Enhancing community integrity of networks against multilevel targeted attacks. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Pinyi Ren,et al.  Epidemic Information Dissemination in Mobile Social Networks With Opportunistic Links , 2015, IEEE Transactions on Emerging Topics in Computing.

[16]  Zenglin Xu,et al.  Bayesian Nonparametric Models for Multiway Data Analysis , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Carey E. Priebe,et al.  Community Detection and Classification in Hierarchical Stochastic Blockmodels , 2015, IEEE Transactions on Network Science and Engineering.

[18]  Z. Wang,et al.  The structure and dynamics of multilayer networks , 2014, Physics Reports.

[19]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[20]  Maoguo Gong,et al.  Complex Network Clustering by Multiobjective Discrete Particle Swarm Optimization Based on Decomposition , 2014, IEEE Transactions on Evolutionary Computation.

[21]  Cristopher Moore,et al.  Community detection, link prediction, and layer interdependence in multilayer networks , 2017, Physical review. E.

[22]  Shiliang Sun,et al.  A bayesian network approach to traffic flow forecasting , 2006, IEEE Transactions on Intelligent Transportation Systems.

[23]  Philip S. Yu,et al.  A Survey of Heterogeneous Information Network Analysis , 2015, IEEE Transactions on Knowledge and Data Engineering.

[24]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  A. Arenas,et al.  Models of social networks based on social distance attachment. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[27]  Mark E. J. Newman,et al.  Structure and inference in annotated networks , 2015, Nature Communications.

[28]  Charu C. Aggarwal,et al.  An Ensemble Approach to Link Prediction , 2017, IEEE Transactions on Knowledge and Data Engineering.

[29]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Qi Xuan,et al.  Social Synchrony on Complex Networks , 2018, IEEE Transactions on Cybernetics.

[31]  Cristopher Moore,et al.  Accuracy and scaling phenomena in Internet mapping. , 2004, Physical review letters.

[32]  Bo Yang,et al.  Stochastic Blockmodeling and Variational Bayes Learning for Signed Network Analysis , 2017, IEEE Transactions on Knowledge and Data Engineering.

[33]  Jukka-Pekka Onnela,et al.  Community Structure in Time-Dependent, Multiscale, and Multiplex Networks , 2009, Science.

[34]  Deying Li,et al.  Mutual-Relationship-Based Community Partitioning for Social Networks , 2014, IEEE Transactions on Emerging Topics in Computing.

[35]  Weixiong Zhang,et al.  Joint Identification of Network Communities and Semantics via Integrative Modeling of Network Topologies and Node Contents , 2017, AAAI.

[36]  Mark E. J. Newman,et al.  An efficient and principled method for detecting communities in networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[38]  B. Kholodenko,et al.  The dynamic control of signal transduction networks in cancer cells , 2015, Nature Reviews Cancer.

[39]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[40]  Xun Li,et al.  Reconstruction of stochastic temporal networks through diffusive arrival times , 2017, Nature Communications.

[41]  Jian Peng,et al.  Variational Inference for Crowdsourcing , 2012, NIPS.

[42]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[43]  Roger Guimerà,et al.  Missing and spurious interactions and the reconstruction of complex networks , 2009, Proceedings of the National Academy of Sciences.

[44]  M. E. J. Newman,et al.  Network structure from rich but noisy data , 2017, Nature Physics.

[45]  David M Blei,et al.  Efficient discovery of overlapping communities in massive networks , 2013, Proceedings of the National Academy of Sciences.

[46]  P. Killworth,et al.  Informant Accuracy in Social Network Data , 1976 .