Overlapping communities and roles in networks with node attributes: Probabilistic graphical modeling, Bayesian formulation and variational inference

Abstract Community and role discovery are key tasks in network analysis. The former unveils the organization of a network, whereas the latter highlights the social functions of nodes. The integration of community discovery and role analysis has been investigated, to gain a deeper understanding of topology, i.e., the social functions fulfilled by nodes to pursue community purposes. However, hitherto, node attributes and behavioral role patterns have been ignored in the combination of both tasks. In this manuscript, we study the seamless integration of community discovery and behavioral role analysis, in the domain of networks with node attributes. In particular, we focus on unifying the two tasks, by explicitly harnessing node attributes and behavioral role patterns in a principled manner. To this end, we propose two Bayesian probabilistic generative models of networks, whose novelty consists in the interrelationship of overlapping communities, roles, their behavioral patterns and node attributes. The devised models allow for a variety of exploratory, descriptive and predictive tasks. These are carried out through mean-field variational inference, which is in turn mathematically derived and implemented into a coordinate-ascent algorithm. A wide spectrum of experiments is designed, to validate the devised models against three classes of state-of-the-art competitors using various real-world benchmark data sets from different social networking services. Our models are found to be more accurate in community detection, link prediction and attribute prediction. Notably, the gain in accuracy is robust to perturbations in the form of noise or lack of observations in either network structure or node attributes. Beside accuracy, scalability is also comparatively investigated. Finally, a qualitative demonstration of the tasks enabled by our models is developed, in which node roles are intuitively explained through an unprecedented visual representation.

[1]  Gianni Costa,et al.  Probabilistic analysis of communities and inner roles in networks: Bayesian generative models and approximate inference , 2013, Social Network Analysis and Mining.

[2]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[3]  David M. Blei,et al.  Variational Inference: A Review for Statisticians , 2016, ArXiv.

[4]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[5]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[7]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[8]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Phipps Arabie,et al.  Constructing blockmodels: How and why , 1978 .

[10]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[11]  R. Ortale,et al.  Model-Based Collaborative Personalized Recommendation on Signed Social Rating Networks , 2016, ACM Trans. Internet Techn..

[12]  Ling Huang,et al.  Joint Link Prediction and Attribute Inference Using a Social-Attribute Network , 2014, TIST.

[13]  Christopher M. Bishop,et al.  Model-based machine learning , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[14]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[15]  T. S. Evans,et al.  Clique graphs and overlapping communities , 2010, ArXiv.

[16]  Alex Pothen,et al.  PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS* , 1990 .

[17]  Gianni Costa,et al.  Mining Overlapping Communities and Inner Role Assignments through Bayesian Mixed-Membership Models of Networks with Context-Dependent Interactions , 2018, ACM Trans. Knowl. Discov. Data.

[18]  Renaud Lambiotte,et al.  Line graphs of weighted networks for overlapping communities , 2010 .

[19]  R. Lambiotte,et al.  Line graphs, link partitions, and overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[21]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[22]  H. White,et al.  “Structural Equivalence of Individuals in Social Networks” , 2022, The SAGE Encyclopedia of Research Design.

[23]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[24]  Ryan A. Rossi,et al.  Role Discovery in Networks , 2014, IEEE Transactions on Knowledge and Data Engineering.

[25]  S. Wasserman,et al.  Building stochastic blockmodels , 1992 .

[26]  Amir Hossein Rasekh,et al.  Application of data mining methods for link prediction in social networks , 2013, Social Network Analysis and Mining.

[27]  Zoubin Ghahramani,et al.  Probabilistic machine learning and artificial intelligence , 2015, Nature.

[28]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[29]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[30]  Michalis Vazirgiannis,et al.  Clustering and Community Detection in Directed Networks: A Survey , 2013, ArXiv.

[31]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[32]  Youngdo Kim,et al.  Map equation for link communities. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.