A Mean-Field Variational Bayesian Approach to Detecting Overlapping Communities with Inner Roles Using Poisson Link Generation

A novel model-based machine-learning approach is presented for the unsupervised and exploratory analysis of node affiliations to overlapping communities with roles in networks. At the heart of our approach is a new Bayesian probabilistic generative model of directed networks, that treats roles as abstract behavioral classes explaining node linking behavior. A generalized weighted instance of directed affiliation modeling rules the strength of node participation in communities with whichever role through Gamma priors. Moreover, link establishment between nodes is governed by a Poisson distribution. The latter is parameterized so that, the stronger the affiliations of two nodes to common communities with respective roles, the more likely it is the formation of a connection. A coordinate-ascent algorithm is designed to implement mean-field variational inference for affiliation analysis and link prediction. A comparative experimentation on real-world networks demonstrates the superiority of our approach in community compactness, link prediction and scalability.

[1]  Gianni Costa,et al.  A Bayesian Hierarchical Approach for Exploratory Analysis of Communities and Roles in Social Networks , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[2]  Salvatore J. Stolfo,et al.  Segmentation and Automated Social Hierarchy Detection through Email Network Analysis , 2009, WebKDD/SNA-KDD.

[3]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[4]  Gianni Costa,et al.  Probabilistic analysis of communities and inner roles in networks: Bayesian generative models and approximate inference , 2013, Social Network Analysis and Mining.

[5]  Tina Eliassi-Rad,et al.  Applying latent dirichlet allocation to group discovery in large graphs , 2009, SAC '09.

[6]  Christos Faloutsos,et al.  HCDF: A Hybrid Community Discovery Framework , 2010, SDM.

[7]  Gianni Costa,et al.  A unified generative Bayesian model for community discovery and role assignment based upon latent interaction factors , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[8]  David M. Blei,et al.  Scalable Recommendation with Hierarchical Poisson Factorization , 2015, UAI.

[9]  Yong-Yeol Ahn,et al.  Topological Cluster Analysis Reveals the Systemic Organization of the Caenorhabditis elegans Connectome , 2011, PLoS Comput. Biol..

[10]  R. Ortale,et al.  Model-Based Collaborative Personalized Recommendation on Signed Social Rating Networks , 2016, ACM Trans. Internet Techn..

[11]  A. Banerjee,et al.  Social Topic Models for Community Extraction , 2008 .

[12]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[13]  Jure Leskovec,et al.  Detecting cohesive and 2-mode communities indirected and undirected networks , 2014, WSDM.

[14]  Jure Leskovec,et al.  Structure and Overlaps of Ground-Truth Communities in Networks , 2014, TIST.

[15]  Silvio Lattanzi,et al.  Affiliation networks , 2009, STOC '09.

[16]  S. Brenner,et al.  The structure of the nervous system of the nematode Caenorhabditis elegans. , 1986, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[17]  John Yen,et al.  An LDA-based Community Structure Discovery Approach for Large-Scale Social Networks , 2007, 2007 IEEE Intelligence and Security Informatics.

[18]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[19]  N. Chatterjee,et al.  Understanding the mind of a worm: hierarchical network structure underlying nervous system function in C. elegans. , 2008, Progress in brain research.