Statistical clustering of temporal networks through a dynamic stochastic block model

Statistical node clustering in discrete-time dynamic networks is an emerging field that raises many challenges. Here, we explore statistical properties and frequentist inference in a model that combines a stochastic block model (SBM) for its static part with independent Markov chains for the evolution of the node groups through time. We model binary data as well as weighted dynamic random graphs (with discrete or continuous edge values). Our approach, motivated by the importance of controlling for label switching issues across the different time steps, focuses on detecting groups characterized by a stable within-group connectivity behavior. We study identifiability of the model parameters and propose an inference procedure based on a variational expectation maximization algorithm, as well as a model selection criterion to select the number of groups. We carefully discuss our initialization strategy, which plays an important role in the method, and compare our procedure with existing ones on synthetic datasets. We also illustrate our approach on three dynamic contact networks: one of encounters among high school students and two of animal interactions. An implementation of the method is available as an R package called dynsbm.
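
To make the generative side of the model concrete, here is a minimal simulation sketch in Python for the binary, undirected case. It is an illustration under stated assumptions, not the dynsbm implementation: the function name `simulate_dynamic_sbm` and all parameter names are hypothetical, each node's group is assumed to follow its own Markov chain with a shared transition matrix, and the example keeps the within-group (diagonal) connection probabilities constant over time as one reading of the "stable within-group connectivity" constraint.

```python
import random


def simulate_dynamic_sbm(n_nodes, n_steps, init_probs, transition, connect, seed=0):
    """Simulate binary undirected graphs from a dynamic SBM (illustrative sketch).

    init_probs[q]      : probability that a node starts in group q
    transition[q][l]   : probability of moving from group q to group l
    connect[t][q][l]   : edge probability at time t between groups q and l
    All names are hypothetical and chosen for this example only.
    """
    rng = random.Random(seed)
    groups = list(range(len(init_probs)))

    # Latent part: one independent Markov chain per node.
    current = rng.choices(groups, weights=init_probs, k=n_nodes)
    memberships = [current]
    for _ in range(1, n_steps):
        current = [rng.choices(groups, weights=transition[q])[0] for q in current]
        memberships.append(current)

    # Observed part: at each time step, a static SBM given the memberships.
    graphs = []
    for t in range(n_steps):
        z = memberships[t]
        adj = [[0] * n_nodes for _ in range(n_nodes)]
        for i in range(n_nodes):
            for j in range(i + 1, n_nodes):
                edge = 1 if rng.random() < connect[t][z[i]][z[j]] else 0
                adj[i][j] = adj[j][i] = edge  # undirected, no self-loops
        graphs.append(adj)
    return memberships, graphs


if __name__ == "__main__":
    # Two groups, three time steps; the diagonal (0.8, 0.6) is held fixed
    # across time, while the between-group probability varies.
    connect = [
        [[0.8, b], [b, 0.6]]
        for b in (0.05, 0.10, 0.20)
    ]
    transition = [[0.9, 0.1], [0.2, 0.8]]
    z, g = simulate_dynamic_sbm(10, 3, [0.5, 0.5], transition, connect, seed=42)
    print(len(z), len(g), len(g[0]))
```

The sketch covers simulation only. Inference (the variational EM procedure and the model selection criterion mentioned above) is not reproduced here.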
