Distributed community detection in dynamic graphs

Inspired by the increasing interest in self-organizing social opportunistic networks, we investigate the problem of distributed detection of unknown communities in dynamic random graphs. As a formal framework, we consider the dynamic version of the well-studied Planted Bisection Model dyn- G ( n , p , q ) where the node set n ] of the network is partitioned into two unknown communities and, at every time step, each possible edge ( u , v ) is active with probability p if both nodes belong to the same community, while it is active with probability q (with q ? p ) otherwise. We also consider a time-Markovian generalization of this model.We propose a distributed protocol based on the popular Label-Propagation approach and prove that, when the ratio p / q is larger than n b (for an arbitrarily small constant b 0 ), the protocol finds the right "planted" partition in O ( log ? n ) time even when the snapshots of the dynamic graph are sparse and disconnected (i.e., when p = ? ( 1 / n ) ).

[1]  Pietro Liò,et al.  Towards real-time community detection in large networks. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  M. Barber,et al.  Detecting network communities by propagating labels under constraints. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Laurent Massoulié,et al.  The diameter of opportunistic mobile networks , 2007, CoNEXT '07.

[4]  Konstantinos Psounis,et al.  An analytical study of fundamental mobility properties for encounter-based protocols , 2008, Int. J. Auton. Adapt. Commun. Syst..

[5]  Jean-Yves Le Boudec,et al.  Power Law and Exponential Decay of Intercontact Times between Mobile Devices , 2007, IEEE Transactions on Mobile Computing.

[6]  R. Karp,et al.  Algorithms for graph partitioning on the planted partition model , 2001 .

[7]  Marcelo Dias de Amorim,et al.  Performance of Opportunistic Epidemic Routing on Edge-Markovian Dynamic Graphs , 2011, IEEE Transactions on Communications.

[8]  Pan Hui,et al.  Distributed community detection in delay tolerant networks , 2007, MobiArch '07.

[9]  Pierre Fraigniaud,et al.  Parsimonious flooding in dynamic graphs , 2009, PODC '09.

[10]  J. Kleinberg,et al.  Networks, Crowds, and Markets , 2010 .

[11]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Ravi B. Boppana,et al.  Eigenvalues and graph bisection: An average-case analysis , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[13]  Sriram V. Pemmaraju,et al.  On the Analysis of a Label Propagation Algorithm for Community Detection , 2012, ICDCN.

[14]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[15]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[16]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[17]  M. Newman Spread of epidemic disease on networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  T. Murata,et al.  Advanced modularity-specialized label propagation algorithm for detecting communities in networks , 2009, 0910.1154.

[19]  Martin E. Dyer,et al.  The Solution of Some Random NP-Hard Problems in Polynomial Expected Time , 1989, J. Algorithms.

[20]  Andrea E. F. Clementi,et al.  Flooding time in edge-Markovian dynamic graphs , 2008, PODC '08.

[21]  T. Snijders,et al.  Estimation and Prediction for Stochastic Blockmodels for Graphs with Latent Block Structure , 1997 .

[22]  Frank Thomson Leighton,et al.  Graph bisection algorithms with good average case behavior , 1984, Comb..

[23]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[24]  Chen Avin,et al.  How to Explore a Fast-Changing World (Cover Time of a Simple Random Walk on Evolving Graphs) , 2008, ICALP.

[25]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Christophe Diot,et al.  Impact of Human Mobility on Opportunistic Forwarding Algorithms , 2007, IEEE Transactions on Mobile Computing.

[27]  Alexandre Proutière,et al.  Hop limited flooding over dynamic networks , 2011, 2011 Proceedings IEEE INFOCOM.

[28]  M. Newman,et al.  Identifying the role that animals play in their social networks , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[29]  Gennaro Cordasco,et al.  Label propagation algorithm: a semi-synchronous approach , 2012, Int. J. Soc. Netw. Min..

[30]  Paolo Santi,et al.  A measurement-based study of beaconing performance in IEEE 802.11p vehicular networks , 2012, 2012 Proceedings IEEE INFOCOM.

[31]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[32]  Marcelo Dias de Amorim,et al.  The Accordion Phenomenon: Analysis, Characterization, and Impact on DTN Routing , 2009, IEEE INFOCOM 2009.

[33]  Mark Jerrum,et al.  The Metropolis Algorithm for Graph Bisection , 1998, Discret. Appl. Math..

[34]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Andrea E. F. Clementi,et al.  Information Spreading in Stationary Markovian Evolving Graphs , 2009, IEEE Transactions on Parallel and Distributed Systems.

[36]  V. Latora,et al.  Complex networks: Structure and dynamics , 2006 .