Community Detection on Evolving Graphs

Clustering is a fundamental step in many information-retrieval and data-mining applications. Detecting clusters in graphs is also a key tool for finding the community structure in social and behavioral networks. In many of these applications, the input graph evolves over time in a continual and decentralized manner, so a clustering algorithm must repeatedly probe the graph to maintain a good clustering. The frequency of such probes is often limited, either explicitly by the online platform (e.g., when crawling proprietary social networks such as Twitter) or implicitly by resource constraints (e.g., when crawling the web). In this paper, we study a model of clustering on evolving graphs that captures this aspect of the problem. Our model is based on the classical stochastic block model, which has been used to rigorously assess the quality of various static clustering methods. In our model, the algorithm must reconstruct the planted clustering, given only the ability to query for small pieces of local information about the graph at a limited rate. We design and analyze clustering algorithms that work in this model, and show asymptotically tight upper and lower bounds on their accuracy. Finally, we perform simulations, which demonstrate that our main asymptotic results also hold in practice.
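To make the setting concrete, the following is a minimal sketch of a static stochastic block model with a planted clustering, together with a probe interface that enforces a query budget. It is an illustration under stated assumptions, not the model analyzed in the paper: the names `sample_sbm` and `ProbeOracle`, the parameters `p` and `q`, and the choice of "return one node's neighbors" as the unit of local information are all hypothetical, and the evolution of the graph over time is not modeled here.

```python
import random


def sample_sbm(n, k, p, q, seed=0):
    """Sample a planted-partition graph on n nodes with k clusters.

    Each node gets a uniformly random cluster label. An edge appears with
    probability p inside a cluster and with probability q across clusters.
    (Parameter names are assumptions made for this sketch.)
    """
    rng = random.Random(seed)
    labels = [rng.randrange(k) for _ in range(n)]
    adj = {v: set() for v in range(n)}
    for u in range(n):
        for v in range(u + 1, n):
            prob = p if labels[u] == labels[v] else q
            if rng.random() < prob:
                adj[u].add(v)
                adj[v].add(u)
    return labels, adj


class ProbeOracle:
    """Hypothetical rate-limited access to the graph.

    Each call to neighbors() reveals one node's adjacency list and
    consumes one unit of a fixed probe budget.
    """

    def __init__(self, adj, budget):
        self._adj = adj
        self.remaining = budget

    def neighbors(self, v):
        if self.remaining <= 0:
            raise RuntimeError("probe budget exhausted")
        self.remaining -= 1
        return set(self._adj[v])  # copy, so callers cannot mutate the graph


if __name__ == "__main__":
    labels, adj = sample_sbm(n=200, k=4, p=0.3, q=0.02, seed=1)
    oracle = ProbeOracle(adj, budget=10)
    print(len(oracle.neighbors(0)), "neighbors seen;", oracle.remaining, "probes left")
```

A clustering algorithm in this sketch would see only what `ProbeOracle.neighbors` returns and would have to decide which nodes are worth spending probes on, which is the trade-off the abstract describes.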
