Accelerating Link Community Detection in Social Networks

Online social networks such as Facebook are among the most popular sites on the Internet. These networks typically have millions or even billions of registered users who interact with one another, producing highly connected communities built around friendship, family, or occupation. Communities in social networks usually overlap and are often nested, but most overlapping community detection methods have trouble scaling to large networks. In this paper, we propose a fast link clustering (FLC) algorithm to discover link communities. Based on the power-law degree distribution of online social networks, we propose working with a reduced graph that has fewer nodes and links but still captures the key community structure. Experiments demonstrate the efficiency and accuracy of FLC on real networks ranging from small traditional benchmarks to large social networks with ground-truth communities. FLC accurately discovers both network communities and the overlaps between them, and it scales to online social networks with millions of nodes.
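
To make the notion of a link community concrete, the following is a minimal sketch of the neighborhood-based (Jaccard) similarity between adjacent links that is commonly used in the link clustering literature. It is an illustration under stated assumptions, not the FLC algorithm itself: the abstract does not specify FLC's similarity measure or how its reduced graph is built, and the function names here are hypothetical.

```python
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)} from an edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def link_similarity(adj, i, j):
    """Jaccard similarity of the inclusive neighborhoods of nodes i and j.

    Used to compare two links (i, k) and (j, k) that share a node k.
    Assumption: this is the standard link-clustering measure, not
    necessarily the one FLC uses.
    """
    ni = adj[i] | {i}
    nj = adj[j] | {j}
    return len(ni & nj) / len(ni | nj)


def adjacent_link_similarities(edges):
    """Return {(link_a, link_b): similarity} for every pair of links sharing a node."""
    adj = build_adjacency(edges)
    sims = {}
    for k, nbrs in adj.items():
        # Every pair of neighbors (i, j) of k defines two adjacent links.
        for i, j in combinations(sorted(nbrs), 2):
            link_a = tuple(sorted((i, k)))
            link_b = tuple(sorted((j, k)))
            sims[(link_a, link_b)] = link_similarity(adj, i, j)
    return sims


if __name__ == "__main__":
    # A triangle (0, 1, 2) with a pendant node 3 attached to node 2.
    toy_edges = [(0, 1), (1, 2), (0, 2), (2, 3)]
    for pair, s in sorted(adjacent_link_similarities(toy_edges).items()):
        print(pair, round(s, 3))
```

Grouping links by such a similarity (for example with hierarchical clustering) yields link communities; a node belongs to every community that contains one of its links, which is how overlaps arise.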
