An Efficient and Effective Overlapping Communities Discovery Based on Agglomerative Graph

Community discovery is a popular way to solve the personal service recommendation problem and has recently attracted more and more attentions of the researchers. The communities are often practically overlapping with each other, thus more and more research focus on the problem of overlapping communities detection. A common drawback of the existing algorithms to this problem is the low efficiency when dealing the large scale network. In this paper, we propose a graph compression based overlapping communities discovery algorithm, which greatly enhances the power of handling large networks even using a single computer. First, a graph compression based social network model, namely agglomerative graph, is introduced, which is a lossless compression to the original network. Then, inspired by the idea of iteration based on the selected seeds, the algorithm expands the selected seeds to the communities by optimizing the proposed community fitness function iteratively. Finally, it merges the communities of high similarity with each other to get the final results. Since the network is lossless compressed, and massive redundant computations are avoided, the results can be exactly obtained in an efficient and effective way. The experiments based on both real and synthetic datasets demonstrate efficiency and effectiveness of the proposal method in detecting overlapping communities over large scale networks.

[1]  Tam'as Vicsek,et al.  Modularity measure of networks with overlapping communities , 2009, 0910.5072.

[2]  Lei Li,et al.  Social context-aware trust inference for trust enhancement in social network based recommendations on service providers , 2013, World Wide Web.

[3]  Zibin Zheng,et al.  Mashup Service Recommendation Based on User Interest and Social Network , 2013, 2013 IEEE 20th International Conference on Web Services.

[4]  Charu C. Aggarwal,et al.  Social Network Data Analytics , 2011 .

[5]  Fergal Reid,et al.  Detecting highly overlapping community structure by greedy clique expansion , 2010, KDD 2010.

[6]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[7]  Kyomin Jung,et al.  LinkSCAN*: Overlapping community detection using the link-space transformation , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[8]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[9]  R. Lambiotte,et al.  Line graphs, link partitions, and overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Mao-Bin Hu,et al.  Detect overlapping and hierarchical community structure in networks , 2008, ArXiv.

[11]  Zibin Zheng,et al.  Trace Norm Regularized Matrix Factorization for Service Recommendation , 2013, 2013 IEEE 20th International Conference on Web Services.

[12]  Liang Chen,et al.  Instant Recommendation for Web Services Composition , 2014, IEEE Transactions on Services Computing.

[13]  Haixun Wang,et al.  Online search of overlapping communities , 2013, SIGMOD '13.