Local community detection algorithm based on links and content

Community detection is an important field in research of social networks. There exist a lot of algorithms which most of them are based on the density of connections between groups of nodes. On the one hand, the error and lack of links may lead to great impact on the result of community detection. On the other hand, there are users with deep relation but without much communication, so the density of connections can't represent whether the users belong to the same community or not. With the network becoming more and more complicated, the traditional global method will cost much time and space. In this paper, we proposed a local method based on links and content, and the method focuses on particular users' communities. The results on Enron email dataset have shown the superior performance and accuracy rate of our proposed method in community detection.

[1]  Steve Gregory,et al.  Finding overlapping communities in networks by label propagation , 2009, ArXiv.

[2]  Bernhard Flury Book ReviewAlgorithms for clustering data : Anil K. Jain and Richard C. Dubes Prentice Hall Advanced Reference Series in Computer Science Prentice Hall, Englewood Cliffs, NJ, 1988 , 1989 .

[3]  Srinivasan Parthasarathy,et al.  Efficient community detection in large networks using content and links , 2012, WWW.

[4]  Kevin J. Lang,et al.  Communities from seed sets , 2006, WWW '06.

[5]  S.,et al.  An Efficient Heuristic Procedure for Partitioning Graphs , 2022 .

[6]  D. Spielman,et al.  Spectral partitioning works: planar graphs and finite element meshes , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[7]  Mason A. Porter,et al.  Communities in Networks , 2009, ArXiv.

[8]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[9]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[10]  A. Clauset Finding local community structure in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[12]  Ramesh Nallapati,et al.  Joint latent topic models for text and citations , 2008, KDD.

[13]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[14]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[15]  Eric P. Xing,et al.  Spatial compactness meets topical consistency: jointly modeling links and content for community detection , 2014, WSDM.

[16]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[18]  Yun Chi,et al.  Combining link and content for community detection: a discriminative approach , 2009, KDD.

[19]  Shahram Khadivi,et al.  Improving the quality of overlapping community detection through link addition based on topic similarity , 2015, 2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP).

[20]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[21]  E A Leicht,et al.  Community structure in directed networks. , 2007, Physical review letters.

[22]  Inderjit S. Dhillon,et al.  Overlapping community detection using seed set expansion , 2013, CIKM.

[23]  Jon M. Kleinberg,et al.  Community membership identification from small seed sets , 2014, KDD.

[24]  Rong Ge,et al.  Joint cluster analysis of attribute and relationship data withouta-priori specification of the number of clusters , 2007, KDD '07.