A semantic overlapping community detection algorithm based on field sampling

The LFT could resolve the issue of presetting the number of communities.The link-block cluster method could detect the semantic overlapping communities.The SLW can evaluate the weight of link with multidimensional semantic coordinate.The proposed semantic modularity to evaluate the detected semantic communities. The traditional semantic social network (SSN) community detection algorithms need to preset the number of the communities and could not detect the overlapping communities. To solve the issue of presetting the number of communities, we present a clustering algorithm for community detection based on the link-field-topic (LFT) model suggested. For the process of clustering is independent of context sampling, the number of communities is not necessary to be preset. To solve the issue of overlapping community detection, we establish the semantic link weight (SLW) depending on the analysis of LFT, to evaluate the semantic weight of links for each sampling field. The proposed clustering algorithm is based on the SLW which could separate the SSN into clustering units. As a result, the intersection on several units is the overlapping part. Finally, we establish semantic modularity (SQ) involving SQ1 and SQ2 for the evaluation of the detected semantic communities. The efficiency and feasibility of the LFT model and the semantic modularity is verified by experimental analysis.

[1]  Thomas L. Griffiths,et al.  Learning Systems of Concepts with an Infinite Relational Model , 2006, AAAI.

[2]  Thomas L. Griffiths,et al.  Probabilistic author-topic models for information discovery , 2004, KDD.

[3]  L. Venkata Subramaniam,et al.  Using content and interactions for discovering communities in social networks , 2012, WWW.

[4]  Mao-Bin Hu,et al.  Detect overlapping and hierarchical community structure in networks , 2008, ArXiv.

[5]  L. Venkata Subramaniam,et al.  Probabilistic model for discovering topic based communities in social networks , 2011, CIKM '11.

[6]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[7]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Andrew McCallum,et al.  Group and topic discovery from relations and text , 2005, LinkKDD '05.

[9]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[10]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Junghoo Cho,et al.  Social-network analysis using topic models , 2012, SIGIR '12.

[12]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Dayou Liu,et al.  A Markov random walk under constraint for discovering overlapping communities in complex networks , 2011, ArXiv.

[14]  Christos Faloutsos,et al.  HCDF: A Hybrid Community Discovery Framework , 2010, SDM.

[15]  Deng Cai,et al.  Topic modeling with network regularization , 2008, WWW.

[16]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[17]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Hongyuan Zha,et al.  Probabilistic models for discovering e-communities , 2006, WWW '06.

[19]  Zoubin Ghahramani,et al.  Semi-supervised learning : from Gaussian fields to Gaussian processes , 2003 .

[20]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks , 2005, IJCAI.

[21]  John Yen,et al.  An LDA-based Community Structure Discovery Approach for Large-Scale Social Networks , 2007, 2007 IEEE Intelligence and Security Informatics.

[22]  Steve Gregory,et al.  Finding overlapping communities in networks by label propagation , 2009, ArXiv.

[23]  Jiawei Han,et al.  Latent Community Topic Analysis: Integration of Community Discovery with Topic Modeling , 2012, TIST.

[24]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[25]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[26]  John Yen,et al.  Probabilistic Community Discovery Using Hierarchical Latent Gaussian Mixture Model , 2007, AAAI.

[27]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[28]  John Yen,et al.  HSN-PAM: Finding Hierarchical Probabilistic Groups from Large-Scale Networks , 2007 .

[29]  A. Banerjee,et al.  Social Topic Models for Community Extraction , 2008 .

[30]  Tina Eliassi-Rad,et al.  Applying latent dirichlet allocation to group discovery in large graphs , 2009, SAC '09.