Community detection in social networks

The expansion of the web and the emergence of a large number of social networking sites (SNS) have made it easy for users to interconnect on a shared platform. A social network can be represented by a graph consisting of a set of nodes and the edges connecting them. The nodes represent individuals or entities, and the edges correspond to the interactions among them. The tendency of people with similar tastes, choices, and preferences to associate with one another in a social network leads to the formation of virtual clusters, or communities. Detecting these communities benefits numerous applications, such as finding a common research area in collaboration networks, finding a set of like-minded users for marketing and recommendations, and finding protein interaction networks in biological networks. Many community-detection algorithms have been proposed in the literature and applied to several domains. This paper surveys the existing algorithms and approaches for detecting communities in social networks and discusses some applications of community detection. WIREs Data Mining Knowl Discov 2016, 6:115–135. doi: 10.1002/widm.1178
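The graph representation described above can be illustrated with a minimal sketch. It is not taken from the paper: the toy edge list, the names `build_graph` and `edge_counts`, and the hand-assigned community labels are all assumptions made for illustration. The sketch stores an undirected network as adjacency sets and counts how many edges fall inside a community versus between communities, which is the informal sense in which a community is a densely connected cluster.

```python
from collections import defaultdict


def build_graph(edges):
    """Build an undirected graph as a dict mapping each node to its neighbour set."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return dict(adj)


def edge_counts(adj, community_of):
    """Return (intra, inter): edges inside a community vs. edges between communities."""
    intra = inter = 0
    for u, neighbours in adj.items():
        for v in neighbours:
            if u < v:  # visit each undirected edge exactly once
                if community_of[u] == community_of[v]:
                    intra += 1
                else:
                    inter += 1
    return intra, inter


# Hypothetical toy network: two triangles joined by a single bridging edge.
edges = [
    ("ann", "bob"), ("bob", "cat"), ("ann", "cat"),
    ("dan", "eve"), ("eve", "fay"), ("dan", "fay"),
    ("cat", "dan"),
]
adj = build_graph(edges)

# Hand-assigned labels (an assumption); a detection algorithm would infer these.
community_of = {"ann": 0, "bob": 0, "cat": 0, "dan": 1, "eve": 1, "fay": 1}

intra, inter = edge_counts(adj, community_of)
print(f"intra-community edges: {intra}, inter-community edges: {inter}")
```

In this toy case most edges stay within a group and only one crosses between groups. The algorithms surveyed in the paper aim to recover such groupings automatically rather than from labels supplied by hand.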
