Exorcising the Demon: Angel, Efficient Node-Centric Community Discovery

Community discovery is one of the most challenging tasks in social network analysis. During the last decades, several algorithms have been proposed with the aim of identifying communities in complex networks, each one searching for mesoscale topologies having different and peculiar characteristics. Among such vast literature, an interesting family of Community Discovery algorithms, designed for the analysis of social network data, is represented by overlapping, node-centric approaches. In this work, following such line of research, we propose Angel, an algorithm that aims to lower the computational complexity of previous solutions while ensuring the identification of high-quality overlapping partitions. We compare Angel, both on synthetic and real-world datasets, against state of the art community discovery algorithms designed for the same community definition. Our experiments underline the effectiveness and efficiency of the proposed methodology, confirmed by its ability to constantly outperform the identified competitors.

[1]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[2]  Tomas Olovsson,et al.  A local seed selection algorithm for overlapping community detection , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[3]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Santo Fortunato,et al.  Community detection in networks: Structural communities versus ground truth , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Dino Pedreschi,et al.  Node-centric Community Discovery: From static to dynamic social network analysis , 2017, Online Soc. Networks Media.

[6]  John E. Hopcroft,et al.  Use of Local Group Information to Identify Communities in Networks , 2015, ACM Trans. Knowl. Discov. Data.

[7]  Dino Pedreschi,et al.  DEMON: a local-first discovery method for overlapping communities , 2012, KDD.

[8]  Jianjun Li A two-step rejection procedure for testing multiple hypotheses , 2008 .

[9]  Dino Pedreschi,et al.  A classification for community discovery methods in complex networks , 2011, Stat. Anal. Data Min..

[10]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Giulio Rossetti,et al.  CDLIB: a python library to extract, compare and evaluate communities from complex networks , 2019, Applied Network Science.

[12]  Giulio Rossetti,et al.  A Novel Approach to Evaluate Community Detection Algorithms on Ground Truth , 2016, CompleNet.

[13]  Dino Pedreschi,et al.  Community-centric analysis of user engagement in Skype social network , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[14]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[15]  R. Karp,et al.  Algorithms for graph partitioning on the planted partition model , 2001 .

[16]  Boleslaw K. Szymanski,et al.  Towards Linear Time Overlapping Community Detection in Social Networks , 2012, PAKDD.

[17]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[18]  Michalis Vazirgiannis,et al.  Clustering and Community Detection in Directed Networks: A Survey , 2013, ArXiv.

[19]  Silvio Lattanzi,et al.  Ego-Splitting Framework: from Non-Overlapping to Overlapping Clusters , 2017, KDD.

[20]  Giulio Rossetti,et al.  Community Discovery in Dynamic Networks , 2017, ACM Comput. Surv..