Distributed community detection: Finding neighborhoods in a complex world using synthetic coordinates

In this paper, we propose an algorithm that finds the entire community structure of a network, based on local interactions between neighboring nodes and on an unsupervised centralized clustering algorithm. The novelty of the proposed approach is the fact that the algorithm is based on the use of network coordinates computed by a distributed algorithm. The current paper not only presents an efficient distributed community finding algorithm, but also demonstrates that network coordinates could be used to derive efficient solutions to a variety of problems. Experimental results and comparisons with other methods from literature are presented for a variety of benchmark graphs with known community structure, derived by varying a number of graph parameters.

[1]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[2]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Yannis Manolopoulos,et al.  CDNs Content Outsourcing via Generalized Communities , 2009, IEEE Transactions on Knowledge and Data Engineering.

[4]  Robert Tappan Morris,et al.  Vivaldi: a decentralized network coordinate system , 2004, SIGCOMM '04.

[5]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[7]  Yiannis Kompatsiaris,et al.  Bridge Bounding: A Local Approach for Efficient Community Discovery in Complex Networks , 2009, 0902.0871.

[8]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[9]  Charu C. Aggarwal,et al.  Graph Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.

[10]  Xiuzhen Huang,et al.  K-Means Clustering Algorithms: Implementation and Comparison , 2007 .

[11]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[12]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[13]  Li Wei,et al.  Network Traffic Classification Using K-means Clustering , 2007 .

[14]  T. Vicsek,et al.  Clique percolation in random networks. , 2005, Physical review letters.

[15]  Hillol Kargupta,et al.  Approximate Distributed K-Means Clustering over a Peer-to-Peer Network , 2009, IEEE Transactions on Knowledge and Data Engineering.

[16]  Hillol Kargupta,et al.  K-Means Clustering Over a Large, Dynamic Network , 2006, SDM.

[17]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.