Locating Communities on Real Dataset Graphs Using Synthetic Coordinates

One of the fundamental problems in social networking with a lot of potential applications is to detect effectively the communities that are created by the users' interaction. Other applications, such as finding web communities, uncovering the structure of social networks, or even analyzing a graph's structure to uncover Internet attacks are equally as important. All these problems converge to a common goal, the development of flexible and efficient local community detection algorithms. In this paper, we demonstrate the performance of an algorithm that uncovers the entire community structure of a network, based solely on local interactions between neighboring nodes and a distributed clustering algorithm. The proposed algorithm, named VCD, is based on the distributed computation of a synthetic coordinates for each graph node. Experimental results and comparisons with another method from the literature (Lancichinetti et al.) are presented. The algorithm is also tested on two real dataset graphs from the SNAP: Stanford Large Network Dataset Collection. In all cases the experimental results demonstrate the high performance of our algorithm in terms of accuracy to detect communities, and its computational efficiency.

[1]  Davide Modolo,et al.  Explorative visualization and analysis of a social network for arts: the case of deviantART , 2011 .

[2]  Robert Tappan Morris,et al.  Vivaldi: a decentralized network coordinate system , 2004, SIGCOMM '04.

[3]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[4]  Shiuh-Jeng Wang,et al.  Hierarchical key derivation scheme for group-oriented communication systems , 2010, Int. J. Inf. Technol. Commun. Convergence.

[5]  Paraskevi Fragopoulou,et al.  Distributed community detection: Finding neighborhoods in a complex world using synthetic coordinates , 2011, 2011 IEEE Symposium on Computers and Communications (ISCC).

[6]  Yannis Manolopoulos,et al.  CDNs Content Outsourcing via Generalized Communities , 2009, IEEE Transactions on Knowledge and Data Engineering.

[7]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[9]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[10]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[11]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Charu C. Aggarwal,et al.  Graph Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.