Fast Computation of Modularity in Agglomerative Clustering Methods for Community Discovery

In this paper, we propose a fast method for successive modularity computations in agglomerative clustering for community discovery. Our method is based on a new data structure to maintain two statistical values derived by the definition of modularity between two adjacent (temporary) communities. We prove the correctness and efficiency of the method. Then, we perform it on standard datasets. The experimental results show that our method improves the efficiency significantly.

[1]  Neo D. Martinez,et al.  Food-web structure and network theory: The role of connectance and size , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Guojun Gan,et al.  Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability) , 2007 .

[3]  Pablo M. Gleiser,et al.  Community Structure in Jazz , 2003, Adv. Complex Syst..

[4]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[5]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[6]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .

[7]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[8]  Shujing Li,et al.  A Similarity Measure Method of Nodes for Community Detection , 2011 .

[9]  D J PRICE,et al.  NETWORKS OF SCIENTIFIC PAPERS. , 1965, Science.

[10]  S. Strogatz Exploring complex networks , 2001, Nature.

[11]  Roger Guimerà,et al.  Correction for Sales-Pardo et al., Extracting the hierarchical organization of complex systems , 2007, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Jianhong Wu,et al.  Data clustering - theory, algorithms, and applications , 2007 .

[13]  John Scott Social Network Analysis , 1988 .

[14]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Alex Pothen,et al.  PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS* , 1990 .

[17]  Hosein Alizadeh,et al.  A New Method for Improving the Performance of K Nearest Neighbor using Clustering Technique , 2009, J. Convergence Inf. Technol..

[18]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.