Active semisupervised community detection based on asymmetric similarity measure

Semisupervised community detection algorithms use prior knowledge to improve the discovery of community structure in a complex network. However, obtaining this prior knowledge is expensive and time-consuming in many real-world applications. This paper proposes an active semisupervised community detection algorithm based on the similarities between nodes. First, the algorithm transforms a given complex network into a weighted directed network using the proposed asymmetric similarity measure, and an active mechanism selects some informative nodes to serve as the labeled nodes. Second, the algorithm discovers the community structure by propagating the community labels of the labeled nodes to their neighbors, based on the similarity between a node and a community. Finally, the algorithm is evaluated on three real networks and one synthetic network, and the experimental results show that it performs better than some other community detection algorithms.
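
The pipeline described above (asymmetric weighting, active selection of labeled nodes, label propagation) can be illustrated with a minimal Python sketch. The abstract does not give the paper's formulas, so everything concrete here is an assumption made for illustration: the similarity w(u, v) = |N[u] ∩ N[v]| / |N[u]| over closed neighborhoods, the selection rule (largest total incoming weight), the node-to-community score (sum of weights to already labeled neighbors), and the names `asymmetric_similarity`, `select_queries`, `propagate`, and `oracle` are all hypothetical and are not the authors' method.

```python
from collections import defaultdict

def asymmetric_similarity(adj):
    """Assumed measure: w[u][v] = |N[u] & N[v]| / |N[u]| on closed neighborhoods."""
    closed = {u: set(nbrs) | {u} for u, nbrs in adj.items()}
    return {u: {v: len(closed[u] & closed[v]) / len(closed[u]) for v in adj[u]}
            for u in adj}

def select_queries(weights, budget):
    """Assumed active rule: query the nodes with the largest total incoming weight."""
    incoming = defaultdict(float)
    for out in weights.values():
        for v, w in out.items():
            incoming[v] += w
    # Ties are broken by node id so the result is deterministic.
    return sorted(weights, key=lambda n: (-incoming[n], n))[:budget]

def propagate(weights, seeds):
    """Repeatedly label the unlabeled node with the strongest node-to-community score."""
    labels = dict(seeds)
    while len(labels) < len(weights):
        best = None  # (score, node, community)
        for u in sorted(weights):
            if u in labels:
                continue
            scores = defaultdict(float)
            for v, w in weights[u].items():
                if v in labels:
                    scores[labels[v]] += w
            for community, score in scores.items():
                if best is None or score > best[0]:
                    best = (score, u, community)
        if best is None:
            break  # remaining nodes have no labeled neighbor
        labels[best[1]] = best[2]
    return labels

# Toy network: two triangles (0-1-2 and 3-4-5) joined by the edge 2-3.
adj = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2, 4, 5], 4: [3, 5], 5: [3, 4]}
# Hypothetical ground truth standing in for the expert who answers queries.
oracle = {0: "A", 1: "A", 2: "A", 3: "B", 4: "B", 5: "B"}

weights = asymmetric_similarity(adj)          # weighted directed network
queried = select_queries(weights, budget=2)   # nodes whose labels are requested
labels = propagate(weights, {n: oracle[n] for n in queried})
```

On this toy network the weights are directional (`weights[0][2]` is 1.0 while `weights[2][0]` is 0.75), the two bridge nodes are the ones queried, and propagation recovers the two triangles as separate communities.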
