Overlapping Community and Node Discovery Algorithm Based on Edge Similarity

Most of community detection algorithms are designed from the perspective of nodes, which usually neglect the overlapping structure in networks. Whereas some of them hold the weakness of high complexity, inaccuracy and low stability. To solve the above issues, an overlapping community and node discovery algorithm based on edge similarity is proposed. In this paper, is established according to the incidence matrix. Then the algorithm is proceeding on line graph and finally the community detection results are restored to the original network, thus overlapping community and nodes are discovered. Several experiments are carried out on different datasets, demonstrating that the proposed algorithm is effective. Introduction Most of the existing community detection algorithms [1] in complex network [2] are designed from the perspective of nodes, in which a certain node can be divided into only one community. However, in real networks, some nodes usually belong to several communities, which are known as overlapping nodes [3] (or overlapping community [4]).For the overlapping nodes are related to many communities simultaneously, they usually act as a bridge between different communities and play an important role in networks. In the face of increasing scale of networks, further research on overlapping community and node discovery is of great significance. Mining overlapping nodes in networks is usually achieved by discovering overlapping communities [5]. Although these researches had some achievements in the past, it is still lacking. Pallaet al. [6] proposed the famous Clique Percolation Method in 2005, however, the main problem is that a parameter k is difficult to determine. Gregory put forward the CONGA [7] algorithm, where as the effect of the algorithm heavily depends on the empirical parameters. Generally, as for the current algorithm of overlapping nodes detection, there are many problems such as high complexity, low accuracy etc. To solve the above issues, this paper proposes an overlapping community and node discovery algorithm based on edge similarity. The object of study is transformed from node to edge, and then the overlapping nodes are discovered by the conversion between the line graph and the original network. Algorithm Description Line Graph Usually, the network is represented by the adjacency matrix of the node, but the adjacency matrix can only describe the relation between the nodes in the network graph. In order to express the relation between two edges, it is necessary to be mutually transformed with the incidence matrix.

[1]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[2]  Ruoxi Xiang,et al.  Complex network analysis of time series , 2014 .

[3]  R. Lambiotte,et al.  Line graphs, link partitions, and overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Stephen Roberts,et al.  Overlapping community detection using Bayesian non-negative matrix factorization. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  T. Murata,et al.  Advanced modularity-specialized label propagation algorithm for detecting communities in networks , 2009, 0910.1154.

[6]  Anne Gatchell,et al.  Link Prediction in Social Networks 1 , 2013 .

[7]  A. Barabasi,et al.  Quantifying social group evolution , 2007, Nature.

[8]  Steve Gregory,et al.  An Algorithm to Find Overlapping Community Structure in Networks , 2007, PKDD.

[9]  Adrien Todeschini,et al.  Exchangeable random measures for sparse and modular graphs with overlapping communities , 2016, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[10]  H E Stanley,et al.  Classes of small-world networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Sahil Kakkar,et al.  Discovering overlapping community structure in networks through co-clustering , 2016, 2016 International Conference on Inventive Computation Technologies (ICICT).

[12]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.