Community detection in social network with pairwisely constrained symmetric non-negative matrix factorization

Non-negative Matrix Factorization (NMF) aims to find two non-negative matrices whose product approximates the original matrix well, and is widely used in clustering condition with good physical interpretability and universal applicability. Detecting communities with NMF can keep non-negative network physical definition and effectively capture communities-based structure in the low dimensional data space. However some NMF methods in community detection did not concern with more network inner structures or existing ground-truth community information. In this paper, we propose a novel pairwisely constrained non-negative symmetric matrix factorization (PCSNMF) method, which not only consider symmetric community structures of undirected network, but also takes into consideration the pairwise constraints generated from some ground-truth group information to enhance the community detection. We compare our approaches with other NMF-based methods in three social networks, and experimental results for community detection show that our approaches are all feasible and achieve better community detection results.

[1]  Fei Wang,et al.  Community discovery using nonnegative matrix factorization , 2011, Data Mining and Knowledge Discovery.

[2]  CaiDeng,et al.  Constrained Nonnegative Matrix Factorization for Image Representation , 2012 .

[3]  Mingwei Leng,et al.  Active Semi-supervised Community Detection Algorithm with Label Propagation , 2013, DASFAA.

[4]  Thomas S. Huang,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[5]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Chris H. Q. Ding,et al.  Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization , 2008, SIGIR '08.

[7]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Christine Nardini,et al.  LEARNING OVERLAPPING COMMUNITIES IN COMPLEX NETWORKS VIA NON-NEGATIVE MATRIX FACTORIZATION , 2011 .

[9]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[10]  Yixin Cao,et al.  Identifying overlapping communities as well as hubs and outliers via nonnegative matrix factorization , 2013, Scientific Reports.

[11]  Rong Jin,et al.  Semi-supervised Learning with Weakly-Related Unlabeled Data: Towards Better Text Categorization , 2008, NIPS.

[12]  Bao-Gang Hu,et al.  Pairwise Constraints-Guided Non-negative Matrix Factorization for Document Clustering , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[13]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Zhong-Yuan Zhang,et al.  Enhanced Community Structure Detection in Complex Networks with Partial Background Information , 2012, Scientific Reports.

[15]  Xiaoke Ma,et al.  Semi-supervised clustering algorithm for community structure detection in complex networks , 2010 .

[16]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[17]  Fan Chung Graham,et al.  Local Graph Partitioning using PageRank Vectors , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[18]  Pierre Hansen,et al.  Improving heuristics for network modularity maximization using an exact algorithm , 2011, Discret. Appl. Math..

[19]  Zhong-Yuan Zhang,et al.  Enhanced Community Structure Detection in Complex Networks with Partial Background Information , 2013, Scientific reports.

[20]  Jure Leskovec,et al.  Defining and evaluating network communities based on ground-truth , 2012, Knowledge and Information Systems.

[21]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[22]  Yihong Gong,et al.  Document clustering by concept factorization , 2004, SIGIR '04.

[23]  Chris H. Q. Ding,et al.  Symmetric Nonnegative Matrix Factorization for Graph Clustering , 2012, SDM.

[24]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[25]  A Díaz-Guilera,et al.  Self-similar community structure in a network of human interactions. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[27]  Weiyi Meng,et al.  Improving Performance of Web Services Query Matchmaking with Automated Knowledge Acquisition , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).