A framework of community detection based on individual labels in attribute networks

Abstract Community detection is an important problem for understanding the structure and function of complex networks and has attracted a lot of attention in recent decades. Most community detection algorithms only focus on the topology of networks. However, there is still much valuable information hidden in the networks, such as the attributes or content of the nodes and the useful prior information. Obviously, taking full advantage of these resources can improve the effectiveness of community detection. In this paper, we present a semi-supervised community detection framework named SCDAN (Semi-supervised Community Detection in Attribute Networks), in which a non-negative matrix factorization model is utilized to effectively integrate network topology, node attributes and individual labels simultaneously. The comparative experiments on real-world networks show that SCDAN significantly improves the performance of community detection and provides semantic interpretation of communities.

[1]  Xingyuan Wang,et al.  Model for multi-messages spreading over complex networks considering the relationship between messages , 2017, Commun. Nonlinear Sci. Numer. Simul..

[2]  Koh Takeuchi,et al.  Non-Negative Multiple Matrix Factorization , 2013, IJCAI.

[3]  Siddhartha Bhattacharyya,et al.  Large-Scale Network Analysis for Online Social Brand Advertising , 2016, MIS Q..

[4]  John Scott What is social network analysis , 2010 .

[5]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[6]  Xiaoke Ma,et al.  Semi-supervised clustering algorithm for community structure detection in complex networks , 2010 .

[7]  Xingyuan Wang,et al.  Uncovering overlapping community structures by the key bi-community and intimate degree in bipartite networks , 2014 .

[8]  James Demmel,et al.  Parallel numerical linear algebra , 1993, Acta Numerica.

[9]  Zhong-Yuan Zhang,et al.  Enhanced Community Structure Detection in Complex Networks with Partial Background Information , 2012, Scientific Reports.

[10]  Yun Chi,et al.  Combining link and content for community detection: a discriminative approach , 2009, KDD.

[11]  Xingyuan Wang,et al.  Approximating web communities using subspace decomposition , 2014, Knowl. Based Syst..

[12]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Fei Wang,et al.  Community discovery using nonnegative matrix factorization , 2011, Data Mining and Knowledge Discovery.

[14]  Amos Maritan,et al.  Size and form in efficient transportation networks , 1999, Nature.

[15]  Dong Liu,et al.  Semi-supervised community detection based on discrete potential theory , 2014 .

[16]  Armen E. Allahverdyan,et al.  Community detection with and without prior information , 2009, ArXiv.

[17]  Xing-yuan Wang,et al.  Overlapping community detection using neighborhood ratio matrix , 2015 .

[18]  Xing-yuan Wang,et al.  Detecting one-mode communities in bipartite networks by bipartite clustering triangular , 2016 .

[19]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[20]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[21]  Charu C. Aggarwal,et al.  Community Detection with Edge Content in Social Media Networks , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[22]  M. Feinberg,et al.  Structural Sources of Robustness in Biochemical Reaction Networks , 2010, Science.

[23]  Xingyuan Wang,et al.  Epidemic spreading in time-varying community networks , 2014, Chaos.

[24]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[25]  Han Zhao,et al.  Identifying influential nodes in complex networks with community structure , 2013, Knowl. Based Syst..

[26]  Xingyuan Wang,et al.  A preferential attachment strategy for connectivity link addition strategy in improving the robustness of interdependent networks , 2017 .

[27]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Dong Liu,et al.  Semi-supervised community detection using label propagation , 2014 .

[29]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[30]  Xing-yuan Wang,et al.  Uncovering the overlapping community structure of complex networks by maximal cliques , 2014 .

[31]  Xiaochun Cao,et al.  Improving the Efficiency and Effectiveness of Community Detection via Prior-Induced Equivalent Super-Network , 2017, Scientific Reports.

[32]  Xing-yuan Wang,et al.  Detecting community structure via the maximal sub-graphs and belonging degrees in complex networks , 2014 .

[33]  Xingyuan Wang,et al.  Detecting overlapping communities by seed community in weighted complex networks , 2013 .

[34]  Tina Eliassi-Rad,et al.  Correcting evaluation bias of relational classifiers with network cross validation , 2010, Knowledge and Information Systems.

[35]  Renato D. C. Monteiro,et al.  Group Sparsity in Nonnegative Matrix Factorization , 2012, SDM.

[36]  Xing-yuan Wang,et al.  Detecting communities by the core-vertex and intimate degree in complex networks , 2013 .

[37]  Xing-yuan Wang,et al.  Detecting overlapping communities in networks using the maximal sub-graph and the clustering coefficient , 2014 .

[38]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  Xiaochun Cao,et al.  Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model , 2017, PloS one.

[40]  Xing-yuan Wang,et al.  Asymmetric intimacy and algorithm for detecting communities in bipartite networks , 2016 .

[41]  Xiaohui Yan,et al.  Clustering short text using Ncut-weighted non-negative matrix factorization , 2012, CIKM.

[42]  Xingyuan Wang,et al.  Community detection using local neighborhood in complex networks , 2015 .

[43]  Aram Galstyan,et al.  Statistical mechanics of semi-supervised clustering in sparse graphs , 2011, UAI.

[44]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[45]  Jing Lei,et al.  Network Cross-Validation for Determining the Number of Communities in Network Data , 2014, 1411.1715.

[46]  Xingyuan Wang,et al.  Model of epidemic control based on quarantine and message delivery , 2016, Physica A: Statistical Mechanics and its Applications.

[47]  Dong Liu,et al.  Effective Semisupervised Community Detection Using Negative Information , 2015 .

[48]  Haesun Park,et al.  Sparse Nonnegative Matrix Factorization for Clustering , 2008 .

[49]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[50]  Neil J. Hurley,et al.  Computational Statistics and Data Analysis , 2022 .

[51]  Chao Liu,et al.  Distributed nonnegative matrix factorization for web-scale dyadic data analysis on mapreduce , 2010, WWW '10.

[52]  D. F. Saldana,et al.  How Many Communities Are There? , 2014, 1412.1684.