Spectral clustering-based network community detection with node attributes

Identifying communities is an important problem in network analysis. Various approaches have been proposed in the literature, but most of them either rely on the topological structure of the network or the node attributes, with few integrating both aspects. Here we propose a community detection approach based on spectral clustering combining information on both the network structure and node attributes (SpcSA). Some of the attributes may not describe the communities we are trying to detect correctly. These irrelevant attributes can add noise and lower the overall accuracy of community detection. To determine how much each attribute contributes to community detection, our method introduces a mechanism by which attribute weights can adjust themselves. We demonstrate the effectiveness of the proposed method through numerical simulation and with real-world data.

[1]  X ZhengAlice,et al.  A Survey of Statistical Network Models , 2010 .

[2]  Zongben Xu,et al.  Community Detection for Clustered Attributed Graphs via a Variational EM Algorithm , 2014, BigDataScience '14.

[3]  Thomas Seidl,et al.  Spectral Subspace Clustering for Graphs with Feature Vectors , 2013, 2013 IEEE 13th International Conference on Data Mining.

[4]  Guido Caldarelli,et al.  Large Scale Structure and Dynamics of Complex Networks: From Information Technology to Finance and Natural Science , 2007 .

[5]  Mikhail Belkin,et al.  Consistency of spectral clustering , 2008, 0804.0678.

[6]  Chieh-Yuan Tsai,et al.  Developing a feature weight self-adjustment mechanism for a K-means clustering algorithm , 2008, Comput. Stat. Data Anal..

[7]  Yuan Zhang,et al.  Community Detection in Networks with Node Features , 2015, Electronic Journal of Statistics.

[8]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[9]  Yuan Zhang,et al.  Detecting Overlapping Communities in Networks Using Spectral Methods , 2014, SIAM J. Math. Data Sci..

[10]  Joshua T. Vogelstein,et al.  Covariate-assisted spectral clustering , 2014, Biometrika.

[11]  Edoardo M. Airoldi,et al.  A Survey of Statistical Network Models , 2009, Found. Trends Mach. Learn..

[12]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[13]  Mark E. J. Newman,et al.  Structure and inference in annotated networks , 2015, Nature Communications.

[14]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[15]  E. Lazega Introduction : Collegial Phenomenon : The Social Mechanisms of Cooperation Among Peers in a Corporate Law Partnership , 2001 .

[16]  Stephen E. Fienberg,et al.  A Brief History of Statistical Models for Network Analysis and Open Challenges , 2012 .

[17]  A. Raftery,et al.  Model‐based clustering for social networks , 2007 .

[18]  St'ephane Robin,et al.  Uncovering latent structure in valued graphs: A variational approach , 2010, 1011.1813.

[19]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Hong Cheng,et al.  Graph Clustering Based on Structural/Attribute Similarities , 2009, Proc. VLDB Endow..

[21]  Hong Cheng,et al.  A model-based approach to attributed graph clustering , 2012, SIGMOD Conference.

[22]  Stephen E. Fienberg Introduction to papers on the modeling and analysis of network data , 2010 .

[23]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Vladimir Batagelj,et al.  Exploratory Social Network Analysis with Pajek , 2005 .

[25]  Bin Yu,et al.  Spectral clustering and the high-dimensional stochastic blockmodel , 2010, 1007.1684.

[26]  Fei Wang,et al.  Integrated KL (K-means - Laplacian) Clustering: A New Clustering Approach by Combining Attribute Data and Pairwise Relations , 2009, SDM.

[27]  Yuguo Chen,et al.  A block model for node popularity in networks with community structure , 2018 .

[28]  Hong Cheng,et al.  Clustering Large Attributed Graphs: A Balance between Structural and Attribute Similarities , 2011, TKDD.

[29]  P. Bickel,et al.  Role of normalization in spectral clustering for stochastic blockmodels , 2013, 1310.1495.

[30]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  Christophe Ambroise,et al.  Clustering based on random graph model embedding vertex features , 2009, Pattern Recognit. Lett..

[32]  Jignesh M. Patel,et al.  Efficient aggregation for graph summarization , 2008, SIGMOD Conference.

[33]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[34]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[35]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.