Spectral Clustering of Signed Graphs via Matrix Power Means

Signed graphs encode positive (attractive) and negative (repulsive) relations between nodes. We extend spectral clustering to signed graphs via the one-parameter family of Signed Power Mean Laplacians, defined as the matrix power mean of normalized standard and signless Laplacians of positive and negative edges. We provide a thorough analysis of the proposed approach in the setting of a general Stochastic Block Model that includes models such as the Labeled Stochastic Block Model and the Censored Block Model. We show that in expectation the signed power mean Laplacian captures the ground truth clusters under reasonable settings where state-of-the-art approaches fail. Moreover, we prove that the eigenvalues and eigenvector of the signed power mean Laplacian concentrate around their expectation under reasonable conditions in the general Stochastic Block Model. Extensive experiments on random graphs and real world datasets confirm the theoretically predicted behaviour of the signed power mean Laplacian and show that it compares favourably with state-of-the-art methods.

[1]  Inderjit S. Dhillon,et al.  Scalable clustering of signed networks using balance normalized cut , 2012, CIKM.

[2]  Laurent Massoulié,et al.  Edge Label Inference in Generalized Stochastic Block Models: from Spectral Theory to Impossibility Results , 2014, COLT.

[3]  Jure Leskovec,et al.  Signed networks in social media , 2010, CHI.

[4]  Charu C. Aggarwal,et al.  A Survey of Signed Network Mining in Social Media , 2015, ACM Comput. Surv..

[5]  João Ricardo Sato,et al.  Functional clustering of time series gene expression data by Granger causality , 2012, BMC Systems Biology.

[6]  Martin Stoll,et al.  Node classification for signed networks using diffuse interface methods , 2018, ArXiv.

[7]  Sahin Albayrak,et al.  Spectral Analysis of Signed Graphs for Clustering, Prediction and Visualization , 2010, SDM.

[8]  Dario Fasino,et al.  A modularity based spectral method for simultaneous community and anti-community detection , 2017, ArXiv.

[9]  Bin Yu,et al.  Spectral clustering and the high-dimensional stochastic blockmodel , 2010, 1007.1684.

[10]  Peter Davies,et al.  SPONGE: A generalized eigenproblem for clustering signed networks , 2019, AISTATS.

[11]  Subhadeep Paul,et al.  Spectral and matrix factorization methods for consistent community detection in multi-layer networks , 2017, The Annals of Statistics.

[12]  Jure Leskovec,et al.  {SNAP Datasets}: {Stanford} Large Network Dataset Collection , 2014 .

[13]  Yang Xiang,et al.  SNE: Signed Network Embedding , 2017, PAKDD.

[14]  Dimitris K. Tasoulis,et al.  Financial forecasting through unsupervised clustering and neural networks , 2006, Oper. Res..

[15]  Florent Krzakala,et al.  Spectral Clustering of graphs with the Bethe Hessian , 2014, NIPS.

[16]  Charu C. Aggarwal,et al.  Signed Network Embedding in Social Media , 2017, SDM.

[17]  Nagarajan Natarajan,et al.  Exploiting longer cycles for link prediction in signed networks , 2011, CIKM '11.

[18]  Laurent Massoulié,et al.  Community Detection in the Labelled Stochastic Block Model , 2012, ArXiv.

[19]  F. Harary,et al.  STRUCTURAL BALANCE: A GENERALIZATION OF HEIDER'S THEORY1 , 1977 .

[20]  Matthias Hein,et al.  Clustering Signed Networks with the Geometric Mean of Laplacians , 2016, NIPS.

[21]  YU BIN,et al.  IMPACT OF REGULARIZATION ON SPECTRAL CLUSTERING , 2016 .

[22]  Fan Chung Graham,et al.  On the Spectra of General Random Graphs , 2011, Electron. J. Comb..

[23]  Tai Qin,et al.  Regularized Spectral Clustering under the Degree-Corrected Stochastic Blockmodel , 2013, NIPS.

[24]  Florent Krzakala,et al.  Spectral detection in the censored block model , 2015, 2015 IEEE International Symposium on Information Theory (ISIT).

[25]  Shiping Liu,et al.  Multi-way dual Cheeger constants and spectral bounds of graphs , 2014, 1401.3147.

[26]  Claudio Gentile,et al.  On the Troll-Trust Model for Edge Sign Prediction in Social Networks , 2016, AISTATS.

[27]  Mahdi Jalili,et al.  Ranking Nodes in Signed Social Networks , 2014, Social Network Analysis and Mining.

[28]  Varun Jog,et al.  Information-theoretic bounds for exact recovery in weighted stochastic block models using the Renyi divergence , 2015, ArXiv.

[29]  Jean Gallier,et al.  Spectral Theory of Unsigned and Signed Graphs. Applications to Graph Clustering: a Survey , 2016, ArXiv.

[30]  R. Bhatia Positive Definite Matrices , 2007 .

[31]  A. Rinaldo,et al.  Consistency of spectral clustering in stochastic block models , 2013, 1312.2050.

[32]  P. Bickel,et al.  Role of normalization in spectral clustering for stochastic blockmodels , 2013, 1310.1495.

[33]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[34]  Mihai Cucuringu,et al.  An MBO scheme for clustering and semi-supervised clustering of signed networks , 2019, Communications in Mathematical Sciences.

[35]  Alexandre Proutière,et al.  Optimal Cluster Recovery in the Labeled Stochastic Block Model , 2015, NIPS.

[36]  Fan Chung Graham,et al.  Spectral Clustering of Graphs with General Degrees in the Extended Planted Partition Model , 2012, COLT.

[37]  Amit Singer,et al.  Decoding Binary Node Labels from Censored Edge Measurements: Phase Transition and Efficient Recovery , 2014, IEEE Transactions on Network Science and Engineering.

[38]  James A. Davis Clustering and Structural Balance in Graphs , 1977 .

[39]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[40]  S. Sethuraman,et al.  Consistency of modularity clustering on random geometric graphs , 2016, The Annals of Applied Probability.

[41]  Junghwan Kim,et al.  SIDE: Representation Learning in Signed Directed Networks , 2018, WWW.

[42]  Madhav Desai,et al.  A characterization of the smallest eigenvalue of a graph , 1994, J. Graph Theory.

[43]  Edoardo M. Airoldi,et al.  Consistent estimation of dynamic and multi-layer block models , 2014, ICML.

[44]  F. Harary On the notion of balance of a signed graph. , 1953 .

[45]  Andrew V. Knyazev,et al.  On spectral partitioning of signed graphs , 2017, CSC.

[46]  R. Subramanian,et al.  Inequalities between means of positive operators , 1978, Mathematical Proceedings of the Cambridge Philosophical Society.

[47]  Emmanuel Abbe,et al.  Community detection and stochastic block models: recent developments , 2017, Found. Trends Commun. Inf. Theory.

[48]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[49]  Daniel A. Keim,et al.  Visual market sector analysis for financial time series data , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[50]  Joel A. Tropp,et al.  An Introduction to Matrix Concentration Inequalities , 2015, Found. Trends Mach. Learn..

[51]  Dean P. Foster,et al.  Semantic Word Clusters Using Signed Spectral Clustering , 2017, ACL.

[52]  Christos Faloutsos,et al.  Edge Weight Prediction in Weighted Signed Networks , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[53]  Charu C. Aggarwal,et al.  Node Classification in Signed Social Networks , 2016, SDM.

[54]  Matthias Hein,et al.  The Power Mean Laplacian for Multilayer Graph Clustering , 2018, AISTATS.

[55]  Patrick Doreian,et al.  Partitioning signed social networks , 2009, Soc. Networks.

[56]  George T. Cantwell,et al.  Balance in signed networks , 2018, Physical review. E.

[57]  Avrim Blum,et al.  Correlation Clustering , 2004, Machine Learning.

[58]  Fan Chung Graham,et al.  Dirichlet PageRank and Ranking Algorithms Based on Trust and Distrust , 2013, Internet Math..

[59]  Jiliang Tang,et al.  Signed Graph Convolutional Network , 2018, ArXiv.

[60]  Venkatesan Guruswami,et al.  Correlation clustering with a fixed number of clusters , 2005, SODA '06.

[61]  Martin Stoll,et al.  Node Classification for Signed Social Networks Using Diffuse Interface Methods , 2018, ECML/PKDD.

[62]  Po-Ling Loh,et al.  Optimal rates for community estimation in the weighted stochastic block model , 2017, The Annals of Statistics.

[63]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[64]  Tengyao Wang,et al.  A useful variant of the Davis--Kahan theorem for statisticians , 2014, 1405.0680.

[65]  P. Bullen Handbook of means and their inequalities , 1987 .