Exact Blind Community Detection From Signals on Multiple Graphs

Networks and data supported on graphs have become ubiquitous in the sciences and engineering. This paper studies the ‘blind’ community detection problem, where we seek to infer the community structure of a graph model given the observation of independent graph signals on a set of nodes whose connections are unknown. We model each observation as filtered white noise, where the underlying network structure varies with every observation. These varying network structures are modeled as independent realizations of a latent planted partition model (PPM), justifying our assumption of a constant underlying community structure over all observations. Under certain conditions on the graph filter and PPM parameters, we suggest simple algorithms for determining (i) the number of latent communities and (ii) the associated partitions of the PPM. We then prove statistical guarantees in the asymptotic and non-asymptotic sampling cases. Numerical experiments on real and synthetic data demonstrate the efficacy of these algorithms.

[1]  Antonio Ortega,et al.  Graph Learning From Data Under Laplacian and Structural Constraints , 2016, IEEE Journal of Selected Topics in Signal Processing.

[2]  Mark E. J. Newman,et al.  An efficient and principled method for detecting communities in networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Gonzalo Mateos,et al.  Proximal-Gradient Algorithms for Tracking Cascades Over Social Networks , 2014, IEEE Journal of Selected Topics in Signal Processing.

[4]  Elchanan Mossel,et al.  Reconstruction and estimation in the planted partition model , 2012, Probability Theory and Related Fields.

[5]  Santiago Segarra,et al.  Connecting the Dots: Identifying Network Structure via Graph Signal Processing , 2018, IEEE Signal Processing Magazine.

[6]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[7]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[8]  Chandler Davis The rotation of eigenvectors by a perturbation , 1963 .

[9]  Georgios B. Giannakis,et al.  Inference of Gene Regulatory Networks with Sparse Structural Equation Models Exploiting Genetic Perturbations , 2013, PLoS Comput. Biol..

[10]  Vassilis Kalofolias,et al.  How to Learn a Graph from Smooth Signals , 2016, AISTATS.

[11]  John N. Tsitsiklis,et al.  Blind identification of stochastic block models from dynamical observations , 2019, SIAM J. Math. Data Sci..

[12]  S. Rombouts,et al.  Consistent resting-state networks across healthy subjects , 2006, Proceedings of the National Academy of Sciences.

[13]  F. Bunea,et al.  On the sample covariance matrix estimator of reduced effective rank population matrices, with applications to fPCA , 2012, 1212.5321.

[14]  Santiago Segarra,et al.  Network Topology Inference from Spectral Templates , 2016, IEEE Transactions on Signal and Information Processing over Networks.

[15]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Bin Yu,et al.  Spectral clustering and the high-dimensional stochastic blockmodel , 2010, 1007.1684.

[17]  Shahin Shahrampour,et al.  Reconstruction of directed networks from consensus dynamics , 2013, 2013 American Control Conference.

[18]  S. Strogatz Exploring complex networks , 2001, Nature.

[19]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[20]  Christos Boutsidis,et al.  Spectral Clustering via the Power Method - Provably , 2013, ICML.

[21]  Emmanuel Abbe,et al.  Community detection and stochastic block models: recent developments , 2017, Found. Trends Commun. Inf. Theory.

[22]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[23]  Nick S. Jones,et al.  Community detection in networks with unobserved edges , 2018, ArXiv.

[24]  Hoi-To Wai,et al.  Estimating Centrality Blindly From Low-Pass Filtered Graph Signals , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[25]  Anna Scaglione,et al.  Estimating Social Opinion Dynamics Models From Voting Records , 2018, IEEE Transactions on Signal Processing.

[26]  Amit Kumar,et al.  A simple linear time ( 1+ ε)- approximation algorithm for geometric k-means clustering in any dimensions , 2004 .

[27]  Joshua B. Tenenbaum,et al.  Discovering Structure by Learning Sparse Graphs , 2010 .

[28]  Santiago Segarra,et al.  Network inference from consensus dynamics , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[29]  O. Sporns Discovering the Human Connectome , 2012 .

[30]  Georgios B. Giannakis,et al.  Kernel-Based Structural Equation Models for Topology Identification of Directed Networks , 2016, IEEE Transactions on Signal Processing.

[31]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[32]  W. Kahan,et al.  The Rotation of Eigenvectors by a Perturbation. III , 1970 .

[33]  Santiago Segarra,et al.  Network Inference From Consensus Dynamics With Unknown Parameters , 2019, IEEE Transactions on Signal and Information Processing over Networks.

[34]  Santiago Segarra,et al.  Blind Community Detection From Low-Rank Excitations of a Graph Filter , 2018, IEEE Transactions on Signal Processing.

[35]  Pascal Frossard,et al.  Learning Laplacian Matrix in Smooth Graph Signal Representations , 2014, IEEE Transactions on Signal Processing.

[36]  Santiago Segarra,et al.  Optimal Graph-Filter Design and Applications to Distributed Linear Network Operators , 2017, IEEE Transactions on Signal Processing.

[37]  E A Leicht,et al.  Mixture models and exploratory analysis in networks , 2006, Proceedings of the National Academy of Sciences.

[38]  P. Bickel,et al.  Covariance regularization by thresholding , 2009, 0901.3079.

[39]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[40]  Santiago Segarra,et al.  Blind Inference of Centrality Rankings from Graph Signals , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[41]  Roman Vershynin,et al.  Introduction to the non-asymptotic analysis of random matrices , 2010, Compressed Sensing.

[42]  Alejandro Ribeiro,et al.  A Graph Signal Processing Perspective on Functional Brain Imaging , 2018, Proceedings of the IEEE.

[43]  Santiago Segarra,et al.  Spectral Partitioning of Time-varying Networks with Unobserved Edges , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[44]  Ronald L. Rivest,et al.  Introduction to Algorithms, 3rd Edition , 2009 .

[45]  Santiago Segarra,et al.  Blind Identification of Graph Filters , 2016, IEEE Transactions on Signal Processing.

[46]  Reza Olfati-Saber,et al.  Consensus and Cooperation in Networked Multi-Agent Systems , 2007, Proceedings of the IEEE.

[47]  Amit Kumar,et al.  A simple linear time (1 + /spl epsiv/)-approximation algorithm for k-means clustering in any dimensions , 2004, 45th Annual IEEE Symposium on Foundations of Computer Science.

[48]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[49]  Anna Scaglione,et al.  Active Sensing of Social Networks , 2016, IEEE Transactions on Signal and Information Processing over Networks.

[50]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[51]  Santo Fortunato,et al.  Community detection in networks: A user guide , 2016, ArXiv.

[52]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[53]  Mason A. Porter,et al.  Random walks and diffusion on networks , 2016, ArXiv.

[54]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[55]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[56]  Thomas Kailath,et al.  Detection of signals by information theoretic criteria , 1985, IEEE Trans. Acoust. Speech Signal Process..

[57]  Mark Newman,et al.  Networks: An Introduction , 2010 .