Graph Wavelets for Multiscale Community Mining

We develop a signal processing approach to the multiscale detection of communities in networks, that is of groups of nodes well connected together. The method relies on carefully engineered wavelets on graphs to introduce the notion of scale and to obtain a local view of the graph from each node. Computing the correlations between wavelets centered at different nodes, one has access to a notion of similarity between nodes, thereby enabling a clustering procedure that groups nodes according to their community at the scale of analysis. By using a collection of random vectors to estimate the correlation between the nodes, we show that the method is suitable for the analysis of large graphs. Furthermore, we introduce a notion of partition stability and a statistical test allowing us to assess which scales of analysis of the network are relevant. The effectiveness of the method is discussed first on multiscale graph benchmarks, then on real data of social networks and on models for signal processing on graphs.

[1]  K. Kaski,et al.  Limited resolution in complex network community detection with Potts model approach , 2006 .

[2]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[3]  Benjamin King Step-Wise Clustering Procedures , 1967 .

[4]  Roger Guimerà,et al.  Extracting the hierarchical organization of complex systems , 2007, Proceedings of the National Academy of Sciences.

[5]  A. Vespignani,et al.  The architecture of complex weighted networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Katrina Whittingham,et al.  Seeing the bigger picture. , 2011, Nursing standard (Royal College of Nursing (Great Britain) : 1987).

[7]  Jean-Loup Guillaume,et al.  Community cores in evolving networks , 2012, WWW.

[8]  Mark Crovella,et al.  Graph wavelets for spatial traffic analysis , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[9]  Richard F. Betzel,et al.  Multi-scale community organization of the human structural connectome and its relationship with resting-state functional connectivity , 2013, Network Science.

[10]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[11]  Sergios Theodoridis,et al.  Adaptation and Learning over Complex Networks , 2013 .

[12]  Pascal Frossard,et al.  Chebyshev polynomial approximation for distributed signal processing , 2011, 2011 International Conference on Distributed Computing in Sensor Systems and Workshops (DCOSS).

[13]  Ciro Cattuto,et al.  High-Resolution Measurements of Face-to-Face Contact Patterns in a Primary School , 2011, PloS one.

[14]  Michael G. Rabbat,et al.  GANC: Greedy Agglomerative Normalized Cut , 2011, ArXiv.

[15]  Renaud Lambiotte,et al.  Multi-scale modularity in complex networks , 2010, 8th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks.

[16]  R. Coifman,et al.  Diffusion Wavelets , 2004 .

[17]  J. Denehy,et al.  Seeing The Bigger Picture , 2007, The Journal of school nursing : the official publication of the National Association of School Nurses.

[18]  Marc Moonen,et al.  Seeing the Bigger Picture: How Nodes Can Learn Their Place Within a Complex Ad Hoc Network Topology , 2013, IEEE Signal Processing Magazine.

[19]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[20]  Joel C. Miller,et al.  Efficient Generation of Networks with Given Expected Degrees , 2011, WAW.

[21]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[22]  Dimitri Van De Ville,et al.  Tight Wavelet Frames on Multislice Graphs , 2013, IEEE Transactions on Signal Processing.

[23]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[24]  Dimitri Van De Ville,et al.  Machine Learning with Brain Graphs: Predictive Modeling Approaches for Functional Imaging in Systems Neuroscience , 2013, IEEE Signal Processing Magazine.

[25]  J. Reichardt,et al.  Statistical mechanics of community detection. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Sanjukta Bhowmick,et al.  Constant Communities in Complex Networks , 2013, Scientific Reports.

[27]  José M. F. Moura,et al.  Discrete Signal Processing on Graphs , 2012, IEEE Transactions on Signal Processing.

[28]  Robert Tibshirani,et al.  Estimating the number of clusters in a data set via the gap statistic , 2000 .

[29]  F. Chung,et al.  The average distances in random graphs with given expected degrees , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[31]  Pierre Borgnat,et al.  Multiscale Detection of Stable Communities Using Wavelets on Networks , 2013 .

[32]  Jean-Charles Delvenne,et al.  Markov Dynamics as a Zooming Lens for Multiscale Community Detection: Non Clique-Like Communities and the Field-of-View Limit , 2011, PloS one.

[33]  Pierre Vandergheynst,et al.  Wavelets on Graphs via Spectral Graph Theory , 2009, ArXiv.

[34]  Alain Barrat,et al.  Bootstrapping under constraint for the assessment of group behavior in human contact networks , 2012, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Pierre Borgnat,et al.  Multiscale community mining in networks using the graph wavelet transform of random vectors , 2013, 2013 IEEE Global Conference on Signal and Information Processing.

[36]  M. A. Munoz,et al.  Improved spectral algorithm for the detection of network communities , 2005 .

[37]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[39]  Pierre Borgnat,et al.  Multiscale community mining in networks using spectral graph wavelets , 2012, 21st European Signal Processing Conference (EUSIPCO 2013).

[40]  U. Feige,et al.  Spectral Graph Theory , 2015 .

[41]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[42]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[43]  M. Mitrovic,et al.  Spectral and dynamical properties in classes of sparse networks with mesoscopic inhomogeneities. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[44]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[45]  M. Cugmas,et al.  On comparing partitions , 2015 .

[46]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[47]  Alex Arenas,et al.  Analysis of the structure of complex networks at different resolution levels , 2007, physics/0703218.

[48]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.