Detecting Structure-correlated Attributes on Graphs

Do users from Carnegie Mellon University form social communities on Facebook? In this paper, we focus on a task of detecting structure-correlated attributes on a graph. A structure-correlated attribute means that the node set activated by the attribute form a community in the graph. This task is relevant to many applications including identifying structure-correlated attributes in social networks, special events in the urban traffic networks, unusual brain activity in the brain connectivity networks, and viruses in cyber-physical systems. To solve this task, we formulate a statistical hypothesis testing to decide if the given attribute activates a community in a graph with interfering by the Bernoulli noise. We propose two statistics: graph wavelet statistic and graph scan statistic. Both are shown to be efficient and statistically effective to detect activations. The intuition behind the proposed statistics is that we study the interaction between graph structure and the given attribute, that is, we denoise the attribute based on the graph structure and localize the underlying community in the graph. We then test the proposed hypothesis tests on simulated data to validate the effectiveness and robustness of the proposed methods. We further apply the proposed methods to two real-world applications: high air pollution detection and ranking attributes for community detection in a coauthorship network collected from IEEE Xplore. The experimental results show that the proposed graph wavelet statistic and graph scan statistic are effective and efficient.

[1]  Yuan Zhang,et al.  Community Detection in Networks with Node Features , 2015, Electronic Journal of Statistics.

[2]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[3]  E. Candès,et al.  Detection of an anomalous cluster in a network , 2010, 1001.3209.

[4]  Pier Luigi Dragotti,et al.  Sampling and Reconstruction of Sparse Signals on Circulant Graphs - An Introduction to Graph-FRI , 2016, ArXiv.

[5]  Biming Tian,et al.  Anomaly detection in wireless sensor networks: A survey , 2011, J. Netw. Comput. Appl..

[6]  Pascal Frossard,et al.  Learning Parametric Dictionaries for Signals on Graphs , 2014, IEEE Transactions on Signal Processing.

[7]  Yue M. Lu,et al.  A Spectral Graph Uncertainty Principle , 2012, IEEE Transactions on Information Theory.

[8]  H. Vincent Poor,et al.  Distributed Hybrid Power State Estimation Under PMU Sampling Phase Errors , 2014, IEEE Transactions on Signal Processing.

[9]  Akshay Krishnamurthy,et al.  Detecting Activations over Graphs using Spanning Tree Wavelet Bases , 2012, AISTATS.

[10]  Pierre Vandergheynst,et al.  A Multiscale Pyramid Transform for Graph Signals , 2013, IEEE Transactions on Signal Processing.

[11]  José M. F. Moura,et al.  Discrete Signal Processing on Graphs , 2012, IEEE Transactions on Signal Processing.

[12]  Pierre Vandergheynst,et al.  Wavelets on Graphs via Spectral Graph Theory , 2009, ArXiv.

[13]  Georgios B. Giannakis,et al.  Kernel-Based Reconstruction of Graph Signals , 2016, IEEE Transactions on Signal Processing.

[14]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[15]  Jelena Kovacevic,et al.  Discrete Signal Processing on Graphs: Sampling Theory , 2015, IEEE Transactions on Signal Processing.

[16]  Yan-lin Zhang,et al.  Fine particulate matter (PM2.5) in China at a city level , 2015, Scientific Reports.

[17]  Sunil K. Narang,et al.  Compact Support Biorthogonal Wavelet Filterbanks for Arbitrary Undirected Graphs , 2012, IEEE Transactions on Signal Processing.

[18]  Robert D. Nowak,et al.  Distilled sensing: selective sampling for sparse signal recovery , 2009, AISTATS.

[19]  Jelena Kovacevic,et al.  Signal Representations on Graphs: Tools and Applications , 2015, ArXiv.

[20]  D. O. North,et al.  An Analysis of the factors which determine signal/noise discrimination in pulsed-carrier systems , 1963 .

[21]  Pierre Borgnat,et al.  Graph Wavelets for Multiscale Community Mining , 2014, IEEE Transactions on Signal Processing.

[22]  Sergio Barbarossa,et al.  Signals on Graphs: Uncertainty Principle and Sampling , 2015, IEEE Transactions on Signal Processing.

[23]  Akshay Krishnamurthy,et al.  Minimax structured normal means inference , 2015, 2016 IEEE International Symposium on Information Theory (ISIT).

[24]  José M. F. Moura,et al.  Signal denoising on graphs via graph filtering , 2014, 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[25]  Sunil K. Narang,et al.  Unidirectional graph-based wavelet transforms for efficient data gathering in sensor networks , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[26]  Alessandro Rinaldo,et al.  Sparsistency of the Edge Lasso over Graphs , 2012, AISTATS.

[27]  Pascal Frossard,et al.  Clustering on Multi-Layer Graphs via Subspace Analysis on Grassmann Manifolds , 2013, IEEE Transactions on Signal Processing.

[28]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Jerome L. Myers,et al.  Research Design and Statistical Analysis , 1991 .

[30]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[31]  Vladimir Kolmogorov,et al.  What energy functions can be minimized via graph cuts? , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Emmanuel Abbe,et al.  Exact Recovery in the Stochastic Block Model , 2014, IEEE Transactions on Information Theory.

[33]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[34]  Pengfei Liu,et al.  Local-Set-Based Graph Signal Reconstruction , 2014, IEEE Transactions on Signal Processing.

[35]  A. Robert Calderbank,et al.  Detecting Weak but Hierarchically-Structured Patterns in Networks , 2010, AISTATS.

[36]  Quanzheng Li,et al.  Matched Signal Detection on Graphs: Theory and Application to Brain Network Classification , 2013, IPMI.

[37]  Michelle Wilde IEEE Xplore Digital Library , 2016 .

[38]  H. Vincent Poor,et al.  Nonparametric detection of an anomalous disk over a two-dimensional lattice network , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[39]  José M. F. Moura,et al.  Signal Recovery on Graphs: Variation Minimization , 2014, IEEE Transactions on Signal Processing.

[40]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[41]  Akshay Krishnamurthy,et al.  Near-optimal Anomaly Detection in Graphs using Lovasz Extended Scan Statistic , 2013, NIPS.

[42]  Jure Leskovec,et al.  Overlapping community detection at scale: a nonnegative matrix factorization approach , 2013, WSDM.

[43]  Jelena Kovacevic,et al.  Signal Recovery on Graphs: Fundamental Limits of Sampling Strategies , 2015, IEEE Transactions on Signal and Information Processing over Networks.

[44]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[45]  Antonio Ortega,et al.  Submitted to Ieee Transactions on Signal Processing 1 Efficient Sampling Set Selection for Bandlimited Graph Signals Using Graph Spectral Proxies , 2022 .

[46]  Santiago Segarra,et al.  Stationary Graph Processes and Spectral Estimation , 2016, IEEE Transactions on Signal Processing.

[47]  Akshay Krishnamurthy Minimaxity in Structured Normal Means Inference , 2015, ArXiv.

[48]  Alessandro Rinaldo,et al.  Changepoint Detection over Graphs with the Spectral Scan Statistic , 2012, AISTATS.

[49]  Sunil K. Narang,et al.  Signal processing techniques for interpolation in graph structured data , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[50]  James H. Garrett,et al.  Semi-Supervised Multiresolution Classification Using Adaptive Graph Filtering With Application to Indirect Bridge Structural Health Monitoring , 2014, IEEE Transactions on Signal Processing.

[51]  Soummya Kar,et al.  Distributed Consensus Algorithms in Sensor Networks With Imperfect Communication: Link Failures and Channel Noise , 2007, IEEE Transactions on Signal Processing.

[52]  Ian Witten,et al.  Data Mining , 2000 .

[53]  Vivek K Goyal,et al.  Foundations of Signal Processing , 2014 .

[54]  Michael G. Rabbat,et al.  Approximating signals supported on graphs , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[55]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[56]  Sunil K. Narang,et al.  Perfect Reconstruction Two-Channel Wavelet Filter Banks for Graph Structured Data , 2011, IEEE Transactions on Signal Processing.

[57]  Santiago Segarra,et al.  Sampling of Graph Signals With Successive Local Aggregations , 2015, IEEE Transactions on Signal Processing.

[58]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[59]  Alexander J. Smola,et al.  Trend Filtering on Graphs , 2014, J. Mach. Learn. Res..

[60]  M. Ledoux The concentration of measure phenomenon , 2001 .

[61]  Alfred O. Hero,et al.  Local Fiedler vector centrality for detection of deep and overlapping communities in networks , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).