Spectral Anomaly Detection in Very Large Graphs: Models, Noise, and Computational Complexity

Anomaly detection in massive networks has numerous theoretical and computational challenges, especially as the behavior to be detected becomes small in comparison to the larger network. This presentation focuses on recent results in three key technical areas, specifically geared toward spectral methods for detection. We first discuss recent models for network behavior, and how their structure can be exploited for efficient computation of the principal eigenspace of the graph. In addition to the stochasticity of background activity, a graph of interest may be observed through a noisy or imperfect mechanism, which may hinder the detection process. A few simple noise models are discussed, and we demonstrate the ability to fuse multiple corrupted observations and recover detection performance. Finally, we discuss the challenges in scaling the spectral algorithms to large-scale high-performance computing systems, and present preliminary recommendations to achieve good performance with current parallel eigensolvers.

[1]  Andy B. Yoo,et al.  A scalable eigensolver for large scale-free graphs using 2D graph partitioning , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[2]  Benjamin A. Miller,et al.  Sparse matrix partitioning for parallel eigenanalysis of large static and dynamic graphs , 2014, 2014 IEEE High Performance Extreme Computing Conference (HPEC).

[3]  Michael M. Wolf,et al.  Detecting Anomalies in Very Large Graphs ∗ , 2014 .

[4]  Christos Faloutsos,et al.  R-MAT: A Recursive Model for Graph Mining , 2004, SDM.

[5]  Patrick J. Wolfe,et al.  A Spectral Framework for Anomalous Subgraph Detection , 2014, IEEE Transactions on Signal Processing.

[6]  Benjamin A. Miller,et al.  Efficient anomaly detection in dynamic, attributed graphs: Emerging phenomena and big data , 2013, 2013 IEEE International Conference on Intelligence and Security Informatics.

[7]  Benjamin A. Miller,et al.  Spectral subgraph detection with corrupt observations , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Xiao Zhang,et al.  Spectra of random graphs with community structure and arbitrary degrees , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Richard B. Lehoucq,et al.  Anasazi software for the numerical solution of large-scale eigenvalue problems , 2009, TOMS.

[10]  Sivasankaran Rajamanickam,et al.  Scalable matrix computations on large scale-free graphs using 2D graph partitioning , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[11]  Polina Golland,et al.  From Connectivity Models to Region Labels: Identifying Foci of a Neurological Disorder , 2013, IEEE Transactions on Medical Imaging.

[12]  Patrick J. Wolfe,et al.  Detection Theory for Graphs , 2013 .

[13]  Mark S Handcock,et al.  MODELING SOCIAL NETWORKS FROM SAMPLED DATA. , 2010, The annals of applied statistics.

[14]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Hisashi Kashima,et al.  Eigenspace-based anomaly detection in computer systems , 2004, KDD.