Dynamic Anomaly Detection Using Vector Autoregressive Model

Identifying vandals or attackers hidden in dynamic online social network data has proven to be a challenging problem. In this work, we develop a dynamic attack/anomaly detection approach that combines graph spectral features with the restricted Vector Autoregressive (rVAR) model. Our approach applies time series modeling to a non-randomness metric derived from the graph spectral features in order to capture abnormal activities and interactions of individuals. Furthermore, we demonstrate how to apply the Granger causality test to the fitted rVAR model to identify causal relationships among user activities, which can then be interpreted as endogenous and/or exogenous influences on each individual's anomaly measure. We conduct empirical evaluations on the Wikipedia vandal detection dataset to demonstrate the efficacy of the proposed approach.
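The pipeline described above can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions. The node non-randomness score is taken to be the sum over the k largest adjacency eigenpairs of lambda_j * x_ij^2. An ordinary unrestricted VAR(1) fitted by least squares stands in for the rVAR. The Granger check is a plain F statistic comparing regressions with and without the candidate cause. All function names and the simulated data are hypothetical.

```python
import numpy as np

def node_nonrandomness(adj, k=2):
    """Per-node score sum_j lambda_j * x_ij^2 over the k largest eigenpairs (assumed definition)."""
    vals, vecs = np.linalg.eigh(adj)            # symmetric adjacency matrix
    top = np.argsort(vals)[::-1][:k]            # indices of the k largest eigenvalues
    return (vecs[:, top] ** 2) @ vals[top]

def fit_var1(series):
    """Least-squares fit of y_t = c + A y_{t-1} + e_t; series has shape (T, n)."""
    X = np.hstack([np.ones((len(series) - 1, 1)), series[:-1]])
    coef, *_ = np.linalg.lstsq(X, series[1:], rcond=None)
    return coef[0], coef[1:].T                  # intercept c, coefficient matrix A

def granger_f(series, cause, effect):
    """F statistic: does lag 1 of `cause` improve the prediction of `effect`?"""
    y = series[1:, effect]
    X_full = np.hstack([np.ones((len(y), 1)), series[:-1]])
    X_restr = np.delete(X_full, cause + 1, axis=1)   # drop the cause's lag column

    def rss(X):
        beta = np.linalg.lstsq(X, y, rcond=None)[0]
        return np.sum((y - X @ beta) ** 2)

    rss_full, rss_restr = rss(X_full), rss(X_restr)
    dof = len(y) - X_full.shape[1]
    return (rss_restr - rss_full) / (rss_full / dof)

# Toy graph snapshot: scores for four nodes (hypothetical data).
adj = np.array([[0, 1, 1, 0],
                [1, 0, 1, 0],
                [1, 1, 0, 1],
                [0, 0, 1, 0]], dtype=float)
scores = node_nonrandomness(adj, k=2)

# Simulated score series for two users, where user 0 drives user 1.
rng = np.random.default_rng(0)
A_true = np.array([[0.5, 0.0],
                   [0.8, 0.3]])
series = np.zeros((500, 2))
for t in range(1, 500):
    series[t] = A_true @ series[t - 1] + rng.normal(size=2)

c_hat, A_hat = fit_var1(series)
f_0_to_1 = granger_f(series, cause=0, effect=1)
f_1_to_0 = granger_f(series, cause=1, effect=0)
```

In this toy setting the statistic for user 0 influencing user 1 should be far larger than the reverse direction, which mirrors the idea of separating exogenous influence from a user's own (endogenous) dynamics. A real analysis would compare the statistic against an F distribution and would select the lag order rather than fix it at 1.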
