Radar: Residual Analysis for Anomaly Detection in Attributed Networks

Attributed networks are pervasive in many domains, ranging from social networks and gene regulatory networks to financial transaction networks. This rich network representation poses challenges for anomaly detection because of the heterogeneity of its two data representations, network structure and node attributes. The vast majority of existing algorithms assume that certain properties of anomalies are given a priori. Because various types of anomalies coexist in real-world attributed networks, the assumption that prior knowledge about anomalies is available does not hold. In this paper, we investigate anomaly detection in attributed networks in a general way, from a residual analysis perspective, which has been shown to be effective in traditional anomaly detection problems. Residual analysis is non-trivial in attributed networks, however, because interactions among instances complicate the residual modeling process. Methodologically, we propose a learning framework that characterizes the residuals of attribute information and their coherence with network information for anomaly detection. By learning and analyzing the residuals, we detect anomalies whose behaviors are singularly different from those of the majority. Experiments on real datasets show the effectiveness and generality of the proposed framework.
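
As a rough illustration of the residual idea only, and not the learning framework proposed in the paper, the sketch below estimates each node's attributes from the mean of its neighbors' attributes and scores the node by the norm of the resulting residual. The neighbor-mean estimator, the function name neighbor_residual_scores, and the toy data are all assumptions made for this example; the abstract does not specify the objective function or how the residuals are learned.

```python
import numpy as np

def neighbor_residual_scores(X, A):
    """Score nodes by the norm of a simple attribute residual.

    Hypothetical helper, not the paper's method: the estimate for a node
    is the mean attribute vector of its neighbors (a homophily assumption).

    X: (n, d) node attribute matrix.
    A: (n, n) symmetric 0/1 adjacency matrix without self-loops.
    """
    X = np.asarray(X, dtype=float)
    A = np.asarray(A, dtype=float)
    deg = A.sum(axis=1, keepdims=True)
    deg[deg == 0] = 1.0           # isolated nodes: estimate stays the zero vector
    X_hat = (A @ X) / deg         # neighbor-mean estimate of each node's attributes
    R = X - X_hat                 # residual: what the neighborhood cannot explain
    return np.linalg.norm(R, axis=1)

# Toy data (made up): a 5-node complete graph where node 3 deviates.
A = np.ones((5, 5)) - np.eye(5)
X = np.ones((5, 2))
X[3] = [5.0, -3.0]

scores = neighbor_residual_scores(X, A)
ranking = np.argsort(-scores)     # node indices, most anomalous first
```

In this toy graph the deviating node receives the largest score, while its neighbors receive smaller nonzero scores because the deviation leaks into their neighbor means. That leakage is one simple way to see why interactions among instances complicate residual modeling.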
