Learning Node Abnormality with Weak Supervision

Graph anomaly detection aims to identify the atypical substructures and has attracted an increasing amount of research attention due to its profound impacts in a variety of application domains, including social network analysis, security, finance, and many more. The lack of prior knowledge of the ground-truth anomaly has been a major obstacle in acquiring fine-grained annotations (e.g., anomalous nodes), therefore, a plethora of existing methods have been developed either with a limited number of node-level supervision or in an unsupervised manner. Nonetheless, annotations for coarse-grained graph elements (e.g., a suspicious group of nodes), which often require marginal human effort in terms of time and expertise, are comparatively easier to obtain. Therefore, it is appealing to investigate anomaly detection in a weakly-supervised setting and to establish the intrinsic relationship between annotations at different levels of granularity. In this paper, we tackle the challenging problem of weakly-supervised graph anomaly detection with coarse-grained supervision by (1) proposing a novel architecture of graph neural network with attention mechanism named WEDGE that can identify the critical node-level anomaly given a few labels of anomalous subgraphs, and (2) designing a novel objective with contrastive loss that facilitates node representation learning by enforcing distinctive representations between normal and abnormal graph elements. Through extensive evaluations on real-world datasets, we corroborate the efficacy of our proposed method, improving AUC-ROC by up to 16.48% compared to the best competitor.

[1]  H. Tong,et al.  Node Classification Beyond Homophily: Towards a General Solution , 2023, KDD.

[2]  Hanghang Tong,et al.  Adversarial Attacks on Multi-Network Mining: Problem Definition and Fast Solutions , 2023, IEEE Transactions on Knowledge and Data Engineering.

[3]  Shirui Pan,et al.  GOOD-D: On Unsupervised Graph Out-Of-Distribution Detection , 2022, WSDM.

[4]  T. Abdelzaher,et al.  Dissecting Cross-Layer Dependency Inference on Multi-Layered Inter-Dependent Networks , 2022, CIKM.

[5]  N. Chawla,et al.  Toward Graph Minimally-Supervised Learning , 2022, KDD.

[6]  H. Tong,et al.  JuryGCN: Quantifying Jackknife Uncertainty on Graph Convolutional Networks , 2022, KDD.

[7]  Jianheng Tang,et al.  Rethinking Graph Neural Networks for Anomaly Detection , 2022, ICML.

[8]  Kaize Ding,et al.  Eliciting Structural and Semantic Global Knowledge in Unsupervised Graph Contrastive Learning , 2022, AAAI.

[9]  Hanghang Tong,et al.  Data Augmentation for Deep Graph Learning , 2022, SIGKDD Explor..

[10]  Khoa T. Phan,et al.  Generative and Contrastive Self-Supervised Learning for Graph Anomaly Detection , 2021, IEEE Transactions on Knowledge and Data Engineering.

[11]  Hanghang Tong,et al.  New Frontiers of Multi-Network Mining: Recent Developments and Future Trend , 2021, KDD.

[12]  Hanghang Tong,et al.  Attent: Active Attributed Network Alignment , 2021, WWW.

[13]  Hanghang Tong,et al.  BRIGHT: A Bridging Algorithm for Network Alignment , 2021, WWW.

[14]  G. Karypis,et al.  Anomaly Detection on Attributed Networks via Contrastive Self-Supervised Learning , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Hanghang Tong,et al.  Few-shot Network Anomaly Detection via Cross-network Meta-learning , 2021, WWW.

[16]  Zhangyang Wang,et al.  Graph Contrastive Learning with Augmentations , 2020, NeurIPS.

[17]  Tianwen Jiang,et al.  Error-Bounded Graph Anomaly Loss for GNNs , 2020, CIKM.

[18]  Philip S. Yu,et al.  Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters , 2020, CIKM.

[19]  James Caverlee,et al.  Next-item Recommendation with Sequential Hypergraphs , 2020, SIGIR.

[20]  Kaveh Hassani,et al.  Contrastive Multi-View Representation Learning on Graphs , 2020, ICML.

[21]  Zi Huang,et al.  GCN-Based User Representation Learning for Unifying Robust Recommendation and Fraudster Detection , 2020, SIGIR.

[22]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[23]  Hanghang Tong,et al.  Towards Real Time Team Optimization , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[24]  Ross B. Girshick,et al.  Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Hanghang Tong,et al.  MrMine: Multi-resolution Multi-network Embedding , 2019, CIKM.

[26]  Jun Zhou,et al.  A Semi-Supervised Graph Attentive Network for Financial Fraud Detection , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[27]  Hanghang Tong,et al.  ADMIRING: Adversarial Multi-network Mining , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[28]  Dong Li,et al.  Spam Review Detection with Graph Convolutional Networks , 2019, CIKM.

[29]  Anton van den Hengel,et al.  Deep Anomaly Detection with Deviation Networks , 2019, KDD.

[30]  Phillip Isola,et al.  Contrastive Multiview Coding , 2019, ECCV.

[31]  Bowen Zhou,et al.  Multiple instance learning with graph neural networks , 2019, ArXiv.

[32]  Alexander Binder,et al.  Deep Semi-Supervised Anomaly Detection , 2019, ICLR.

[33]  Huan Liu,et al.  Deep Anomaly Detection on Attributed Networks , 2019, SDM.

[34]  Kilian Q. Weinberger,et al.  Simplifying Graph Convolutional Networks , 2019, ICML.

[35]  Huan Liu,et al.  Interactive Anomaly Detection on Attributed Networks , 2019, WSDM.

[36]  Jure Leskovec,et al.  How Powerful are Graph Neural Networks? , 2018, ICLR.

[37]  Oriol Vinyals,et al.  Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.

[38]  Jure Leskovec,et al.  Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation , 2018, NeurIPS.

[39]  Jakub M. Tomczak,et al.  Attention-based Deep Multiple Instance Learning , 2018, ICML.

[40]  Mirella Lapata,et al.  Multiple Instance Learning Networks for Fine-Grained Sentiment Analysis , 2017, TACL.

[41]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[42]  Huan Liu,et al.  Radar: Residual Analysis for Anomaly Detection in Attributed Networks , 2017, IJCAI.

[43]  Hanghang Tong,et al.  FIRST: Fast Interactive Attributed Subgraph Matching , 2017, KDD.

[44]  Randy C. Paffenroth,et al.  Anomaly Detection with Robust Deep Autoencoders , 2017, KDD.

[45]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[46]  Yale Song,et al.  Learning from Noisy Labels with Distillation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[47]  Ji Feng,et al.  Deep MIML Network , 2017, AAAI.

[48]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[49]  Lei Xie,et al.  FASCINATE: Fast Cross-Layer Dependency Inference on Multi-layered Networks , 2016, KDD.

[50]  Leman Akoglu,et al.  Collective Opinion Spam Detection: Bridging Review Networks and Metadata , 2015, KDD.

[51]  Misha Denil,et al.  From Group to Individual Labels Using Deep Features , 2015, KDD.

[52]  Cordelia Schmid,et al.  Weakly Supervised Object Localization with Multi-Fold Multiple Instance Learning , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[54]  Klemens Böhm,et al.  Local context selection for outlier ranking in graphs with multiple numeric node attributes , 2014, SSDBM '14.

[55]  Mohammad Ali Abbasi,et al.  Social Media Mining: An Introduction , 2014 .

[56]  Danai Koutra,et al.  Graph based anomaly detection and description: a survey , 2014, Data Mining and Knowledge Discovery.

[57]  Zhuowen Tu,et al.  Weakly supervised histopathology cancer image segmentation and classification , 2014, Medical Image Anal..

[58]  Klemens Böhm,et al.  Ranking outlier nodes in subspaces of attributed graphs , 2013, 2013 IEEE 29th International Conference on Data Engineering Workshops (ICDEW).

[59]  Jure Leskovec,et al.  From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews , 2013, WWW.

[60]  Rong Jin,et al.  Understanding bag-of-words model: a statistical framework , 2010, Int. J. Mach. Learn. Cybern..

[61]  Lise Getoor,et al.  Collective Classification in Network Data , 2008, AI Mag..

[62]  Zhi-Hua Zhou,et al.  Multi-instance multi-label learning , 2008, Artif. Intell..

[63]  Zhi-Hua Zhou,et al.  Multi-instance learning by treating instances as non-I.I.D. samples , 2008, ICML '09.

[64]  Sanjay Ranka,et al.  Conditional Anomaly Detection , 2007, IEEE Transactions on Knowledge and Data Engineering.

[65]  Christos Faloutsos,et al.  Fast Random Walk with Restart and Its Applications , 2006, Sixth International Conference on Data Mining (ICDM'06).

[66]  Damon P. Coppola Introduction to International Disaster Management , 2006 .

[67]  Xin Xu,et al.  Logistic Regression and Boosting for Labeled Bags of Instances , 2004, PAKDD.

[68]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[69]  O. Maron,et al.  Multiple-Instance Learning for Natural Scene Classification , 1998, ICML.

[70]  Jingrui He,et al.  HiDDen: Hierarchical Dense Subgraph Detection with Application to Financial Fraud Detection , 2017, SDM.

[71]  Hans-Peter Kriegel,et al.  Interpreting and Unifying Outlier Scores , 2011, SDM.