GEDet: Detecting Erroneous Nodes with A Few Examples

Detecting nodes with erroneous values in real-world graphs remains challenging due to the lack of examples and various error scenarios. We demonstrate GEDet, an error detection engine that can detect erroneous nodes in graphs with a few examples. The GEDet framework tackles error detection as a few-shot node classification problem. We invite the attendees to experience the following unique features. (1) Few-shot detection. Users only need to provide a few examples of erroneous nodes to perform error detection with GEDet. GEDet achieves desirable accuracy with (a) a graph augmentation module, which automatically generates synthetic examples to learn the classifier, and (b) an adversarial detection module, which improves classifiers to better distinguish erroneous nodes from both cleaned nodes and synthetic examples. We show that GEDet significantly improves the state-of-the-art error detection methods. (2) Diverse error scenarios. GEDet profiles data errors with a built-in library of transformation functions from correct values to errors. Users can also easily “plug in” new error types or examples. (3) User-centric detection. GEDet supports (a) an active learning mode to engage users to verify detected results, and adapts the error detection process accordingly; and (b) visual interfaces to interpret and track detected errors. PVLDB Reference Format: Sheng Guan, Hanchao Ma, Sutanay Choudhury, and Yinghui Wu. GEDet: Detecting Erroneous Nodes with A Few Examples. PVLDB, 14(12): 2875-2878, 2021. doi:10.14778/3476311.3476367

[1]  Yinghui Wu,et al.  Functional Dependencies for Graphs , 2016, SIGMOD Conference.

[2]  Philip S. Yu,et al.  A Comprehensive Survey on Graph Neural Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Xiao Huang,et al.  Accelerated Local Anomaly Detection via Resolving Attributed Networks , 2017, IJCAI.

[4]  Michael Stonebraker,et al.  Detecting Data Errors: Where are we and what needs to be done? , 2016, Proc. VLDB Endow..

[5]  Yinghui Wu,et al.  GEDet: Adversarially Learned Few-shot Detection of Erroneous Nodes in Graphs , 2020, 2020 IEEE International Conference on Big Data (Big Data).

[6]  James T. Kwok,et al.  Generalizing from a Few Examples , 2019, ACM Comput. Surv..

[7]  Heiko Paulheim,et al.  Knowledge graph refinement: A survey of approaches and evaluation methods , 2016, Semantic Web.