Paper information - Adversarial Discriminative Denoising for Distant Supervision Relation Extraction

Adversarial Discriminative Denoising for Distant Supervision Relation Extraction

Distant supervision has been widely used to generate labeled data automatically for relation extraction by aligning a knowledge base with text. However, it introduces a large amount of noise, which can severely degrade the performance of relation extraction. Recent studies have attempted to remove the noise explicitly from the generated data, but they suffer from (1) the lack of an effective way of introducing explicit supervision into the denoising process and (2) the difficulty of optimization caused by the sampling action used when evaluating the denoising result. To solve these issues, we propose an adversarial discriminative denoising framework, which provides an effective way of introducing human supervision and exploiting it, together with the potentially useful information underlying the noisy data, in a unified framework. In addition, we employ a continuous approximation of the sampling action so that the whole denoising framework remains differentiable. Experimental results show that very little human supervision is sufficient for our approach to significantly outperform state-of-the-art methods.
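
The abstract does not say which continuous approximation is used, so the following is only an illustrative sketch of one common choice, the binary Concrete (two-class Gumbel-Softmax) relaxation. Instead of a hard keep/drop decision for each distantly labeled sentence, it draws a soft weight in [0, 1] that is a smooth function of the keep probability. The names `relaxed_keep_weight`, `keep_prob`, and `temperature` are hypothetical and do not come from the paper.

```python
import math
import random


def stable_sigmoid(x):
    """Logistic function that avoids overflow for large |x|."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def relaxed_keep_weight(keep_prob, temperature, rng):
    """Soft keep/drop weight via the binary Concrete relaxation.

    Assumption: this stands in for the paper's unspecified
    "continuous approximation of the sampling action".
    """
    eps = 1e-9
    p = min(max(keep_prob, eps), 1.0 - eps)
    u = min(max(rng.random(), eps), 1.0 - eps)
    logit = math.log(p) - math.log(1.0 - p)
    noise = math.log(u) - math.log(1.0 - u)  # logistic noise
    # Lower temperatures push the weight towards a hard 0/1 decision.
    return stable_sigmoid((logit + noise) / temperature)


rng = random.Random(0)
keep_probs = [0.9, 0.5, 0.1]  # hypothetical per-sentence keep probabilities
soft_weights = [relaxed_keep_weight(p, 0.5, rng) for p in keep_probs]
```

In a full model, weights like these would scale each sentence's contribution to the loss, so gradients can flow back to whatever component produces `keep_prob`. That would require an automatic differentiation library rather than the plain `math` calls used here.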
