论文信息 - Best from Top k Versus Top 1: Improving Distant Supervision Relation Extraction with Deep Reinforcement Learning - 字舞流文

Best from Top k Versus Top 1: Improving Distant Supervision Relation Extraction with Deep Reinforcement Learning

Distant supervision relation extraction is a promising approach to find new relation instances from large text corpora. Most previous works employ the top 1 strategy, i.e., predicting the relation of a sentence with the highest confidence score, which is not always the optimal solution. To improve distant supervision relation extraction, this work applies the best from top k strategy to explore the possibility of relations with lower confidence scores. We approach the best from top k strategy using a deep reinforcement learning framework, where the model learns to select the optimal relation among the top k candidates for better predictions. Specifically, we employ a deep Q-network, trained to optimize a reward function that reflects the extraction performance under distant supervision. The experiments on three public datasets - of news articles, Wikipedia and biomedical papers - demonstrate that the proposed strategy improves the performance of traditional state-of-the-art relation extractors significantly. We achieve an improvement of 5.13% in average F\(_1\)-score over four competitive baselines.

Zhiqiang Gao | Qian Liu | Yaocheng Gui | Tingming Lu

[1] Li Zhao,et al. Reinforcement Learning for Relation Classification From Noisy Data , 2018, AAAI.

[2] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[3] Luke S. Zettlemoyer,et al. Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations , 2011, ACL.

[4] William Yang Wang,et al. Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning , 2018, ACL.

[5] Andrew McCallum,et al. Modeling Relations and Their Mentions without Labeled Text , 2010, ECML/PKDD.

[6] Oren Etzioni,et al. Modeling Missing Data in Distant Supervision for Information Extraction , 2013, TACL.

[7] Anna Lisa Gentile,et al. Mining Relations from Unstructured Content , 2018, PAKDD.

[8] Regina Barzilay,et al. Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning , 2016, EMNLP.

[9] Ramesh Nallapati,et al. Multi-instance Multi-label Learning for Relation Extraction , 2012, EMNLP.

[10] Jari Björne,et al. BioInfer: a corpus for information extraction in the biomedical domain , 2007, BMC Bioinformatics.

[11] Daniel S. Weld,et al. Type-Aware Distantly Supervised Relation Extraction with Linked Arguments , 2014, EMNLP.

[12] Xin Luna Dong,et al. CERES: Distantly Supervised Relation Extraction from the Semi-Structured Web , 2018, Proc. VLDB Endow..

[13] Daniel Jurafsky,et al. Distant supervision for relation extraction without labeled data , 2009, ACL.

[14] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.

[15] Jun Zhao,et al. Large Scaled Relation Extraction With Reinforcement Learning , 2018, AAAI.

[16] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[17] Jian Su,et al. Exploring Various Knowledge in Relation Extraction , 2005, ACL.

[18] Zhiyuan Liu,et al. Neural Relation Extraction with Selective Attention over Instances , 2016, ACL.

[19] Jun Zhao,et al. Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks , 2015, EMNLP.