Dilated Convolutional Networks Incorporating Soft Entity Type Constraints for Distant Supervised Relation Extraction

Although distant supervised relation extraction has been widely used, its performance is weakened by the wrong labeling problem. During eliminating noise, many previous work failed to grasp the trade-off between long dependencies and computational complexity while using neural networks, and ignored the potential noise in external knowledge. In this paper, we propose a neural model incorporating dilated convolutional neural networks with soft entity type constraints to jointly tackle the above two issues. Rather than using traditional convolutional neural networks or recurrent neural networks, we propose dilated convolutional networks as the sentence encoder so as to capture long dependencies in large scale context and improve the robustness against local noise while keeping the efficiency of computation. Furthermore, we utilize entity types, which is considerd as external knowledge with noise, to denoise in relation classification by taking the constraints between entity types and relations into account and learning more precise entity types and attention weights simultaneously. Experimental results on the New York Times dataset show that, our model incorporating dilated convolutional networks with soft entity type constraints promotes distant supervised relation extraction and achieves state-of-art results compared with baselines.

[1]  Ramesh Nallapati,et al.  Multi-instance Multi-label Learning for Relation Extraction , 2012, EMNLP.

[2]  Jun Zhao,et al.  Relation Classification via Convolutional Deep Neural Network , 2014, COLING.

[3]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[4]  Ralph Grishman,et al.  Discovering Relations among Named Entities from Large Corpora , 2004, ACL.

[5]  Wei Shi,et al.  Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification , 2016, ACL.

[6]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[7]  Andrew McCallum,et al.  Fast and Accurate Entity Recognition with Iterated Dilated Convolutions , 2017, EMNLP.

[8]  Jason Weston,et al.  Connecting Language and Knowledge Bases with Embedding Models for Relation Extraction , 2013, EMNLP.

[9]  Dong-Hong Ji,et al.  Unsupervised Feature Selection for Relation Extraction , 2005, IJCNLP.

[10]  Zhiyuan Liu,et al.  Neural Relation Extraction with Selective Attention over Instances , 2016, ACL.

[11]  William Yang Wang,et al.  Deep Residual Learning for Weakly-Supervised Relation Extraction , 2017, EMNLP.

[12]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.

[13]  Razvan C. Bunescu,et al.  Subsequence Kernels for Relation Extraction , 2005, NIPS.

[14]  Jian Su,et al.  Exploring Various Knowledge in Relation Extraction , 2005, ACL.

[15]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[16]  Jun Zhao,et al.  Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks , 2015, EMNLP.

[17]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[18]  Luke S. Zettlemoyer,et al.  Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations , 2011, ACL.

[19]  Andrew McCallum,et al.  Modeling Relations and Their Mentions without Labeled Text , 2010, ECML/PKDD.

[20]  Ronen Feldman,et al.  Clustering for unsupervised relation identification , 2007, CIKM '07.

[21]  Jun Zhao,et al.  Distant Supervision for Relation Extraction with Sentence-Level Attention and Entity Descriptions , 2017, AAAI.