Relation Extraction via Position-Enhanced Convolutional Neural Network

Recently, deep neural network based methods have been widely used in relation extraction, which is an important task for knowledge base population, question answering and other natural language applications, to learn proper features from entities pairs and other sentence parts to extract relations from text. As a kind of important information, the value of position is always been underestimated, which causes a low weight of position information in various models and finally hurts the performance of relation extraction task. To alleviate this issue, we propose a position-enhanced embedding model based on convolutional neural network. In this model, we split the sentence representation into three parts based on the entity pairs in the sentence, and use three independent convolutional networks to learn features. Furthermore, we concatenate the output from different branches and employ a softmax layer to compute the probability for each relation. Experimental results on wildly used datasets achieve considerable improvements on relation extraction as compared with baselines, which shows that our proposed model can make full use of position information.

[1]  Jun Zhao,et al.  Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks , 2015, EMNLP.

[2]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[3]  Jun Zhao,et al.  Relation Classification via Convolutional Deep Neural Network , 2014, COLING.

[4]  Ralph Grishman,et al.  Employing Word Representations and Regularization for Domain Adaptation of Relation Extraction , 2014, ACL.

[5]  Heng Ji,et al.  A Dependency-Based Neural Network for Relation Classification , 2015, ACL.

[6]  D. Sánchez,et al.  Automatic information extraction from the Web , 2004 .

[7]  Ngoc Thang Vu,et al.  Combining Recurrent and Convolutional Neural Networks for Relation Classification , 2016, NAACL.

[8]  Nanda Kambhatla,et al.  Combining Lexical, Syntactic, and Semantic Features with Maximum Entropy Models for Information Extraction , 2004, ACL.

[9]  Bowen Zhou,et al.  Classifying Relations by Ranking with Convolutional Neural Networks , 2015, ACL.

[10]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[11]  Zhi Jin,et al.  Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths , 2015, EMNLP.

[12]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[13]  Dong Wang,et al.  Relation Classification via Recurrent Neural Network , 2015, ArXiv.

[14]  Ralph Grishman,et al.  Combining Neural Networks and Log-linear Models to Improve Relation Extraction , 2015, ArXiv.

[15]  Ralph Grishman,et al.  NYU's English ACE 2005 System Description , 2005 .

[16]  Andrew Y. Ng,et al.  Semantic Compositionality through Recursive Matrix-Vector Spaces , 2012, EMNLP.

[17]  Dmitry Zelenko,et al.  Kernel Methods for Relation Extraction , 2002, J. Mach. Learn. Res..

[18]  Dejing Dou,et al.  Chain Based RNN for Relation Classification , 2015, NAACL.

[19]  Razvan C. Bunescu,et al.  Subsequence Kernels for Relation Extraction , 2005, NIPS.

[20]  Jian Su,et al.  Exploring Various Knowledge in Relation Extraction , 2005, ACL.

[21]  Ralph Grishman,et al.  Relation Extraction: Perspective from Convolutional Neural Networks , 2015, VS@HLT-NAACL.

[22]  Takashi Chikayama,et al.  Simple Customization of Recursive Neural Networks for Semantic Relation Classification , 2013, EMNLP.