论文信息 - Relation Classification via Recurrent Neural Network - 字舞流文

Relation Classification via Recurrent Neural Network

Deep learning has gained much success in sentence-level relation classification. For example, convolutional neural networks (CNN) have delivered competitive performance without much effort on feature engineering as the conventional pattern-based methods. Thus a lot of works have been produced based on CNN structures. However, a key issue that has not been well addressed by the CNN-based method is the lack of capability to learn temporal features, especially long-distance dependency between nominal pairs. In this paper, we propose a simple framework based on recurrent neural networks (RNN) and compare it with CNN-based model. To show the limitation of popular used SemEval-2010 Task 8 dataset, we introduce another dataset refined from MIMLRE(Angeli et al., 2014). Experiments on two different datasets strongly indicates that the RNN-based model can deliver better performance on relation classification, and it is particularly capable of learning long-distance relation patterns. This makes it suitable for real-world applications where complicated expressions are often involved.

Dong Wang | Dongxu Zhang | Dong Wang | Dongxu Zhang

[1] Gerhard Weikum,et al. Combining linguistic and statistical analysis to extract relations from web documents , 2006, KDD '06.

[2] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[3] Ralph Grishman,et al. Relation Extraction: Perspective from Convolutional Neural Networks , 2015, VS@HLT-NAACL.

[4] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[5] Yoshua Bengio,et al. Word Representations: A Simple and General Method for Semi-Supervised Learning , 2010, ACL.

[6] Paul J. Werbos,et al. Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[7] Mikael Bodén,et al. A guide to recurrent neural networks and backpropagation , 2001 .

[8] Daniel Jurafsky,et al. Distant supervision for relation extraction without labeled data , 2009, ACL.

[9] Mo Yu. Factor-based Compositional Embedding Models , 2014 .

[10] Jun Zhao,et al. Relation Classification via Convolutional Deep Neural Network , 2014, COLING.

[11] Geoffrey E. Hinton,et al. Learning sets of filters using back-propagation , 1987 .

[12] Jason Weston,et al. Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[13] Nanda Kambhatla,et al. Combining Lexical, Syntactic, and Semantic Features with Maximum Entropy Models for Information Extraction , 2004, ACL.

[14] Jason Weston,et al. A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[15] Andrew Y. Ng,et al. Semantic Compositionality through Recursive Matrix-Vector Spaces , 2012, EMNLP.

[16] Razvan C. Bunescu,et al. Subsequence Kernels for Relation Extraction , 2005, NIPS.

[17] Lukás Burget,et al. Recurrent neural network based language model , 2010, INTERSPEECH.

[18] Razvan C. Bunescu,et al. A Shortest Path Dependency Kernel for Relation Extraction , 2005, HLT.

[19] Preslav Nakov,et al. SemEval-2010 Task 8: Multi-Way Classification of Semantic Relations Between Pairs of Nominals , 2009, SEW@NAACL-HLT.

[20] Dongyan Zhao,et al. Semantic Relation Classification via Convolutional Neural Networks with Simple Negative Sampling , 2015, EMNLP.

[21] Andrew McCallum,et al. Relation Extraction with Matrix Factorization and Universal Schemas , 2013, NAACL.

[22] Klaus-Robert Müller,et al. Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.

[23] Ramesh Nallapati,et al. Multi-instance Multi-label Learning for Relation Extraction , 2012, EMNLP.

[24] Luke S. Zettlemoyer,et al. Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations , 2011, ACL.

[25] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[26] Zhi Jin,et al. Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths , 2015, EMNLP.

[27] Bowen Zhou,et al. Classifying Relations by Ranking with Convolutional Neural Networks , 2015, ACL.

[28] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[29] Andrew McCallum,et al. Modeling Relations and Their Mentions without Labeled Text , 2010, ECML/PKDD.

[30] Rabab Kreidieh Ward,et al. Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[31] Christopher D. Manning,et al. Combining Distant and Partial Supervision for Relation Extraction , 2014, EMNLP.