Relation Classification in Scientific Papers Based on Convolutional Neural Network

Scientific papers are important for scholars to track trends in specific research areas. With the increase in the number of scientific papers, it is difficult for scholars to read all the papers to extract emerging or noteworthy knowledge. Paper modeling can help scholars master the key information in scientific papers, and relation classification (RC) between entity pairs is a major approach to paper modeling. To the best of our knowledge, most of the state-of-the-art RC methods are using entire sentence’s context information as input. However, long sentences have too much noise information, which is useless for classification. In this paper, a flexible context is selected as the input information for convolution neural network (CNN), which greatly reduces the noise. Moreover, we find that entity type is another important feature for RC. Based on these findings, we construct a typical CNN architecture to learn features from raw texts automatically, and use a softmax function to classify the entity pairs. Our experiment on SemEval-2018 task 7 dataset yields a macro-F1 value of 83.91%, ranking first among all participants.

[1]  Hai Zhao,et al.  A Stacking Gated Neural Architecture for Implicit Discourse Relation Classification , 2016, EMNLP.

[2]  Jun Zhao,et al.  Relation Classification via Convolutional Deep Neural Network , 2014, COLING.

[3]  Roy Schwartz,et al.  Minimally Supervised Classification to Semantic Categories using Automatically Acquired Symmetric Patterns , 2014, COLING.

[4]  Jun Guo,et al.  An empirical convolutional neural network approach for semantic relation classification , 2016, Neurocomputing.

[5]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[6]  Zornitsa Kozareva Cause-Effect Relation Learning , 2012, TextGraphs@ACL.

[7]  Behrang Q. Zadeh,et al.  The ACL RD-TEC 2.0: A Language Resource for Evaluating Term Extraction and Entity Recognition Methods , 2016, LREC.

[8]  Dragomir R. Radev,et al.  The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics , 2008, LREC.

[9]  Wei Luo,et al.  IRCMS at SemEval-2018 Task 7 : Evaluating a basic CNN Method and Traditional Pipeline Method for Relation Classification , 2018, SemEval@NAACL-HLT.

[10]  Jun Xu,et al.  A Unified Architecture for Semantic Role Labeling and Relation Classification , 2016, COLING.

[11]  Ramesh Nallapati,et al.  Multi-instance Multi-label Learning for Relation Extraction , 2012, EMNLP.

[12]  Wei Luo,et al.  A Semantic Representation Enhancement Method for Chinese News Headline Classification , 2017, NLPCC.

[13]  Dongyan Zhao,et al.  Semantic Relation Classification via Convolutional Neural Networks with Simple Negative Sampling , 2015, EMNLP.

[14]  Christopher D. Manning,et al.  Combining Distant and Partial Supervision for Relation Extraction , 2014, EMNLP.

[15]  Dong Wang,et al.  Relation Classification via Recurrent Neural Network , 2015, ArXiv.