Discourse Relation Prediction: Revisiting Word Pairs with Convolutional Networks

Word pairs across argument spans have been shown to be effective for predicting the discourse relation between them. We propose an approach to distill knowledge from word pairs for discourse relation classification with convolutional neural networks by incorporating joint learning of implicit and explicit relations. Our novel approach of representing the input as word pairs achieves state-of-the-art results on four-way classification of both implicit and explicit relations as well as one of the binary classification tasks. For explicit relation prediction, we achieve around 20% error reduction on the four-way task. At the same time, compared to a two-layered Bi-LSTM-CRF model, our model is able to achieve these results with half the number of learnable parameters and approximately half the amount of training time.

[1]  Yang Liu,et al.  Implicit Discourse Relation Classification via Multi-Task Neural Networks , 2016, AAAI.

[2]  Owen Rambow,et al.  Building and Refining Rhetorical-Semantic Relation Models , 2007, HLT-NAACL.

[3]  Hwee Tou Ng,et al.  CoNLL 2016 Shared Task on Multilingual Shallow Discourse Parsing , 2016, CoNLL.

[4]  Ani Nenkova,et al.  Automatic sense prediction for implicit discourse relations in text , 2009, ACL.

[5]  Jacob Eisenstein,et al.  Closing the Gap: Domain Adaptation from Explicit to Implicit Discourse Relations , 2015, EMNLP.

[6]  Kathleen McKeown,et al.  Aggregated Word Pair Features for Implicit Discourse Relation Disambiguation , 2013, ACL.

[7]  Livio Robaldo,et al.  The Penn Discourse TreeBank 2.0. , 2008, LREC.

[8]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.

[9]  Xuanjing Huang,et al.  Implicit Discourse Relation Detection via a Deep Architecture with Gated Relevance Network , 2016, ACL.

[10]  William S. Horton,et al.  Why or what next? Eye movements reveal expectations about discourse direction , 2010 .

[11]  Ruihong Huang,et al.  Improving Implicit Discourse Relation Classification by Modeling Inter-dependencies of Discourse Units in a Paragraph , 2018, NAACL.

[12]  Min-Yen Kan,et al.  SWIM: A Simple Word Interaction Model for Implicit Discourse Relation Recognition , 2017, IJCAI.

[13]  Ani Nenkova,et al.  Easily Identifiable Discourse Relations , 2008, COLING.

[14]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[15]  Zheng-Yu Niu,et al.  Multi-task Attention-based Neural Networks for Implicit Discourse Relationship Representation and Identification , 2017, EMNLP.

[16]  Hai Zhao,et al.  Deep Enhanced Representation for Implicit Discourse Relation Recognition , 2018, COLING.

[17]  Yang Liu,et al.  Recognizing Implicit Discourse Relations via Repeated Reading: Neural Networks with Multi-Level Attention , 2016, EMNLP.

[18]  Christian Chiarcos,et al.  A Recurrent Neural Model with Attention for the Recognition of Chinese Implicit Discourse Relations , 2017, ACL.

[19]  Hai Zhao,et al.  A Stacking Gated Neural Architecture for Implicit Discourse Relation Classification , 2016, EMNLP.

[20]  Hai Zhao,et al.  Adversarial Connective-exploiting Networks for Implicit Discourse Relation Classification , 2017, ACL.

[21]  Fatemeh Torabi Asr,et al.  Uniform Information Density at the Level of Discourse Relations: Negation Markers and Discourse Connective Omission , 2015 .

[22]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[23]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..