Syntax-Guided Sequence to Sequence Modeling for Discourse Segmentation

Previous studies on RST-style discourse segmentation have achieved impressive results. However, recent neural works either require a complex joint training process or heavily rely on powerful pre-trained word vectors. Under this condition, a simpler but more robust segmentation method is needed. In this work, we take a deeper look into intra-sentence dependencies to investigate if the syntax information is totally useless, or to what extent it can help improve the discourse segmentation performance. To achieve this, we propose a sequence-to-sequence model along with a GCN based encoder to well utilize intra-sentence dependencies and a multi-head biaffine attention based decoder to predict EDU boundaries. Experimental results on two benchmark corpora show that the syntax information we use is significantly useful and the resulting model is competitive when compared with the state-of-the-art.

[1]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[2]  Nicholas Asher,et al.  How much progress have we made on RST discourse parsing? A replication study of recent results on the RST-DT , 2017, EMNLP.

[3]  Diego Marcheggiani,et al.  Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling , 2017, EMNLP.

[4]  Fang Kong,et al.  Building Chinese Discourse Corpus with Connective-driven Dependency Tree Structure , 2014, EMNLP.

[5]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[6]  Zhen-Hua Ling,et al.  Enhancing Sentence Embedding with Generalized Pooling , 2018, COLING.

[7]  Guillaume Lample,et al.  Neural Architectures for Named Entity Recognition , 2016, NAACL.

[8]  Timothy Dozat,et al.  Deep Biaffine Attention for Neural Dependency Parsing , 2016, ICLR.

[9]  Yizhong Wang,et al.  Toward Fast and Accurate Neural Discourse Segmentation , 2018, EMNLP.

[10]  Daniel Marcu,et al.  Sentence Level Discourse Parsing using Syntactic and Lexical Information , 2003, NAACL.

[11]  Maki Watanabe,et al.  Discourse Tagging Reference Manual , 2001 .

[12]  Shafiq R. Joty,et al.  A Unified Linear-Time Framework for Sentence-Level Discourse Parsing , 2019, ACL.

[13]  Peifeng Li,et al.  A Top-down Neural Architecture towards Text-level Parsing of Discourse Rhetorical Structure , 2020, ACL.

[14]  Dongyan Zhao,et al.  Improved Discourse Parsing with Two-Step Neural Transition-Based Model , 2018, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[15]  Jing Li,et al.  SegBot: A Generic Neural Text Segmentation Model with Pointer Network , 2018, IJCAI.

[16]  Anders Søgaard,et al.  Does syntax help discourse segmentation? Not so much , 2017, EMNLP.

[17]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[18]  Shafiq R. Joty,et al.  CODRA: A Novel Discriminative Framework for Rhetorical Analysis , 2015, CL.

[19]  Brian Roark,et al.  The utility of parse-derived features for automatic discourse segmentation , 2007, ACL.

[20]  Daniel Marcu,et al.  Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory , 2001, SIGDIAL Workshop.