A Top-down Neural Architecture towards Text-level Parsing of Discourse Rhetorical Structure

Due to its great importance in deep natural language understanding and various down-stream applications, text-level parsing of discourse rhetorical structure (DRS) has been drawing more and more attention in recent years. However, all the previous studies on text-level discourse parsing adopt bottom-up approaches, which much limit the DRS determination on local information and fail to well benefit from global information of the overall discourse. In this paper, we justify from both computational and perceptive points-of-view that the top-down architecture is more suitable for text-level DRS parsing. On the basis, we propose a top-down neural architecture toward text-level DRS parsing. In particular, we cast discourse parsing as a recursive split point ranking task, where a split point is classified to different levels according to its rank and the elementary discourse units (EDUs) associated with it are arranged accordingly. In this way, we can determine the complete DRS as a hierarchical tree structure via an encoder-decoder with an internal stack. Experimentation on both the English RST-DT corpus and the Chinese CDTB corpus shows the great effectiveness of our proposed top-down approach towards text-level DRS parsing.

[1]  Zhu Kunhua,et al.  Research of Chinese Clause Identificiton Based on Comma , 2013 .

[2]  Graeme Hirst,et al.  A Linear-Time Bottom-Up Discourse Parser with Constraints and Post-Editing , 2014, ACL.

[3]  Jacob Eisenstein,et al.  A Joint Model of Rhetorical Discourse Structure and Summarization , 2016, SPNLP@EMNLP.

[4]  Anders Søgaard,et al.  Cross-lingual RST Discourse Parsing , 2017, EACL.

[5]  Lidong Bing,et al.  Hierarchical Pointer Net Parsing , 2019, EMNLP/IJCNLP.

[6]  Timothy Dozat,et al.  Deep Biaffine Attention for Neural Dependency Parsing , 2016, ICLR.

[7]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[8]  Fang Kong,et al.  Towards Better Chinese Zero Pronoun Resolution from Discourse Perspective , 2017, Natural Language Processing and Chinese Computing.

[9]  Nan Yu,et al.  Transition-based Neural RST Parsing with Implicit Syntax Features , 2018, COLING.

[10]  Liang Wang,et al.  Text-level Discourse Dependency Parsing , 2014, ACL.

[11]  Nicholas Asher,et al.  A Dependency Perspective on RST Discourse Parsing and Evaluation , 2018, CL.

[12]  Sheng Cheng,et al.  Towards Better Chinese Zero Pronoun Resolution from Discourse Perspective , 2017 .

[13]  Fang Kong,et al.  Building Chinese Discourse Corpus with Connective-driven Dependency Tree Structure , 2014, EMNLP.

[14]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[15]  Houfeng Wang,et al.  A Two-Stage Parsing Method for Text-Level Discourse Analysis , 2017, ACL.

[16]  Qi Li,et al.  Discourse Parsing with Attention-based Hierarchical Neural Networks , 2016, EMNLP.

[17]  Maki Watanabe,et al.  Discourse Tagging Reference Manual , 2001 .

[18]  Mitsuru Ishizuka,et al.  HILDA: A Discourse Parser Using Support Vector Machine Classification , 2010, Dialogue Discourse.

[19]  Noah A. Smith,et al.  Neural Discourse Structure for Text Categorization , 2017, ACL.

[20]  Livio Robaldo,et al.  The Penn Discourse TreeBank 2.0. , 2008, LREC.

[21]  Shafiq R. Joty,et al.  Combining Intra- and Multi-sentential Rhetorical Parsing for Document-level Discourse Analysis , 2013, ACL.

[22]  Shen Li,et al.  Revisiting Correlations between Intrinsic and Extrinsic Evaluations of Word Embeddings , 2018, CCL.

[23]  Kenji Sagae,et al.  Fast Rhetorical Structure Theory Discourse Parsing , 2015, ArXiv.

[24]  Shafiq R. Joty,et al.  A Unified Linear-Time Framework for Sentence-Level Discourse Parsing , 2019, ACL.

[25]  Jing Li,et al.  SegBot: A Generic Neural Text Segmentation Model with Pointer Network , 2018, IJCAI.

[26]  Barbara Plank,et al.  Multi-view and multi-task training of RST discourse parsers , 2016, COLING.

[27]  Rashmi Prasad,et al.  The Penn Discourse Treebank , 2004, LREC.

[28]  Jacob Eisenstein,et al.  Representation Learning for Text-level Discourse Parsing , 2014, ACL.

[29]  Eduard H. Hovy,et al.  Recursive Deep Models for Discourse Parsing , 2014, EMNLP.

[30]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[31]  Eunsol Choi,et al.  Document-level Sentiment Inference with Social, Faction, and Discourse Context , 2016, ACL.