QADiscourse - Discourse Relations as QA Pairs: Representation, Crowdsourcing and Baselines

Discourse relations describe how two propositions relate to one another, and identifying them automatically is an integral part of natural language understanding. However, annotating discourse relations typically requires expert annotators. Recently, different semantic aspects of a sentence have been represented and crowdsourced via question-and-answer (QA) pairs. This paper proposes a novel representation of discourse relations as QA pairs, which in turn allows us to crowdsource wide-coverage data annotated with discourse relations through an intuitive interface for composing such questions and answers. Based on this representation, we collect a wide-coverage QADiscourse dataset and present baseline algorithms for predicting QADiscourse relations.
