Easily Identifiable Discourse Relations

We present a corpus study of local discourse relations based on the Penn Discourse Tree Bank, a large manually annotated corpus of explicitly or implicitly realized relations. We show that while there is a large degree of ambiguity in temporal explicit discourse connectives, overall connectives are mostly unambiguous and allow high-accuracy prediction of discourse relation type. We achieve 93.09% accuracy in classifying the explicit relations and 74.74% accuracy overall. In addition, we show that some pairs of relations occur together in text more often than expected by chance. This finding suggests that global sequence classification of the relations in text can lead to better results, especially for implicit relations.

[1]  Jan Alexandersson,et al.  Proceedings of the 7th SIGdial Workshop on Discourse and Dialogue , 2009, SIGDIAL 2009.

[2]  Mirella Lapata,et al.  Modeling Local Coherence: An Entity-Based Approach , 2005, ACL.

[3]  Mirella Lapata,et al.  Inferring Sentence-internal Temporal Relations , 2004, NAACL.

[4]  Livio Robaldo,et al.  The Penn Discourse Treebank 2.0 Annotation Manual , 2007 .

[5]  Johanna D. Moore,et al.  Discourse in Computational Linguistics and Artificial Intelligence , 2003 .

[6]  Livio Robaldo,et al.  The Penn Discourse TreeBank 2.0. , 2008, LREC.

[7]  Jerry R. Hobbs Coherence and Coreference , 1979, Cogn. Sci..

[8]  Alex Lascarides,et al.  Edinburgh Research Explorer Using automatically labelled examples to classify rhetorical relations: an assessment , 2022 .

[9]  James Pustejovsky,et al.  Classification of Discourse Coherence Relations: An Exploratory Study using Multiple Knowledge Sources , 2006, SIGDIAL Workshop.

[10]  Owen Rambow,et al.  Building and Refining Rhetorical-Semantic Relation Models , 2007, HLT-NAACL.

[11]  B. Webber,et al.  Experiments on Sense Annotations and Sense Disambiguation of Discourse Connectives , 2005 .

[12]  Alex Lascarides,et al.  Exploiting Linguistic Cues to Classify Rhetorical Relations , 2005 .

[13]  T. Sanders,et al.  The classification of coherence relations and their linguistic markers: An exploration of two languages , 1998 .

[14]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[15]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[16]  Livio Robaldo,et al.  Sense Annotation in the Penn Discourse Treebank , 2008, CICLing.

[17]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.

[18]  Kathleen McKeown,et al.  Text generation: using discourse strategies and focus constraints to generate natural language text , 1985 .