Annotating Discourse Connectives and Their Arguments

This paper describes a new, large scale discourse-level annotation project – the Penn Discourse TreeBank (PDTB). We present an approach to annotating a level of discourse structure that is based on identifying discourse connectives and their arguments. The PDTB is being built directly on top of the Penn TreeBank and Propbank, thus supporting the extraction of useful syntactic and semantic features and providing a richer substrate for the development and evaluation of practical algorithms. We provide a detailed preliminary analysis of inter-annotator agreement – both the level of agreement and the types of inter-annotator variation.

[1]  Martha Palmer,et al.  From TreeBank to PropBank , 2002, LREC.

[2]  E. Prince,et al.  Discourse semantics of s-modifying adverbials , 2003 .

[3]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[4]  William C. Mann,et al.  Rhetorical Structure Theory: A Framework for the Analysis of Texts , 1987 .

[5]  Alex Lascarides,et al.  The Semantics and Pragmatics of Presupposition , 1998, J. Semant..

[6]  CarlettaJean Assessing agreement on classification tasks , 1996 .

[7]  Daniel Marcu,et al.  Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory , 2001, SIGDIAL Workshop.

[8]  Matthew Stone,et al.  Discourse Relations: A Structural and Presuppositional Account Using Lexicalised TAG , 1999, ACL.

[9]  Alistair Knott,et al.  A data-driven methodology for motivating a set of coherence relations , 1996 .

[10]  Bonnie Webber,et al.  What are Little Texts Made Of? A Structural and Presuppositional Account Using Lexicalised TAG , 1999 .

[11]  Livia Polanyi,et al.  Discourse Structure and Discourse Interpretation , 1997 .

[12]  John A. Bateman,et al.  Rhetorical structure theory , 2006 .

[13]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[14]  Bonnie Webber,et al.  Multiple Discourse Connectives in a Lexicalized Grammar For Discourse , 2001 .

[15]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[16]  Matthew Stone,et al.  Anaphora and Discourse Structure , 2001, CL.

[17]  Bonnie L. Webber,et al.  Anchoring a Lexicalized Tree-Adjoining Grammar for Discourse , 1998, ArXiv.

[18]  Bonnie Webber,et al.  Anaphoric arguments of discourse connectives: Semantic properties of antecedents versus non-antecedents , 2003 .