Turkish Discourse Bank: Porting a discourse annotation style to a morphologically rich language

This paper briefly describes the Turkish Discourse Bank, the first publicly available annotated discourse resource for Turkish. It focuses on the challenges posed by annotating Turkish, a free word order language with rich inflectional and derivational morphology. It shows the usefulness of the PDTB style annotation but points out the need to expand this annotation style with the needs of the target language.

[1]  Rashmi Prasad,et al.  Realization of Discourse Relations by Other Means: Alternative Lexicalizations , 2010, COLING.

[2]  Livia Polanyi,et al.  Discourse Structure and Discourse Interpretation , 1997 .

[3]  Bonnie L. Webber,et al.  Discourse structure and language technology , 2011, Natural Language Engineering.

[4]  Ann Bies,et al.  The Penn Treebank: Annotating Predicate Argument Structure , 1994, HLT.

[5]  Johanna D. Moore,et al.  Toward a Synthesis of Two Accounts of Discourse Structure , 1996, CL.

[6]  Wojciech Skut,et al.  An Annotation Scheme for Free Word Order Languages , 1997, ANLP.

[7]  Livio Robaldo,et al.  The Penn Discourse Treebank 2.0 Annotation Manual , 2007 .

[8]  Deniz Zeyrek,et al.  Pair Annotation: Adaption of Pair Programming to Corpus Annotation , 2012, LAW@ACL.

[9]  Deniz Zeyrek,et al.  Discourse Relation Configurations in Turkish and an Annotation Environment , 2010, Linguistic Annotation Workshop.

[10]  Matthew Stone,et al.  Anaphora and Discourse Structure , 2001, CL.

[11]  B. Webber,et al.  A Short Introduction to the Penn Discourse TreeBank , 2005 .

[12]  Manfred Stede,et al.  Machine-Assisted Rhetorical Structure Annotation , 2004, COLING.

[13]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[14]  Rashmi Prasad,et al.  The Penn Discourse Treebank , 2004, LREC.

[15]  Bonnie L. Webber,et al.  Computing Discourse Semantics: The Predicate-Argument Semantics of Discourse Connectives in D-LTAG , 2005, J. Semant..

[16]  Deniz Zeyrek,et al.  The Annotation Scheme of the Turkish Discourse Bank and an Evaluation of Inconsistent Annotations , 2010, Linguistic Annotation Workshop.

[17]  Candace L. Sidner,et al.  Attention, Intentions, and the Structure of Discourse , 1986, CL.

[18]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[19]  Nicholas Asher,et al.  Reference to abstract objects in discourse , 1993, Studies in linguistics and philosophy.

[20]  Abanhsan Yalçinkaya AN INTER-ANNOTATOR AGREEMENT MEASUREMENT METHODOLOGY FOR THE TURKISH DISCOURSE BANK (TDB) , 2010 .

[21]  C. Lehmann Towards a typology of clause linkage , 1988 .

[22]  Dilek Z. Hakkani-Tür,et al.  Building a Turkish Treebank , 2003 .

[23]  Ron Artstein,et al.  Survey Article: Inter-Coder Agreement for Computational Linguistics , 2008, CL.

[24]  Deniz Zeyrek,et al.  Turkish Discourse Bank : Ongoing Developments , 2012 .

[25]  Alex Lascarides,et al.  Logics of Conversation , 2005, Studies in natural language processing.