TreeAnnotator: Versatile Visual Annotation of Hierarchical Text Relations

We introduce TREEANNOTATOR, a graphical tool for annotating tree-like structures, in particular structures that jointly map dependency relations and inclusion hierarchies, as used by Rhetorical Structure Theory (RST). TREEANNOTATOR is browser-based, embedded within the UIMA framework, and provides two visualization modes. Its interoperability exceeds that of similar tools because it supports a wider range of formats, and a revised input method for RST dependency relations allows annotation work to be completed more quickly. TREEANNOTATOR offers a multi-window view, which allows users to inspect several annotations side by side. For storing and versioning annotations, the UIMA Database Interface (UIMA DI) was developed to save documents based on a pre-defined type system. These features not only connect TREEANNOTATOR annotations to modern technological and dialog-theoretical work, but also set it apart from related tools. The ease of use of TREEANNOTATOR and its newly designed user interface is evaluated in a user study that consists of annotating rhetorical relations with TREEANNOTATOR and the classic RSTTool.
