Annotating Arguments: The NOMAD Collaborative Annotation Tool

The huge amount of information available on the Web creates the need for effective information extraction systems that can produce metadata satisfying users' information needs. The development of such systems depends, in the majority of cases, on the availability of an appropriately annotated corpus for learning or evaluating extraction models. The production of such corpora can be significantly facilitated by annotation tools, which provide user-friendly facilities and enable annotators to annotate documents according to a predefined annotation schema. However, the construction of annotation tools that operate in a distributed environment is a challenging task: the majority of these tools are implemented as Web applications and are therefore constrained by the capabilities offered by browsers. This paper describes the NOMAD collaborative annotation tool, which implements an alternative architecture: it remains a desktop application, fully exploiting the advantages of desktop applications, but provides collaborative annotation through a centralised server that stores both the documents and their metadata, and through instant messaging protocols that communicate events among all annotators. The annotation tool is implemented as a component of the Ellogon language engineering platform, exploiting its extensive annotation engine, its cross-platform abilities and, should the need arise, its linguistic processing components. Finally, the NOMAD annotation tool is distributed under an open source license, as part of the Ellogon platform.
