The vast amount of information available on the Web creates a need for effective information extraction systems that can produce metadata satisfying users' information needs. In most cases, the development of such systems depends on the availability of an appropriately annotated corpus from which extraction models can be learned or against which they can be evaluated. The production of such corpora can be significantly facilitated by annotation tools that provide user-friendly facilities and enable annotators to annotate documents according to a predefined annotation schema. However, constructing annotation tools that operate in a distributed environment is a challenging task: the majority of these tools are implemented as Web applications and are therefore constrained by the capabilities that browsers offer. This paper describes the SYNC3 collaborative annotation tool, which implements an alternative architecture: it remains a desktop application, fully exploiting the advantages of desktop applications, but provides collaborative annotation through a centralised server that stores both the documents and their metadata, and through instant messaging protocols that communicate events among all annotators. The annotation tool is implemented as a component of the Ellogon language engineering platform, exploiting its extensive annotation engine, its cross-platform abilities and, where needed, its linguistic processing components. Finally, the SYNC3 annotation tool is distributed under an open source license, as part of the Ellogon platform.
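The architecture outlined above (desktop clients, a centralised store for documents and metadata, and a messaging channel that relays events among annotators) can be illustrated with a minimal in-memory sketch. This is not the SYNC3 or Ellogon implementation: the names `CentralStore`, `EventChannel`, `DesktopClient` and `Annotation` are hypothetical, and a real deployment would replace the in-process calls with a network server and an instant messaging protocol.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Annotation:
    """A labelled character span in a document (hypothetical schema)."""
    doc_id: str
    start: int
    end: int
    label: str
    author: str


@dataclass
class CentralStore:
    """Stand-in for the centralised server holding documents and metadata."""
    documents: Dict[str, str] = field(default_factory=dict)
    annotations: Dict[str, List[Annotation]] = field(default_factory=dict)

    def add_annotation(self, ann: Annotation) -> None:
        text = self.documents[ann.doc_id]
        # Reject spans that fall outside the stored document text.
        if not (0 <= ann.start < ann.end <= len(text)):
            raise ValueError("annotation span outside document")
        self.annotations.setdefault(ann.doc_id, []).append(ann)


class EventChannel:
    """Stand-in for the messaging layer that relays events to annotators."""

    def __init__(self) -> None:
        self._subscribers: Dict[str, Callable[[Annotation], None]] = {}

    def join(self, user: str, callback: Callable[[Annotation], None]) -> None:
        self._subscribers[user] = callback

    def publish(self, sender: str, ann: Annotation) -> None:
        # Notify every annotator except the one who produced the event.
        for user, callback in self._subscribers.items():
            if user != sender:
                callback(ann)


class DesktopClient:
    """Stand-in for one annotator's desktop application."""

    def __init__(self, user: str, store: CentralStore, channel: EventChannel) -> None:
        self.user = user
        self.store = store
        self.channel = channel
        self.seen: List[Annotation] = []  # events received from other annotators
        channel.join(user, self.seen.append)

    def annotate(self, doc_id: str, start: int, end: int, label: str) -> Annotation:
        ann = Annotation(doc_id, start, end, label, self.user)
        self.store.add_annotation(ann)         # persist centrally first
        self.channel.publish(self.user, ann)   # then notify the other annotators
        return ann


store = CentralStore(documents={"doc1": "Athens is the capital of Greece."})
channel = EventChannel()
alice = DesktopClient("alice", store, channel)
bob = DesktopClient("bob", store, channel)
alice.annotate("doc1", 0, 6, "LOCATION")
```

In this sketch the annotation is persisted before the event is published, so a client that receives the notification can assume the central store already reflects the change; whether SYNC3 orders these steps the same way is an assumption.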