Multiple hierarchies: new aspects of an old solution. Re-published

In this paper, we present the Multiple Annotation approach, which addresses two problems: annotating overlapping structures, and annotating documents according to different, possibly heterogeneous tag sets. The approach has several advantages: it is based on XML, it allows alternative annotations to be modeled, each annotation level can be viewed separately, and new levels can be added at any time. The annotation files can be regarded as an interrelated unit, with the text they share serving as the implicit link between them. Two representations of the information contained in the multiple files are described, one in Prolog and one in XML. These representations serve as a basis for several applications.
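As a rough illustration of the idea that the text itself links the annotation levels, the following Python sketch keeps two XML layers over the same text, derives character offsets for each element, and reports elements that overlap across layers. It is not the Prolog or XML representation described in the paper; the sample text, the tag names (`line`, `s`) and the helper names (`spans`, `overlaps`) are assumptions made for this example only.

```python
import xml.etree.ElementTree as ET

# Two hypothetical annotation layers over the same primary text.
VERSE = "<text><line>He left. She stayed</line> <line>and wept.</line></text>"
SYNTAX = "<text><s>He left.</s> <s>She stayed and wept.</s></text>"


def spans(xml_string):
    """Return (plain_text, [(tag, start, end), ...]) for one layer."""
    root = ET.fromstring(xml_string)
    found = []

    def walk(elem, pos):
        # Character offset where this element's content begins.
        start = pos
        pos += len(elem.text or "")
        for child in elem:
            pos = walk(child, pos)
            # Text following the child still belongs to this element.
            pos += len(child.tail or "")
        found.append((elem.tag, start, pos))
        return pos

    walk(root, 0)
    return "".join(root.itertext()), found


def overlaps(a, b):
    """True if two spans cross without one containing the other."""
    (_, s1, e1), (_, s2, e2) = a, b
    return s1 < s2 < e1 < e2 or s2 < s1 < e2 < e1


verse_text, verse_spans = spans(VERSE)
syntax_text, syntax_spans = spans(SYNTAX)

# The layers can only be related if they annotate identical text.
assert verse_text == syntax_text

crossing = [(a, b) for a in verse_spans for b in syntax_spans if overlaps(a, b)]
print(crossing)
```

Each layer stays a well-formed XML document that can be viewed on its own, and a further layer could be added by calling `spans` on another file over the same text. The overlap between the first verse line and the second sentence only becomes visible once both layers are projected onto shared character offsets.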
