Stand-off TEI Annotation: the Case of the National Corpus of Polish
暂无分享,去创建一个
We present the annotation architecture of the National Corpus of Polish and discuss problems identified in the TEI stand-off annotation system, which, in its current version, is still very much unfinished and untested, due to both technical reasons (lack of tools implementing the TEI-defined XPointer schemes) and certain problems concerning data representation. We concentrate on two features that a stand-off system should possess and that are conspicuously missing in the current TEI Guidelines.
[1] Adam Przepiórkowski,et al. Towards the National Corpus of Polish , 2008, LREC.
[2] Laurent Romary,et al. Towards International Standards for Language Resources , 2007 .