Annotation Procedure in Building the Prague Czech-English Dependency Treebank
暂无分享,去创建一个
In this paper, we present some organizational aspects of building of a large corpus with rich linguistic annotation, while Prague Czech-English Dependency Treebank (PCEDT) serves as an example. We stress the necessity to divide the annotation process into several well planed phases. We present a system of automatic checking of the correctness of the annotation and describe several ways to measure and evaluate the annotation and annotators (inter-annotator accord, error rate and performance).
[1] Jan Hajic,et al. The Prague Dependency Treebank , 2003 .
[2] Václav Klimeš. Analytical and Tectogrammatical Analysis of a Natural Language , 2006 .