A Machine Learning Approach to Automatic Functor Assignment in the Prague Dependency Treebank
This paper describes and evaluates a system that automates part of the transition from analytical to tectogrammatical tree structures in the Prague Dependency Treebank. Specifically, it assigns functors to autosemantic words. The system is based on decision tree induction, a machine learning approach. The resulting software tool is incorporated into the annotation process and significantly reduces the manual effort of this transition, which otherwise consumes a large amount of linguistic experts' time.
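To make the idea of decision tree induction for functor assignment concrete, here is a minimal sketch. It is not the system described above: the feature names (`afun`, `pos`, `case`), the six training rows, and the helper functions are all hypothetical and hand-made for illustration, and the tree is grown with plain information gain (ID3 style) rather than with any particular tool.

```python
import math
from collections import Counter

# Toy, hand-made examples (NOT treebank data). Each node is described by
# hypothetical features and labeled with a functor.
TRAIN = [
    ({"afun": "Sb",  "pos": "N", "case": "1"}, "ACT"),
    ({"afun": "Sb",  "pos": "P", "case": "1"}, "ACT"),
    ({"afun": "Obj", "pos": "N", "case": "4"}, "PAT"),
    ({"afun": "Obj", "pos": "N", "case": "3"}, "ADDR"),
    ({"afun": "Atr", "pos": "A", "case": "1"}, "RSTR"),
    ({"afun": "Atr", "pos": "A", "case": "4"}, "RSTR"),
]

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def best_feature(rows, features):
    """Return the feature whose split yields the highest information gain."""
    base = entropy([y for _, y in rows])

    def gain(feature):
        remainder = 0.0
        for value in {x[feature] for x, _ in rows}:
            subset = [y for x, y in rows if x[feature] == value]
            remainder += len(subset) / len(rows) * entropy(subset)
        return base - remainder

    return max(features, key=gain)

def build_tree(rows, features):
    """Recursively grow a tree; leaves are functor labels (strings)."""
    labels = [y for _, y in rows]
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not features:
        return majority
    feature = best_feature(rows, features)
    remaining = [f for f in features if f != feature]
    children = {}
    for value in sorted({x[feature] for x, _ in rows}):
        subset = [(x, y) for x, y in rows if x[feature] == value]
        children[value] = build_tree(subset, remaining)
    # "default" is the fallback label for feature values unseen in training.
    return {"feature": feature, "default": majority, "children": children}

def classify(tree, node):
    """Walk the tree from the root until a leaf label is reached."""
    while isinstance(tree, dict):
        tree = tree["children"].get(node[tree["feature"]], tree["default"])
    return tree

TREE = build_tree(TRAIN, ["afun", "pos", "case"])
```

Calling `classify(TREE, {"afun": "Obj", "pos": "N", "case": "3"})` walks from the root split down to a leaf and returns a functor label. A realistic setup would draw its features and training examples from annotated trees and would likely need pruning to avoid overfitting, which this sketch omits.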
[1] Eva Hajičová. Dependency-based underlying-structure tagging of a very large Czech corpus, 2000.
[2] Markéta Lopatková, et al. Valency Dictionary of Czech Verbs: Complex Tectogrammatical Annotation, 2002, LREC.
[3] Saso Dzeroski, et al. A Machine Learning Approach to Automatic Functor Assignment in the Prague Dependency Treebank, 2002, LREC.
[4] J. Ross Quinlan, et al. C4.5: Programs for Machine Learning, 1992.