论文信息 - Chunking with WPDV Models

Chunking with WPDV Models

In this paper I describe the application of the WPDV algorithm to the CoNLL-2000 shared task, the identification of base chunks in English text (Tjong Kim Sang and Buchholz, 2000). For this task, I use a three-stage architecture: I first run five different base chunkers, then combine them and finally try to correct some recurring errors. Except for one base chunker, which uses the memory-based machine learning system TiMBL, all modules are based on WPDV models (van Halteren, 2000a).

Hans van Halteren

[1] Walter Daelemans,et al. Improving Accuracy in word class tagging through the Combination of Machine Learning Systems , 2001, CL.

[2] Hans van Halteren,et al. Improving Data Driven Wordclass Tagging by System Combination , 1998, ACL.

[3] Sabine Buchholz,et al. Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.

[4] Erik F. Tjong Kim Sang,et al. Memory-Based Shallow Parsing , 2002, J. Mach. Learn. Res..

[5] Erik F. Tjong Kim Sang,et al. Noun Phrase Recognition by System Combination , 2000, ANLP.

[6] Hans van Halteren,et al. The Detection of Inconsistency in Manually Tagged Text , 2000, COLING 2000.

[7] Hans van Halteren,et al. A Default First Order Family Weight Determination Procedure for WPDV Models , 2000, CoNLL/LLL.

[8] Walter Daelemans,et al. Cascaded Grammatical Relation Assignment , 1999, EMNLP.