Paper information - A modular approach to learning Dutch co-reference

This paper presents the first machine learning approach to the resolution of co-referential relations between nominal constituents in Dutch. Based on the hypothesis that different types of information sources contribute to a correct resolution of different types (pronominal, proper noun and common noun) of co-referential links, we propose a modular approach in which a separate module is trained per NP type. We present a thorough comparison of two machine learning techniques, a lazy learner and an eager learning approach, trained on the modular tasks as well as on the undecomposed task. In addition, we show that by postprocessing the resulting co-reference chains by means of a string-edit distance correction mechanism, we can avoid some unlikely local chainings and thereby improve precision. Lacking comparative results for Dutch, we also report results on the English MUC-6 and MUC-7 data sets, which are widely used for evaluation.

Antal van den Bosch | Veronique Hoste
