论文信息 - Combining Heterogeneous Classifiers for Word Sense Disambiguation

Combining Heterogeneous Classifiers for Word Sense Disambiguation

This paper discusses ensembles of simple but heterogeneous classifiers for word-sense disambiguation, examining the Stanford-CS224N system entered in the SENSEVAL-2 English lexical sample task. First-order classifiers are combined by a second-order classifier, which variously uses majority voting, weighted voting, or a maximum entropy model. While individual first-order classifiers perform comparably to middle-scoring teams' systems, the combination achieves high performance. We discuss trade-offs and empirical performance. Finally, we present an analysis of the combination, examining how ensemble performance depends on error independence and task difficulty.

[1] J. Mesirov,et al. Hybrid system for protein secondary structure prediction. , 1992, Journal of molecular biology.

[2] Raymond J. Mooney,et al. Comparative Experiments on Disambiguating Word Senses: An Illustration of the Role of Bias in Machine Learning , 1996, EMNLP.

[3] Kagan Tumer,et al. Error Correlation and Error Reduction in Ensemble Classifiers , 1996, Connect. Sci..

[4] Adam L. Berger,et al. A Maximum Entropy Approach to Natural Language Processing , 1996, CL.