论文信息 - Evaluating knowledge-poor and knowledge-rich features in automatic classification: A case study in WSD

Evaluating knowledge-poor and knowledge-rich features in automatic classification: A case study in WSD

Word Sense Disambiguation (WSD) is a fundamental task in many Computational Linguistics applications. It consists of automatically identifying the sense of ambiguous words in context using computational methods. This work evaluates the automatic disambiguation performance of five machine learning classifiers: Naive Bayes, Support Vector Machines, Decision Trees, KStar and Maximum Entropy. For the classification we compare the performance of these algorithms using knowledge-rich and knowledge-poor features applied to Portuguese data.

M. Zampieri | Marcos Zampieri

[1] Hwee Tou Ng,et al. Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[2] Ted Pedersen,et al. An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet , 2002, CICLing.

[3] Helmut Schmidt,et al. Probabilistic part-of-speech tagging using decision trees , 1994 .

[4] Averil Coxhead. A New Academic Word List , 2000 .

[5] Jorge Baptista,et al. P-AWL: Academic Word List for Portuguese , 2010, PROPOR.

[6] Hinrich Schütze,et al. Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[7] Eric Brill,et al. Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[8] Petr Sgall,et al. Graeme Hirst. Semantic interpretation and the resolution of ambiguity , 1989 .

[9] Diana McCarthy,et al. Text Categorization for Improved Priors of Word Meaning , 2007, CICLing.

[10] Mirella Lapata,et al. Graph Connectivity Measures for Unsupervised Word Sense Disambiguation , 2007, IJCAI.

[11] Graeme Hirst,et al. Semantic Interpretation and the Resolution of Ambiguity , 1987, Studies in natural language processing.