论文信息 - Data-Driven Semantic Analysis for Multilingual WSD and Lexical Selection in Translation

Data-Driven Semantic Analysis for Multilingual WSD and Lexical Selection in Translation

A common way of describing the senses of ambiguous words in multilingual Word Sense Disambiguation (WSD) is by reference to their translation equivalents in another language. The theoretical soundness of the senses induced in this way can, however, be doubted. This type of cross-lingual sense identification has implications for multilingual WSD and MT evaluation as well. In this article, we first present some arguments in favour of a more thorough analysis of the semantic information that may be induced by the equivalents of ambiguous words found in parallel corpora. Then, we present an unsupervised WSD method and a lexical selection method that exploit the results of a data-driven sense induction method. Finally, we show how this automatically acquired information can be exploited for a multilingual WSD and MT evaluation more sensitive to lexical semantics.

Marianna Apidianaki | Marianna Apidianaki

[1] Philip Resnik,et al. Exploiting Hidden Meanings: Using Bilingual Text for Monolingual Annotation , 2004, CICLing.

[2] Ron Artstein,et al. Survey Article: Inter-Coder Agreement for Computational Linguistics , 2008, CL.

[3] Helmut Schmidt,et al. Probabilistic part-of-speech tagging using decision trees , 1994 .

[4] Marine Carpuat,et al. Improving Statistical Machine Translation Using Word Sense Disambiguation , 2007, EMNLP.

[5] Stelios Piperidis,et al. Building Parallel Corpora for eContent Professionals , 2004 .

[6] Andy Way,et al. Labelled Dependencies in Machine Translation Evaluation , 2007, WMT@ACL.

[7] ResnikPhilip,et al. Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation , 1999 .

[8] Philipp Koehn,et al. Re-evaluating the Role of Bleu in Machine Translation Research , 2006, EACL.

[9] Robert L. Mercer,et al. Word-Sense Disambiguation Using Statistical Methods , 1991, ACL.

[10] Gregory Grefenstette,et al. Explorations in automatic thesaurus discovery , 1994 .

[11] G. Miller,et al. Contextual correlates of semantic similarity , 1991 .