A Hybrid Model for Word Sense Disambiguation in English-Portuguese Machine Translation

We propose an approach to word sense disambiguation for machine translation from English to Brazilian Portuguese. The approach is hybrid: it combines knowledge-based and corpus-based natural language processing methods. Its main innovation is the formalism we intend to use to represent the instances and the background knowledge. Unlike other approaches to word sense disambiguation, which rely on restricted propositional formalisms, we will employ a relational one. As a consequence, we expect the resulting system to overcome the limitations of current work and to deal effectively with lexical semantic ambiguity in machine translation.
