论文信息 - Kannada Word Sense Disambiguation Using Association Rules

Kannada Word Sense Disambiguation Using Association Rules

Disambiguating the polysemous word is one of the major issues in the process of Machine Translation. The word may have many senses, selecting the most appropriate sense for an ambiguous word in a sentence is a central problem in Machine Translation. Because, each sense of a word in a source language sentence may generate different target language sentences. Knowledge and corpus based methods are usually applied for disambiguation task. In the present paper, we propose an algorithm to disambiguate Kannada polysemous words using association rules. We built Kannada corpora using web resources. The corpora are divided in to training and testing corpora. The association rules required for disambiguation tasks are extracted from training corpora. The example sentences needs to be disambiguated are stored in testing corpora. The proposed algorithm attempts to disambiguate all the content words such as nouns, verbs, adverbs, adjectives in an unrestricted text using association rules.

S. Parameswarappa | V. N. Narayana

[1] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[2] David Yarowsky,et al. Hierarchical Decision Lists for Word Sense Disambiguation , 2000, Comput. Humanit..

[3] John Sinclair,et al. Corpus, Concordance, Collocation , 1991 .

[4] Michael Barlow. Corpora for theory and practice , 1996 .

[5] Seyed Mostafa Fakhrahmad,et al. A new fuzzy rule-based classification system for word sense disambiguation , 2012, Intell. Data Anal..

[6] Hwee Tou Ng,et al. Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[7] G. Miller,et al. Contextual correlates of semantic similarity , 1991 .

[8] Eneko Agirre,et al. Syntactic Features for High Precision Word Sense Disambiguation , 2002, COLING.

[9] Tomasz Imielinski,et al. Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[10] Rada Mihalcea,et al. Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling , 2005, HLT.

[11] Niladri Sekhar Dash,et al. Relevance of corpus in language research and application , 2003 .