Kannada Word Sense Disambiguation Using Association Rules

Disambiguating the polysemous word is one of the major issues in the process of Machine Translation. The word may have many senses, selecting the most appropriate sense for an ambiguous word in a sentence is a central problem in Machine Translation. Because, each sense of a word in a source language sentence may generate different target language sentences. Knowledge and corpus based methods are usually applied for disambiguation task. In the present paper, we propose an algorithm to disambiguate Kannada polysemous words using association rules. We built Kannada corpora using web resources. The corpora are divided in to training and testing corpora. The association rules required for disambiguation tasks are extracted from training corpora. The example sentences needs to be disambiguated are stored in testing corpora. The proposed algorithm attempts to disambiguate all the content words such as nouns, verbs, adverbs, adjectives in an unrestricted text using association rules.

[1]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[2]  David Yarowsky,et al.  Hierarchical Decision Lists for Word Sense Disambiguation , 2000, Comput. Humanit..

[3]  John Sinclair,et al.  Corpus, Concordance, Collocation , 1991 .

[4]  Michael Barlow Corpora for theory and practice , 1996 .

[5]  Seyed Mostafa Fakhrahmad,et al.  A new fuzzy rule-based classification system for word sense disambiguation , 2012, Intell. Data Anal..

[6]  Hwee Tou Ng,et al.  Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[7]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[8]  Eneko Agirre,et al.  Syntactic Features for High Precision Word Sense Disambiguation , 2002, COLING.

[9]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[10]  Rada Mihalcea,et al.  Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling , 2005, HLT.

[11]  Niladri Sekhar Dash,et al.  Relevance of corpus in language research and application , 2003 .

[12]  Yorick Wilks,et al.  Word Sense Disambiguation using Optimised Combinations of Knowledge Sources , 1998, COLING-ACL.

[13]  Rada Mihalcea,et al.  A Method for Word Sense Disambiguation of Unrestricted Text , 1999, ACL.

[14]  David Yarowsky,et al.  DECISION LISTS FOR LEXICAL AMBIGUITY RESOLUTION: Application to Accent Restoration in Spanish and French , 1994, ACL.

[15]  David Yarowsky,et al.  Estimating Upper and Lower Bounds on the Performance of Word-Sense Disambiguation Programs , 1992, ACL.

[16]  H. Schütze,et al.  Dimensions of meaning , 1992, Supercomputing '92.

[17]  Paola Velardi,et al.  Structural semantic interconnections: a knowledge-based approach to word sense disambiguation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Eneko Agirre,et al.  Word Sense Disambiguation using Conceptual Density , 1996, COLING.

[19]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.