A Survey of Word-sense Disambiguation Effective Techniques and Methods for Indian Languages

Word Sense Disambiguation is a challenging technique in Natural Language Processing. There are some words in the natural languages which can cause ambiguity about the sense of the word.WSD identifies the correct sense of the word in a sentence or a document. The paper summarizes about the history of WSD. We have discussed about the knowledge - based and machine learning - based approaches for WSD. Various supervised learning and unsupervised learning techniques have been discussed. WSD is mainly used in Information Retrieval (IR), Information Extraction (IE), Machine Translation (MT), Content Analysis, Word Processing, Lexicography and Semantic Web. Finally, we have discussed about WSD for Indian languages (Hindi, Malayalam, and Kannada) and other languages (Chinese, Mongolian, Polish, Turkish, English, Myanmar, Arabic, Nepali, Persian, Dutch, and Italian).

[1]  Sunil G. Bhirud,et al.  Word Sense Disambiguation Using WordNet Domains , 2008, 2008 First International Conference on Emerging Trends in Engineering and Technology.

[2]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[3]  Rosna P Haroon,et al.  Malayalam word sense disambiguation , 2010, 2010 IEEE International Conference on Computational Intelligence and Computing Research.

[4]  Sivaji Bandyopadhyay,et al.  Word Sense Disambiguation Using Extended WordNet , 2007, 2007 International Conference on Computing: Theory and Applications (ICCTA'07).

[5]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[6]  Eneko Agirre,et al.  Word Sense Disambiguation: Algorithms and Applications (Text, Speech and Language Technology) , 2006 .

[7]  Wojdan Alsaeedan,et al.  Genetic Algorithm for Arabic Word Sense Disambiguation , 2012, 2012 13th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing.

[8]  Jian Zhang,et al.  Word sense disambiguation of the English modal verb may by back propagation neural network , 2008, 2008 International Conference on Natural Language Processing and Knowledge Engineering.

[9]  Sivaji Bandyopadhyay,et al.  A DECISION TREE BASED WORD SENSE DISAMBIGUATION SYSTEM IN MANIPURI LANGUAGE , 2014 .

[10]  Xu Li,et al.  An improved unsupervised learning probabilistic model of word sense disambiguation , 2012, 2012 World Congress on Information and Communication Technologies.

[11]  Danushka Bollegala,et al.  A Context Expansion Method for Supervised Word Sense Disambiguation , 2012, 2012 IEEE Sixth International Conference on Semantic Computing.

[12]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[13]  Julio Gonzalo,et al.  The SemEval-2007 WePS Evaluation: Establishing a benchmark for the Web People Search Task , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[14]  W. Bruce Croft,et al.  Lexical ambiguity and information retrieval , 1992, TOIS.

[15]  Edwin Mit,et al.  Word Sense Disambiguation by using domain knowledge , 2011, 2011 International Conference on Semantic Technology and Information Retrieval.

[16]  G. Rigau,et al.  Combining Knowledge- and Corpus-based Word-Sense-Disambiguation Methods , 2005, J. Artif. Intell. Res..

[17]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[18]  Zhu Shu,et al.  A New Approach to Word Sense Disambiguation in MT System , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[19]  Prity Bala,et al.  Word Sense Disambiguation Using Selectional Restriction , 2013 .

[20]  Yukiko Sasaki Alam Lexical-Semantic Representation of the Lexicon for Word Sense Disambiguation and Text Understanding , 2009, 2009 IEEE International Conference on Semantic Computing.

[21]  Rakesh Kumar,et al.  Natural Language Engineering: The Study of Word Sense Disambiguation in Punjabi , 2011 .

[22]  Rada Mihalcea,et al.  Unsupervised Graph-basedWord Sense Disambiguation Using Measures of Word Semantic Similarity , 2007 .

[23]  Jingbo Zhu,et al.  Active Learning With Sampling by Uncertainty and Density for Data Annotations , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[24]  Jean Véronis,et al.  HyperLex: lexical cartography for information retrieval , 2004, Comput. Speech Lang..