Jaccard coefficient-based word sense disambiguation using hybrid knowledge resources

Word Sense Disambiguation (WSD) has become a popular method for solving the ambiguous meaning of the words in Information Retrieval (IR) field area. Under the Natural Language Processing (NLP) community, WSD has been described as the task which able to select the appropriate meaning among the ambiguous meanings to a given word. Among three approaches, supervised based, unsupervised based and knowledge based approaches to WSD, this paper focuses on both supervised based and knowledge based approaches by proposing new Jaccard coefficient-based WSD algorithm to overcome the vocabulary miss match problem. WordNet and corpus external knowledge resources are utilized as the sense repositories by linking up with the new WSD algorithm to consider additional semantic for WSD. According to sample testing, IR system with new WSD algorithm attains more about 20 percent of total accuracy rate than traditional IR system.

[1]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[2]  Aleksander Smywinski-Pohl,et al.  Improving the Wikipedia Miner word sense disambiguation algorithm , 2012, 2012 Federated Conference on Computer Science and Information Systems (FedCSIS).

[3]  Vlado Keselj,et al.  Comparing Word Relatedness Measures Based on Google $n$-grams , 2012, COLING.

[4]  Lakhmi C. Jain,et al.  Knowledge-Based Intelligent Information and Engineering Systems, 9th International Conference, KES 2005, Melbourne, Australia, September 14-16, 2005, Proceedings, Part I , 2005, KES.

[5]  Ted Pedersen,et al.  An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet , 2002, CICLing.

[6]  Ms.D Subarani Concept Based Information Retrieval from Text Documents , 2012 .

[7]  Lynda Tamine,et al.  Sense-Based Biomedical Indexing and Retrieval , 2010, NLDB.

[8]  Rada Mihalcea,et al.  Semantic Indexing using WordNet Senses , 2000 .

[9]  共立出版株式会社 コンピュータ・サイエンス : ACM computing surveys , 1978 .

[10]  Jianying Wang,et al.  A corpus analysis approach for automatic query expansion , 1997, CIKM '97.

[11]  Ying Liu,et al.  Using WordNet to Disambiguate Word Senses for Text Classification , 2007, International Conference on Computational Science.

[12]  Stephen E. Robertson Evaluation in Information Retrieval , 2000, ESSIR.

[13]  John Tait,et al.  Word sense disambiguation in information retrieval revisited , 2003, SIGIR.

[14]  Georgios C. Anagnostopoulos,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2003, Lecture Notes in Computer Science.

[15]  E. Rasmussen Evaluation in Information Retrieval , 2002 .