Fuzzy Hindi WordNet and Word Sense Disambiguation Using Fuzzy Graph Connectivity Measures

In this article, we propose Fuzzy Hindi WordNet, which is an extended version of Hindi WordNet. The proposed idea of fuzzy relations and their role in modeling Fuzzy Hindi WordNet is explained. We mathematically define fuzzy relations and the composition of these fuzzy relations for this extended version. We show that the concept of composition of fuzzy relations can be used to infer a relation between two words that otherwise are not directly related in Hindi WordNet. Then we propose fuzzy graph connectivity measures that include both local and global measures. These measures are used in determining the significance of a concept (which is represented as a vertex in the fuzzy graph) in a specific context. Finally, we show how these extended measures solve the problem of word sense disambiguation (WSD) effectively, which is useful in many natural language processing applications to improve their performance. Experiments on standard sense tagged corpus for WSD show better results when Fuzzy Hindi WordNet is used in place of Hindi WordNet.

[1]  Prabir Bhattacharya,et al.  Some remarks on fuzzy graphs , 1987, Pattern Recognit. Lett..

[2]  Girish Nath Jha,et al.  Translating politeness across cultures: case of Hindi and English , 2010, ICIC '10.

[3]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[4]  Parteek Bhatia,et al.  Word Sense Disambiguation for Hindi Language , 2008 .

[5]  Devendra K. Tayal,et al.  Measuring context-meaning for open class words in Hindi language , 2013, 2013 Sixth International Conference on Contemporary Computing (IC3).

[6]  Sunny Rai,et al.  Shrinking digital gap through automatic generation of WordNet for Indian languages , 2014, AI & SOCIETY.

[7]  Rada Mihalcea,et al.  Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling , 2005, HLT.

[8]  S. K. Dwivedi,et al.  An Entropy Based Method for Removing Web Query Ambiguity in Hindi Language , 2008 .

[9]  T. Pavlidis,et al.  Fuzzy sets and their applications to cognitive and decision processes , 1977 .

[10]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[11]  Sarah Eichmann,et al.  Fuzzy Logic Intelligence Control And Information , 2016 .

[12]  Gopal K Gupta,et al.  Introduction to Data Mining with Case Studies , 2011 .

[13]  Ben Shneiderman,et al.  Structural analysis of hypertexts: identifying hierarchies and useful metrics , 1992, TOIS.

[14]  Simone Paolo Ponzetto,et al.  Knowledge-Rich Word Sense Disambiguation Rivaling Supervised Systems , 2010, ACL.

[15]  Dragomir R. Radev,et al.  Book Review: Graph-Based Natural Language Processing and Information Retrieval by Rada Mihalcea and Dragomir Radev , 2011, CL.

[16]  Christiane Fellbaum,et al.  WordNet then and now , 2007, Lang. Resour. Evaluation.

[17]  Katrin Erk,et al.  Measuring Word Meaning in Context , 2013, CL.

[18]  Francisco P. Romero,et al.  Classifying unlabeled short texts using a fuzzy declarative approach , 2013, Lang. Resour. Evaluation.

[19]  Devendra K. Tayal,et al.  Retrieving web search results using Max–Max soft clustering for Hindi query , 2016, Int. J. Syst. Assur. Eng. Manag..

[20]  Roberto Navigli,et al.  Semi-Automatic Extension of Large-Scale Linguistic Knowledge Bases , 2005, FLAIRS.

[21]  Luca Becchetti,et al.  The distribution of pageRank follows a power-law only for particular values of the damping factor , 2006, WWW '06.

[22]  Pushpak Bhattacharyya,et al.  IndoWordNet , 2010, LREC.

[23]  Pushpak Bhattacharyya,et al.  Hindi Word Sense Disambiguation , 2004 .

[24]  Devendra K. Tayal,et al.  New method for solving reviewer assignment problem using type-2 fuzzy sets and fuzzy functions , 2013, Applied Intelligence.

[25]  Ted Pedersen,et al.  An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet , 2002, CICLing.

[26]  Mark E. J. Newman A measure of betweenness centrality based on random walks , 2005, Soc. Networks.

[27]  Noga Alon,et al.  Source coding and graph entropies , 1996, IEEE Trans. Inf. Theory.

[28]  Avneet Kaur Development of an Approach for Disambiguating Ambiguous Hindi postposition , 2010 .

[29]  George J. Klir,et al.  Fuzzy sets, uncertainty and information , 1988 .

[30]  Shyi-Ming Chen,et al.  Document retrieval using fuzzy-valued concept networks , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[31]  David Hawking,et al.  Predicting Fame and Fortune: PageRank or Indegree? , 2003 .

[32]  Raymond T Yeh,et al.  FUZZY RELATIONS, FUZZY GRAPHS, AND THEIR APPLICATIONS TO CLUSTERING ANALYSIS , 1975 .

[33]  S. G. Bhirud,et al.  Exploiting links in WordNet hierarchy for word sense disambiguation of nouns , 2009, ICAC3 '09.

[34]  Dominic Widdows,et al.  A Graph Model for Unsupervised Lexical Acquisition , 2002, COLING.

[35]  John Skvoretz,et al.  Node centrality in weighted networks: Generalizing degree and shortest paths , 2010, Soc. Networks.

[36]  M S Sunitha STUDIES ON FUZZY GRAPHS , 2001 .

[37]  Piero P. Bonissone,et al.  Selecting Uncertainty Calculi and Granularity: An Experiment in Trading-off Precision and Complexity , 1985, UAI.

[38]  Vibhakar Mansotra,et al.  Query Optimization: A Solution for Low Recall Problem in Hindi Language Information Retrieval , 2012 .

[39]  Christiane Fellbaum,et al.  Erratum to: Large, huge or gigantic? Identifying and encoding intensity relations among adjectives in WordNet , 2013, Lang. Resour. Evaluation.

[40]  Shyi-Ming Chen,et al.  Fuzzy information retrieval based on multi-relationship fuzzy concept networks , 2003, Fuzzy Sets Syst..

[41]  Kwang Hyung Lee,et al.  First Course on Fuzzy Theory and Applications , 2005, Advances in Soft Computing.

[42]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[43]  Satyendr Singh,et al.  Evaluating effect of context window size, stemming and stop word removal on Hindi word sense disambiguation , 2012, 2012 International Conference on Information Retrieval & Knowledge Management.

[44]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[45]  Martine De Cock,et al.  Fuzzy Thesauri for and from the WWW , 2005 .

[46]  Mirella Lapata,et al.  An Experimental Study of Graph Connectivity for Unsupervised Word Sense Disambiguation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  M. Kracker A fuzzy concept network model and its applications , 1992, [1992 Proceedings] IEEE International Conference on Fuzzy Systems.

[48]  Donald B. Johnson,et al.  Efficient Algorithms for Shortest Paths in Sparse Networks , 1977, J. ACM.

[49]  Devendra K. Tayal,et al.  Automatically incorporating context meaning for query expansion using graph connectivity measures , 2014, Progress in Artificial Intelligence.

[50]  Sunil Mathew,et al.  Types of arcs in a fuzzy graph , 2009, Inf. Sci..

[51]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[52]  J. Yen,et al.  Fuzzy Logic: Intelligence, Control, and Information , 1998 .

[53]  Stephen P. Borgatti,et al.  Identifying sets of key players in a social network , 2006, Comput. Math. Organ. Theory.

[54]  S. G. Kolte,et al.  WordNet : A Knowledge Source for Word Sense Disambiguation , 2009 .

[55]  Christos Diou,et al.  Constructing Fuzzy Relations fromWordNet forWord Sense Disambiguation , 2006, 2006 First International Workshop on Semantic Media Adaptation and Personalization (SMAP'06).

[56]  Ronald R. Yager,et al.  Concept Representation and Database Structures in Fuzzy Social Relational Networks , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[57]  L. Freeman,et al.  Centrality in valued graphs: A measure of betweenness based on network flow , 1991 .

[58]  James C. Bezdek,et al.  Transitive Closures of Fuzzy Thesauri for Information-Retrieval Systems , 1986, Int. J. Man Mach. Stud..

[59]  Aditi Sharan,et al.  Exploiting Ontology for Concept Based Information Retrieval , 2011, ICIS 2011.

[60]  Tanveer J. Siddiqui,et al.  An Unsupervised Approach to Hindi Word Sense Disambiguation , 2009, IHCI.

[61]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[62]  Akinori Fujino,et al.  Word Sense Disambiguation by Combining Labeled Data Expansion and Semi-Supervised Learning Method , 2013, TALIP.