Personalizing PageRank for Word Sense Disambiguation

In this paper we propose a new graph-based method that uses the knowledge in a Lexical Knowledge Base (LKB) based on WordNet to perform unsupervised Word Sense Disambiguation. Our algorithm uses the full graph of the LKB efficiently and performs better than previous approaches on English all-words datasets. We also show that the algorithm can be easily ported to other languages with good results, the only requirement being a wordnet for the target language. In addition, we analyze the performance of the algorithm, showing that it is efficient and that it could be tuned to be faster.
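The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea the title points to: running PageRank over a sense graph while concentrating the teleport probability on senses of the context words. The toy graph, the sense labels (such as `bank#finance`), the function name `personalized_pagerank`, and the damping value of 0.85 are all illustrative assumptions, not the authors' implementation or data.

```python
def personalized_pagerank(graph, seeds, damping=0.85, iterations=50):
    """Power iteration for PageRank with teleport mass restricted to `seeds`.

    graph: dict mapping each node to a list of its neighbors (undirected here,
           so every edge is listed in both directions).
    seeds: set of nodes that receive the teleport probability, split uniformly.
    """
    nodes = list(graph)
    teleport = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    rank = dict(teleport)  # start from the teleport distribution
    for _ in range(iterations):
        # Every node first receives its share of the teleport mass.
        new_rank = {n: (1.0 - damping) * teleport[n] for n in nodes}
        # Then each node passes its damped rank evenly to its neighbors.
        for node in nodes:
            share = damping * rank[node] / len(graph[node])
            for neighbor in graph[node]:
                new_rank[neighbor] += share
        rank = new_rank
    return rank


# Hypothetical toy sense graph (not taken from WordNet).
toy_graph = {
    "bank#finance": ["money#1", "deposit#1"],
    "bank#river": ["water#1", "deposit#1"],
    "money#1": ["bank#finance"],
    "deposit#1": ["bank#finance", "bank#river"],
    "water#1": ["bank#river"],
}

# Assume the context contains "money" and "deposit": seed their senses.
scores = personalized_pagerank(toy_graph, seeds={"money#1", "deposit#1"})

# Choose the sense of the target word "bank" with the highest rank.
candidates = ["bank#finance", "bank#river"]
best = max(candidates, key=lambda sense: scores[sense])
print(best)
```

In this toy setting the financial sense is linked to both seeded nodes and the river sense to only one, so the financial sense accumulates more rank. A real system would build the graph from the LKB relations and seed it from the actual context words.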
