SUDOKU: Treating Word Sense Disambiguation & Entitiy Linking as a Deterministic Problem - via an Unsupervised & Iterative Approach

SUDOKU’s submissions to SemEval Task 13 treats Word Sense Disambiguation and Entity Linking as a deterministic problem that exploits two key attributes of open-class words as constraints ‐ their degree of polysemy and their part of speech. This is an extension and further validation of the results achieved by Manion and Sainudiin (2014). SUDOKU’s three submissions are incremental in the use of the two aforementioned constraints. Run1 has no constraints and disambiguates all lemmas in one pass. Run2 disambiguates lemmas at increasing degrees of polysemy, leaving the most polysemous until last. Run3 is identical to Run2, with the additional constraint of disambiguating all named entities and nouns first before other types of open-class words (verbs, adjectives, and adverbs). Over all-domains, for English Run2 and Run3 were placed second and third. For Spanish Run2, Run3, and Run1 were placed first, second, and third respectively. For Italian Run1 was placed first with Run2 and Run3 placed second equal.

[1]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[2]  Taher H. Haveliwala Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search , 2003, IEEE Trans. Knowl. Data Eng..

[3]  Raazesh Sainudiin,et al.  An Iterative 'Sudoku Style' Approach to Subgraph-based Word Sense Disambiguation , 2014, *SEM@COLING.

[4]  George A. Miller,et al.  A Semantic Concordance , 1993, HLT.

[5]  Martha Palmer,et al.  The English all-words task , 2004, SENSEVAL@ACL.

[6]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[7]  Rafael Muñoz,et al.  UMCC_DLSI: Reinforcing a Ranking Algorithm with Sense Frequencies and Multidimensional Semantic Resources to solve Multilingual Word Sense Disambiguation , 2013, SemEval@NAACL-HLT.

[8]  Julie Weeds,et al.  Finding Predominant Word Senses in Untagged Text , 2004, ACL.

[9]  David Yarowsky,et al.  One Sense per Collocation , 1993, HLT.

[10]  Roberto Navigli,et al.  SemEval-2013 Task 12: Multilingual Word Sense Disambiguation , 2013, *SEMEVAL.

[11]  Simone Paolo Ponzetto,et al.  BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network , 2012, Artif. Intell..

[12]  M. Abdelguerfi,et al.  Introduction 1.2 Parallel Database Systems 1.2.1 Computation Model 2 1.2 Parallel Database Systems Introduction Select * from Employee, Department Where (employee.dept_no @bullet Department.dept_no) and (employee.position = "manager") (a) Sql Request 1.2.2 Engineering Model , 2022 .

[13]  Roberto Navigli,et al.  SemEval-2015 Task 13: Multilingual All-Words Sense Disambiguation and Entity Linking , 2015, *SEMEVAL.

[14]  Eneko Agirre,et al.  Personalizing PageRank for Word Sense Disambiguation , 2009, EACL.

[15]  Eneko Agirre,et al.  Word Sense Disambiguation: Algorithms and Applications , 2007 .

[16]  Piek T. J. M. Vossen,et al.  SemEval-2010 Task 17: All-Words Word Sense Disambiguation on a Specific Domain , 2009, *SEMEVAL.

[17]  Steve L. Manion,et al.  Unsupervised Knowledge-based Word Sense Disambiguation: Exploration & Evaluation of Semantic Subgraphs , 2014 .