Word sense disambiguation by semantic inference

This paper proposes an algorithm for unsupervised Word Sense Disambiguation to bypass the knowledge bottleneck faced by supervised approaches. By simulating the semantic inference process performed by human language users, the algorithm makes use of a thesaurus to obtain potential substitute words for the target word in a sentence, builds substitute constructs by replacing the target word with substitute words, uses large-scale dependency parsed corpora to calculate the likelihood of the substitute constructs, and then obtain the best substitute word which help specify the sense of the target word in the sentence. Experiments with WordNet 2.1 and the corpora English Gigawords on the lexical sample task in SemEval-2007 show that the algorithm achieves the-state-of-art accuracy for both nouns and verbs, which are 3–5 percent higher than the best unsupervised system in SemEval-2007, given the condition that the knowledge source provides sufficient information.

[1]  Martha Palmer,et al.  SemEval-2007 Task-17: English Lexical Sample, SRL and All Words , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[2]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[3]  Stefano Faralli,et al.  Unsupervised, Knowledge-Free, and Interpretable Word Sense Disambiguation , 2017, EMNLP.

[4]  Hans Uszkoreit,et al.  DFKI: Multi-objective Optimization for the Joint Disambiguation of Entities and Nouns & Deep Verb Sense Disambiguation , 2015, SemEval@NAACL-HLT.

[5]  Dekang Lin,et al.  Using Syntactic Dependency as Local Context to Resolve Word Sense Ambiguity , 1997, ACL.

[6]  Ted Pedersen,et al.  UMND1: Unsupervised Word Sense Disambiguation Using Contextual Semantic Relatedness , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[7]  George A. Miller,et al.  Using Corpus Statistics and WordNet Relations for Sense Identification , 1998, CL.

[8]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[9]  Roberto Navigli,et al.  Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison , 2017, EACL.

[10]  Rada Mihalcea,et al.  Bootstrapping Large Sense Tagged Corpora , 2002, LREC.

[11]  Fernando Gomez,et al.  Acquiring Knowledge from the Web to be used as Selectors for Noun Sense Disambiguation , 2008, CoNLL.

[12]  Eneko Agirre,et al.  Word Relatives in Context for Word Sense Disambiguation , 2006, ALTA.

[13]  Alok Ranjan Pal,et al.  Word sense disambiguation: a survey , 2015, ArXiv.

[14]  Mirella Lapata,et al.  Ensemble Methods for Unsupervised WSD , 2006, ACL.

[15]  Eneko Agirre,et al.  Word Sense Disambiguation: Algorithms and Applications , 2007 .

[16]  A. Goldberg Constructions at Work: The Nature of Generalization in Language , 2006 .

[17]  Deniz Yuret,et al.  KU: Word Sense Disambiguation by Substitution , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).