Parameter Transfer across Domains for Word Sense Disambiguation

Word sense disambiguation is defined as finding the corresponding sense for a target word in a given context, which comprises a major step in text applications. Recently, it has been addressed as an optimization problem. The idea behind is to find a sequence of senses that corresponds to the words in a given context with a maximum semantic similarity. Metaheuristics like simulated annealing and D-Bees provide approximate good-enough solutions, but are usually influenced by the starting parameters. In this paper, we study the parameter tuning for both algorithms within the word sense disambiguation problem. The experiments are conducted on different datasets to cover different disambiguation scenarios. We show that D-Bees is robust and less sensitive towards the initial parameters compared to simulated annealing, hence, it is sufficient to tune the parameters once and reuse them for different datasets, domains or languages.

[1]  Didier Schwab,et al.  Ant Colony Algorithm for the Unsupervised Word Sense Disambiguation of Texts: Comparison and Evaluation , 2012, COLING.

[2]  Yiming Zhou,et al.  Genetic Word Sense Disambiguation Algorithm , 2008, 2008 Second International Symposium on Intelligent Information Technology Application.

[3]  Steven Skiena,et al.  Statistically Significant Detection of Linguistic Change , 2014, WWW.

[4]  Mirella Lapata,et al.  Ensemble Methods for Unsupervised WSD , 2006, ACL.

[5]  Ted Pedersen,et al.  Maximizing Semantic Relatedness to Perform Word Sense Disambiguation , 2005 .

[6]  Iryna Gurevych,et al.  Metaheuristic Approaches to Lexical Substitution and Simplification , 2017, EACL.

[7]  Lucia Specia,et al.  SemEval-2012 Task 1: English Lexical Simplification , 2012, *SEMEVAL.

[8]  Manuel López-Ibáñez,et al.  Ant colony optimization , 2010, GECCO '10.

[9]  Didier Schwab,et al.  Parameter estimation under uncertainty with Simulated Annealing applied to an ant colony based probabilistic WSD algorithm , 2012, Coling 2012.

[10]  Piek T. J. M. Vossen,et al.  SemEval-2010 Task 17: All-Words Word Sense Disambiguation on a Specific Domain , 2009, *SEMEVAL.

[11]  Karl-Heinz Zimmermann,et al.  D-Bees: A novel method inspired by bee colony optimization for solving word sense disambiguation , 2014, Swarm Evol. Comput..

[12]  Daphne Koller,et al.  Word-Sense Disambiguation for Machine Translation , 2005, HLT.

[13]  German Rigau,et al.  Supervised Corpus-Based Methods for WSD , 2007 .

[14]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[15]  Ted Pedersen,et al.  An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet , 2002, CICLing.

[16]  Dusan Teodorovic,et al.  Bee Colony Optimization (BCO) , 2009, Innovations in Swarm Intelligence.

[17]  Eneko Agirre,et al.  Word Sense Disambiguation: Algorithms and Applications , 2007 .

[18]  David Yarowsky,et al.  Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[19]  El-Ghazali Talbi,et al.  Metaheuristics - From Design to Implementation , 2009 .

[20]  R. Navigli,et al.  SemEval-2007 Task 07: Coarse-Grained English All-Words Task , 2007, International Workshop on Semantic Evaluation.

[21]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[22]  Didier Schwab,et al.  A Global Ant Colony Algorithm for Word Sense Disambiguation Based on Semantic Relatedness , 2011, PAAMS.

[23]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[24]  Chee Peng Lim,et al.  Innovations in Swarm Intelligence , 2009, Innovations in Swarm Intelligence.

[25]  Thomas Risse,et al.  Towards automatic language evolution tracking A study on word sense tracking , 2011 .

[26]  Thomas Stützle,et al.  Automatic Algorithm Configuration Based on Local Search , 2007, AAAI.

[27]  Bertrand Neveu,et al.  A beginner's guide to tuning methods , 2014, Appl. Soft Comput..

[28]  Louise Guthrie,et al.  Lexical Disambiguation using Simulated Annealing , 1992, COLING.

[29]  Roberto Navigli A Quick Tour of Word Sense Disambiguation, Induction and Related Approaches , 2012, SOFSEM.