A hybrid genetic-ant colony optimization algorithm for the word sense disambiguation problem

Abstract

Word sense disambiguation (WSD) is a natural language processing problem at the semantic level: determining which sense of a polysemous word fits a particular context. WSD has been addressed with several approaches, including metaheuristic algorithms. We propose hybrid algorithms for WSD that combine a self-adaptive genetic algorithm (SAGA) with two variants of ant colony optimization (ACO): the max-min ant system (MMAS) and the ant colony system (ACS). SAGA automatically tunes the parameters of MMAS and ACS. The ACO algorithms are adapted to WSD using a combination of two measures: the semantic relatedness between sequences of senses corresponding to the context words, and the semantic relatedness between the sense of a target word and the sense of a context word. We evaluated the two ACO algorithms (MMASWSD and ACSWSD) and their hybridizations with SAGA (GMMASWSD and GACSWSD) on fine-grained and coarse-grained corpora, and compared them with the best-performing algorithms. The empirical results indicate that GMMASWSD outperformed the other variants and all of the rival algorithms on the fine-grained corpora. On the coarse-grained corpus, however, GMMASWSD did not achieve the best performance, although it came close to that of the best algorithm.
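To make the MMAS idea concrete, the following is a minimal sketch of a max-min ant system that picks one sense per word. It is not the MMASWSD algorithm of this paper: the function name `mmas_wsd`, the relatedness callback `rel`, the default parameter values, and the deposit amount `rho * tau_max` are all assumptions chosen for illustration. The sketch shows only the generic MMAS ingredients: trails initialized at the upper bound, reinforcement of the best-so-far solution alone, and clamping of every trail to `[tau_min, tau_max]`.

```python
import random


def mmas_wsd(sense_counts, rel, n_ants=10, n_iters=30,
             rho=0.1, tau_min=0.01, tau_max=1.0, seed=0):
    """Toy max-min ant system: choose one sense index per word.

    sense_counts[i] is the number of candidate senses of word i.
    rel(i, a, j, b) is a hypothetical relatedness score between
    sense a of word i and sense b of word j (an assumption, not the
    measure used in the paper).
    """
    rng = random.Random(seed)
    # MMAS conventionally starts every trail at the upper bound.
    tau = [[tau_max] * n for n in sense_counts]

    def score(assign):
        # Toy objective: total pairwise relatedness of the chosen senses.
        return sum(rel(i, assign[i], j, assign[j])
                   for i in range(len(assign))
                   for j in range(i + 1, len(assign)))

    best, best_score = None, float("-inf")
    for _ in range(n_iters):
        for _ in range(n_ants):
            # Each ant samples a sense per word in proportion to pheromone.
            assign = [rng.choices(range(n), weights=tau[i])[0]
                      for i, n in enumerate(sense_counts)]
            s = score(assign)
            if s > best_score:
                best, best_score = assign, s
        # Evaporate, reinforce only the best-so-far assignment, then clamp.
        for i, row in enumerate(tau):
            for a in range(len(row)):
                row[a] *= (1.0 - rho)
                if a == best[i]:
                    row[a] += rho * tau_max  # assumed deposit amount
                row[a] = min(tau_max, max(tau_min, row[a]))
    return best, best_score, tau


# Hypothetical relatedness: senses with the same index count as related.
def same_index(i, a, j, b):
    return 1.0 if a == b else 0.0


best, best_score, tau = mmas_wsd([2, 3, 2], same_index)
print(best, best_score)
```

In a hybrid of the kind described in the abstract, an outer genetic algorithm would search over values such as `rho`, `n_ants`, `tau_min`, and `tau_max` rather than fixing them by hand; that outer loop is omitted here.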
