Query sense disambiguation leveraging large scale user behavioral data

Term ambiguity, the challenge of a keyword or phrase having multiple potential meanings, can be a major problem for search engines. Contextual information is essential for word sense disambiguation, but search queries are often limited to very few keywords, so the textual context available for disambiguation is minimal or non-existent. In this paper we propose a novel system that identifies and resolves term ambiguity in search queries using large-scale user behavioral data. The system demonstrates that, despite the lack of context in most keyword queries, the multiple potential senses of a keyword or phrase within a search query can be accurately identified, disambiguated, and expressed so as to maximize the likelihood of fulfilling a user's information need. It compensates for the lack of context within the query itself by leveraging user behavioral data from historical query logs. Unlike traditional word sense disambiguation methods that rely on knowledge sources or available textual corpora, our system is language-agnostic, easily handles domain-specific terms and meanings, and is generated automatically, so it does not grow out of date or require manual updating as ambiguous terms emerge or shift in meaning. The system has been implemented using the Hadoop ecosystem and integrated into CareerBuilder's semantic search engine.
