A Taxonomy and Survey of Semantic Approaches for Query Expansion

Conventional approaches to query expansion (QE) rely on the integration of an unstructured corpus and probabilistic rules for the extraction of candidate expansion terms. These methods do not consider search query semantics, which results in ineffective retrieval of information. Semantic approaches to QE overcome this limitation by expanding a search query with meaningful terms that accord with the user's information needs. This paper surveys recent approaches to semantic QE that employ different models and strategies and leverage various knowledge structures. We organize these approaches into a taxonomy that includes linguistic methods, ontology-based methods, and mixed-mode methods. We also discuss the strengths and limitations of each type of semantic QE method. In addition, we evaluate various semantic QE approaches in terms of knowledge structure utilization, corpus collection, baseline model adaptation, and retrieval performance. Finally, we suggest future directions in exploiting personalized social information and multiple ontologies for semantic QE.
