Query Expansion Techniques for Information Retrieval: A Survey

Abstract: With the ever-increasing size of the web, extracting relevant information from the Internet with a query of only a few keywords has become a major challenge. Query Expansion (QE) plays a crucial role in improving Internet search: the user’s initial query is reformulated by adding meaningful terms of similar significance. QE, as part of information retrieval (IR), has long attracted researchers’ attention. It has become influential in the fields of personalized social documents, question answering, cross-language IR, information filtering, and multimedia IR. Research in QE has gained further prominence through IR-dedicated conferences such as TREC (Text REtrieval Conference) and CLEF (Conference and Labs of the Evaluation Forum). This paper surveys QE techniques in IR from 1960 to 2017 with respect to core techniques, data sources used, weighting and ranking methodologies, user participation, and applications, bringing out their similarities and differences.

[1]  Stephen E. Robertson,et al.  Parallel computing in information retrieval - an updated review , 1997, J. Documentation.

[2]  Xiangji Huang,et al.  Proximity-based rocchio's model for pseudo relevance , 2012, SIGIR '12.

[3]  Tetsuya Sakai,et al.  Structured query suggestion for specialization and parallel movement: effect on search behaviors , 2012, WWW.

[4]  Cherif Chiraz Latiri,et al.  Short Query Expansion for Microblog Retrieval , 2016, KES.

[5]  Duen-Ren Liu,et al.  Complementary QA network analysis for QA retrieval in social question‐answering websites , 2015, J. Assoc. Inf. Sci. Technol..

[6]  Lixin Gan,et al.  Improving Query Expansion for Information Retrieval Using Wikipedia , 2015 .

[7]  Ana Paula Appel,et al.  Building a Question-Answering Corpus Using Social Media and News Articles , 2016, PROPOR.

[8]  Johanna Enberg,et al.  Query Expansion , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[9]  Vasileios Theodorou,et al.  Goal-Based Semantic Queries for Dynamic Processes in the Internet of Things , 2016, Int. J. Semantic Comput..

[10]  Stephen E. Robertson,et al.  Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.

[11]  Isabelle Augenstein,et al.  Mapping Keywords to Linked Data Resources for Automatic Query Expansion , 2013, KNOW@LOD.

[12]  Chirag Shah,et al.  Evaluating high accuracy retrieval techniques , 2004, SIGIR '04.

[13]  Wei-Ying Ma,et al.  Optimizing web search using web click-through data , 2004, CIKM '04.

[14]  Min Wang,et al.  Exploiting entity relationship for query expansion in enterprise search , 2014, Information Retrieval.

[15]  Ellen M. Voorhees,et al.  Overview of the seventh text retrieval conference (trec-7) [on-line] , 1999 .

[16]  Yue Xu,et al.  Pattern-based Topics for Document Modelling in Information Filtering , 2014, IEEE Transactions on Knowledge and Data Engineering.

[17]  Nicholas J. Belkin,et al.  A case for interaction: a study of interactive information retrieval behavior and effectiveness , 1996, CHI.

[18]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[19]  Ji-Rong Wen,et al.  Query clustering using user logs , 2002, TOIS.

[20]  Stephen E. Robertson,et al.  A probabilistic model of information retrieval: development and comparative experiments - Part 1 , 2000, Inf. Process. Manag..

[21]  Eneko Agirre,et al.  Random Walks for Knowledge-Based Word Sense Disambiguation , 2014, CL.

[22]  Hongfei Lin,et al.  Improving biomedical information retrieval by linear combinations of different query expansion techniques , 2016, BMC Bioinformatics.

[23]  Alia I. Abdelmoty,et al.  Ontology-Based Spatial Query Expansion in Information Retrieval , 2005, OTM Conferences.

[24]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[25]  Javed A. Aslam,et al.  Relevance score normalization for metasearch , 2001, CIKM '01.

[26]  Pasquale Lops,et al.  Social Question Answering , 2016, ACM Trans. Inf. Syst..

[27]  Hsin-Hsi Chen,et al.  Combining WordNet and ConceptNet for Automatic Query Expansion: A Learning Approach , 2008, AIRS.

[28]  Oliver A. McBryan,et al.  GENVL and WWWW: Tools for taming the Web , 1994, WWW Spring 1994.

[29]  W. Bruce Croft,et al.  Query reformulation using anchor text , 2010, WSDM '10.

[30]  Efthimis N. Efthimiadis,et al.  Analyzing and evaluating query reformulation strategies in web search logs , 2009, CIKM.

[31]  Fernando Diaz,et al.  Condensed List Relevance Models , 2015, ICTIR.

[32]  Thomas Schlegel,et al.  Using Semantic Queries to Enable Dynamic Service Invocation for Processes in the Internet of Things , 2016, 2016 IEEE Tenth International Conference on Semantic Computing (ICSC).

[33]  Jimmy J. Lin,et al.  Assessing the term independence assumption in blind relevance feedback , 2005, SIGIR '05.

[34]  Stephen E. Robertson,et al.  On Term Selection for Query Expansion , 1991, J. Documentation.

[35]  Paul M. B. Vitányi,et al.  The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[36]  Jaime G. Carbonell,et al.  Document Representation and Query Expansion Models for Blog Recommendation , 2008, ICWSM.

[37]  Jacques Savoy,et al.  Comparative study of monolingual and multilingual search models for use with asian languages , 2005, TALIP.

[38]  Ben Carterette,et al.  Time Based Feedback and Query Expansion for Twitter Search , 2013, ECIR.

[39]  Julio Gonzalo,et al.  Indexing with WordNet synsets can improve text retrieval , 1998, WordNet@ACL/COLING.

[40]  John Tait,et al.  Word sense disambiguation in information retrieval revisited , 2003, SIGIR.

[41]  CloughPaul,et al.  An IR-Based Approach Utilizing Query Expansion for Plagiarism Detection in MEDLINE , 2017 .

[42]  W. Bruce Croft,et al.  Automatic boolean query suggestion for professional search , 2011, SIGIR.

[43]  C. J. van Rijsbergen,et al.  Term Similarity-Based Query Expansion for Cross-Language Information Retrieval , 1999, ECDL.

[44]  Winston H. Hsu,et al.  Query expansion for hash-based image object retrieval , 2009, ACM Multimedia.

[45]  W. Bruce Croft,et al.  Lexical ambiguity and information retrieval , 1992, TOIS.

[46]  Claudio Carpineto,et al.  FUB at TREC 2008 Relevance Feedback Track: Extending Rocchio with Distributional Term Analysis , 2008, TREC.

[47]  Benoît Gaillard,et al.  Query expansion for Cross Language Information Retrieval Improvement , 2010, 2010 Fourth International Conference on Research Challenges in Information Science (RCIS).

[48]  James P. Callan,et al.  Query Expansion with Freebase , 2015, ICTIR.

[49]  Hsin-Hsi Chen,et al.  Query Expansion with ConceptNet and WordNet: An Intrinsic Comparison , 2006, AIRS.

[50]  G Salton,et al.  Developments in Automatic Text Retrieval , 1991, Science.

[51]  Giuseppe Sansonetti,et al.  Social semantic query expansion , 2013, ACM Trans. Intell. Syst. Technol..

[52]  Mandar Mitra,et al.  Improving query expansion using WordNet , 2013, J. Assoc. Inf. Sci. Technol..

[53]  Xiaochen Li,et al.  Query Expansion Based on Crowd Knowledge for Code Search , 2016, IEEE Transactions on Services Computing.

[54]  Eric Brill,et al.  Automatic question answering using the web: Beyond the Factoid , 2006, Information Retrieval.

[55]  ChengXiang Zhai,et al.  Mining term association patterns from search logs for effective query reformulation , 2008, CIKM '08.

[56]  Aditi Sharan,et al.  Selecting Effective Expansion Terms for Better Information Retrieval , 2010, Int. J. Comput. Sci. Appl..

[57]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[58]  Alessandro Micarelli,et al.  Social Tagging in Query Expansion: A New Way for Personalized Web Search , 2009, 2009 International Conference on Computational Science and Engineering.

[59]  Lisa Ballesteros,et al.  Light Stemming for Arabic Information Retrieval , 2007 .

[60]  Valentina Franzoni,et al.  PMING Distance: A Collaborative Semantic Proximity Measure , 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[61]  ChengXiang Zhai,et al.  Positional relevance model for pseudo-relevance feedback , 2010, SIGIR.

[62]  Ellen M. Voorhees,et al.  TREC 2014 Web Track Overview , 2015, TREC.

[63]  ChengXiang Zhai,et al.  Learn from web search logs to organize search results , 2007, SIGIR.

[64]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[65]  Evgeniy Gabrilovich,et al.  Concept-Based Information Retrieval Using Explicit Semantic Analysis , 2011, TOIS.

[66]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[67]  Van Rijsbergen,et al.  A theoretical basis for the use of co-occurence data in information retrieval , 1977 .

[68]  Cordelia Schmid,et al.  Stable Hyper-pooling and Query Expansion for Event Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[69]  Justin Zobel,et al.  Questioning Query Expansion: An Examination of Behaviour and Parameters , 2004, ADC.

[70]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[71]  Jian-Yun Nie,et al.  Query expansion using term relationships in language models for information retrieval , 2005, CIKM '05.

[72]  Mandar Mitra,et al.  Query Expansion Using Term Distribution and Term Association , 2013, ArXiv.

[73]  Dong Zhou,et al.  Improving search via personalized query expansion using social media , 2012, Information Retrieval.

[74]  Jun Guo,et al.  Improving Retrieval Performance by Global Analysis , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[75]  Jun Cai,et al.  Automatic Query Refinement Using Mined Semantic Relations , 2005, International Workshop on Challenges in Web Information Retrieval and Integration.

[76]  Sudeshna Sarkar,et al.  UsingWord Embeddings for Query Translation for Hindi to English Cross Language Information Retrieval , 2016, ArXiv.

[77]  Chin-Teng Lin,et al.  A Novel Fuzzy Logic Model for Pseudo-Relevance Feedback-Based Query Expansion , 2016, Int. J. Fuzzy Syst..

[78]  Ibrahim Abu El-Khair,et al.  Arabic information retrieval , 2007, Annu. Rev. Inf. Sci. Technol..

[79]  Jing Wang,et al.  Clickage: towards bridging semantic and intent gaps via mining click logs of search engines , 2013, ACM Multimedia.

[80]  James Allan,et al.  The effect of adding relevance information in a relevance feedback environment , 1994, SIGIR '94.

[81]  M. de Rijke,et al.  Articulating information needs in XML query languages , 2006, TOIS.

[82]  C. J. van Rijsbergen,et al.  The use of hierarchic clustering in information retrieval , 1971, Inf. Storage Retr..

[83]  Karen Sparck Jones Automatic keyword classification for information retrieval , 1971 .

[84]  Rada Mihalcea,et al.  eXtended WordNet: progress report , 2001, HTL 2001.

[85]  Milad Shokouhi,et al.  Query Expansion Using External Evidence , 2009, ECIR.

[86]  James Allan,et al.  Automatic Query Expansion Using SMART: TREC 3 , 1994, TREC.

[87]  Claudio Carpineto,et al.  A Survey of Automatic Query Expansion in Information Retrieval , 2012, CSUR.

[88]  Jianying Wang,et al.  A corpus analysis approach for automatic query expansion and its extension to multiple databases , 1999, TOIS.

[89]  David R. Karger,et al.  Tie strength in question & answer on social network sites , 2012, CSCW '12.

[90]  Eric Horvitz,et al.  Patterns of search: analyzing and modeling Web query refinement , 1999 .

[91]  ChengXiang Zhai,et al.  Positional language models for information retrieval , 2009, SIGIR.

[92]  Lanfen Lin,et al.  Domain lexicon-based query expansion for patent retrieval , 2016, 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[93]  Ryen W. White,et al.  A study of factors affecting the utility of implicit relevance feedback , 2005, SIGIR '05.

[94]  Stefanie Tellex,et al.  Grounding spatial language for video search , 2010, ICMI-MLMI '10.

[95]  Amit Singhal,et al.  Document expansion for speech retrieval , 1999, SIGIR '99.

[96]  Jean Paul Ballerini,et al.  Experiments in multilingual information retrieval using the SPIDER system , 1996, SIGIR '96.

[97]  Prasenjit Mitra,et al.  Query suggestions in the absence of query logs , 2011, SIGIR.

[98]  David A. Hull Stemming Algorithms: A Case Study for Detailed Evaluation , 1996, J. Am. Soc. Inf. Sci..

[99]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[100]  Mark Stevenson,et al.  An IR-Based Approach Utilizing Query Expansion for Plagiarism Detection in MEDLINE , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[101]  Paola Velardi,et al.  Structural semantic interconnections: a knowledge-based approach to word sense disambiguation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[102]  Rong Yan,et al.  Semantic concept-based query expansion and re-ranking for multimedia retrieval , 2007, ACM Multimedia.

[103]  Jian-Yun Nie,et al.  Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web , 1999, SIGIR '99.

[104]  Wei-Ying Ma,et al.  Probabilistic query expansion using query logs , 2002, WWW '02.

[105]  Donna K. Harman,et al.  Relevance Feedback and Other Query Modification Techniques , 1992, Information retrieval (Boston).

[106]  Ricardo A. Baeza-Yates,et al.  Query Recommendation Using Query Logs in Search Engines , 2004, EDBT Workshops.

[107]  Jian-Yun Nie,et al.  Using query contexts in information retrieval , 2007, SIGIR.

[108]  Hugh E. Williams,et al.  Query expansion using associated queries , 2003, CIKM '03.

[109]  Pertti Vakkari,et al.  Subject knowledge improves interactive query expansion assisted by a thesaurus , 2004, J. Documentation.

[110]  Micheline Beaulieu,et al.  Experiments on interfaces to support query expansion , 1997, J. Documentation.

[111]  Ihab F. Ilyas,et al.  Expressive and flexible access to web-extracted data: a keyword-based structured query language , 2010, SIGMOD Conference.

[112]  Tat-Seng Chua,et al.  Mining dependency relations for query expansion in passage retrieval , 2006, SIGIR.

[113]  Ellen M. Voorhees,et al.  Overview of the Seventh Text REtrieval Conference , 1998 .

[114]  Clement T. Yu,et al.  An effective approach to document retrieval via utilizing WordNet and recognizing phrases , 2004, SIGIR '04.

[115]  Frederick Jelinek,et al.  Interpolated estimation of Markov source parameters from sparse data , 1980 .

[116]  Kevyn Collins-Thompson,et al.  Query expansion using random walk models , 2005, CIKM '05.

[117]  Volker Tresp,et al.  A nonparametric hierarchical bayesian framework for information filtering , 2004, SIGIR '04.

[118]  Nick Craswell,et al.  Query Expansion with Locally-Trained Word Embeddings , 2016, ACL.

[119]  Jimmy J. Lin,et al.  What Works Better for Question Answering: Stemming or Morphological Query Expansion? , 2004 .

[120]  Charles L. A. Clarke,et al.  Information Retrieval - Implementing and Evaluating Search Engines , 2010 .

[121]  Mandar Mitra,et al.  Exploring Query Categorisation for Query Expansion: A Study , 2015, ArXiv.

[122]  Luis Gravano,et al.  Learning to find answers to questions on the Web , 2004, TOIT.

[123]  Paul-Alexandru Chirita,et al.  Personalized query expansion for the web , 2007, SIGIR.

[124]  Hakan Ferhatosmanoglu,et al.  Short text classification in twitter to improve information filtering , 2010, SIGIR.

[125]  Jens Lehmann,et al.  Keyword Query Expansion on Linked Data Using Linguistic and Semantic Features , 2013, 2013 IEEE Seventh International Conference on Semantic Computing.

[126]  James Mayfield,et al.  Comparing cross-language query expansion techniques by degrading translation resources , 2002, SIGIR '02.

[127]  Chao Li,et al.  A Query Expansion Algorithm Based on Phrases Semantic Similarity , 2008, 2008 International Symposiums on Information Processing.

[128]  Jiewen Wu,et al.  A Study of Ontology-based Query Expansion , 2011 .

[129]  Wessel Kraaij,et al.  Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval , 2003, CL.

[130]  Tamas E. Doszkocs,et al.  AID, an Associative Interactive Dictionary for online searching , 1978 .

[131]  Luo Si,et al.  Learning for Efficient Supervised Query Expansion via Two-stage Feature Selection , 2016, SIGIR.

[132]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[133]  Hakim Hacid,et al.  Personalized social query expansion using social bookmarking systems , 2011, SIGIR.

[134]  Stephen E. Robertson,et al.  Microsoft Cambridge at TREC 2002: Filtering Track , 2002, TREC.

[135]  Yen-Jen Oyang,et al.  Relevant term suggestion in interactive web search based on contextual information in query session logs , 2003, J. Assoc. Inf. Sci. Technol..

[136]  Reiner Kraft,et al.  Mining anchor text for query refinement , 2004, WWW '04.

[137]  Dik Lun Lee,et al.  Re-examining the effects of adding relevance information in a relevance feedback environment , 2008, Inf. Process. Manag..

[138]  Gareth J. F. Jones,et al.  Applying summarization techniques for term selection in relevance feedback , 2001, SIGIR '01.

[139]  William R. Hersh,et al.  Assessing thesaurus-based query expansion using the UMLS Metathesaurus , 2000, AMIA.

[140]  Stephen E. Robertson,et al.  A probabilistic model of information retrieval: development and comparative experiments - Part 2 , 2000, Inf. Process. Manag..

[141]  Doug Beeferman,et al.  Agglomerative clustering of a search engine query log , 2000, KDD '00.

[142]  Khaled Radwan,et al.  Vers l'acces multilingue en langage naturel aux bases de donnees textuelles , 1994 .

[143]  Larry Fitzpatrick,et al.  Automatic feedback using past queries: social searching? , 1997, SIGIR '97.

[144]  Valentina Franzoni Just an Update on PMING Distance for Web-based Semantic Similarity in Artificial Intelligence and Data Mining , 2017, ArXiv.

[145]  Gareth J. F. Jones,et al.  Investigating segment-based query expansion for user-generated spoken content retrieval , 2016, 2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI).

[146]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[147]  Alexander Kotov,et al.  An Empirical Comparison of Statistical Term Association Graphs with DBpedia and ConceptNet for Query Expansion , 2015, FIRE.

[148]  Peter Willett,et al.  A Comparison of Spelling-Correction Methods for the Identification of Word Forms in Historical Text Databases , 1993 .

[149]  W. Bruce Croft,et al.  Modeling Term Associations for Ad-Hoc Retrieval Performance Within Language Modeling Framework , 2007, ECIR.

[150]  Philippe Mulhem,et al.  Axiomatic Term-Based Personalized Query Expansion Using Bookmarking System , 2016, DEXA.

[151]  Hao Wu,et al.  An incremental approach to efficient pseudo-relevance feedback , 2013, SIGIR.

[152]  Robert Krovetz,et al.  Viewing morphology as an inference process , 1993, Artif. Intell..

[153]  Berthier A. Ribeiro-Neto,et al.  Concept-based interactive query expansion , 2005, CIKM '05.

[154]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[155]  Kevyn Collins-Thompson,et al.  Estimation and use of uncertainty in pseudo-relevance feedback , 2007, SIGIR.

[156]  Dong Zhou,et al.  Query Expansion with Enriched User Profiles for Personalized Search Utilizing Folksonomy Data , 2017, IEEE Transactions on Knowledge and Data Engineering.

[157]  James Allan,et al.  A cluster-based resampling method for pseudo-relevance feedback , 2008, SIGIR '08.

[158]  K. S. Venkatesh,et al.  Perceptual synoptic view-based video retrieval using metadata , 2017, Signal Image Video Process..

[159]  Hakim Hacid,et al.  Sopra: a new social personalized ranking function for improving web search , 2013, SIGIR.

[160]  Koji Eguchi,et al.  NTCIR-5 Query Expansion Experiments using Term Dependence Models , 2005, NTCIR.

[161]  C. Buckley,et al.  Reliable Information Access Final Workshop Report , 2004 .

[162]  W. Bruce Croft,et al.  Phrasal translation and query expansion techniques for cross-language information retrieval , 1997, SIGIR '97.

[163]  James Allan,et al.  A context‐dependent relevance model , 2016, J. Assoc. Inf. Sci. Technol..

[164]  Iadh Ounis,et al.  Studying Query Expansion Effectiveness , 2009, ECIR.

[165]  In-Ho Kang,et al.  Query type classification for web document retrieval , 2003, SIGIR.

[166]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[167]  Wael Khreich,et al.  A Survey of Techniques for Event Detection in Twitter , 2015, Comput. Intell..

[168]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[169]  Claudio Carpineto,et al.  An information-theoretic approach to automatic query expansion , 2001, TOIS.

[170]  Patrick Pantel,et al.  Discovery of inference rules for question-answering , 2001, Natural Language Engineering.

[171]  Peter Willett,et al.  Recent trends in hierarchic document clustering: A critical review , 1988, Inf. Process. Manag..

[172]  Jack Minker,et al.  An evaluation of query expansion by the addition of clustered terms for a document retrieval system , 1972, Inf. Storage Retr..

[173]  Beixing Deng,et al.  Concept Based Query Expansion Using WordNet , 2009, 2009 International e-Conference on Advanced Science and Technology.

[174]  Jennifer Chu-Carroll,et al.  Semantic search via XML fragments: a high-precision approach to IR , 2006, SIGIR.

[175]  Paul Buitelaar,et al.  Query Expansion Using Wikipedia and Dbpedia , 2012, CLEF.

[176]  Klamer Schutte,et al.  Knowledge based query expansion in complex multimedia event detection , 2016, Multimedia Tools and Applications.

[177]  Yi Liu,et al.  Statistical Machine Translation for Query Expansion in Answer Retrieval , 2007, ACL.

[178]  Jean-Pierre Chevallet,et al.  Wikipedia-based semantic query enrichment , 2013, ESAIR '13.

[179]  Kevyn Collins-Thompson,et al.  Reducing the risk of query expansion via robust constrained optimization , 2009, CIKM.

[180]  Chris D. Paice An evaluation method for stemming algorithms , 1994, SIGIR '94.

[181]  G. Salton,et al.  A Generalized Term Dependence Model in Information Retrieval , 1983 .

[182]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[183]  Gu Si-yang,et al.  Privacy preserving association rule mining in vertically partitioned data , 2006 .

[184]  Gianni Amati,et al.  Probability models for information retrieval based on divergence from randomness , 2003 .

[185]  Raymond Y. K. Lau,et al.  Belief revision for adaptive information retrieval , 2004, SIGIR '04.

[186]  Pierre Genevès,et al.  XML query-update independence analysis revisited , 2012, DocEng '12.

[187]  Gerhard Weikum,et al.  Exploiting correlated keywords to improve approximate information filtering , 2008, SIGIR '08.

[188]  Spiros Skiadopoulos,et al.  Query Reorganization Algorithms for Efficient Boolean Information Filtering , 2017, IEEE Transactions on Knowledge and Data Engineering.

[189]  Carolyn J. Crouch,et al.  Experiments in automatic statistical thesaurus construction , 1992, SIGIR '92.

[190]  Jian-Yun Nie,et al.  Context-Dependent Term Relations for Information Retrieval , 2006, EMNLP.

[191]  Philippe Mulhem,et al.  Toward Word Embedding for Personalized Information Retrieval , 2016, SIGIR 2016.

[192]  Sung-Hyon Myaeng,et al.  Wikipedia-based query phrase expansion in patent class search , 2013, Information Retrieval.

[193]  Ricardo A. Baeza-Yates,et al.  Extracting semantic relations from query logs , 2007, KDD '07.

[194]  Stephen E. Robertson,et al.  Interactive Thesaurus Navigation: Intelligence Rules OK? , 1995, J. Am. Soc. Inf. Sci..

[195]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[196]  Utpal Garain,et al.  Using Word Embeddings for Automatic Query Expansion , 2016, ArXiv.

[197]  Alberto Del Bimbo,et al.  Socializing the Semantic Gap , 2015, ACM Comput. Surv..

[198]  Sudeshna Sarkar,et al.  Using Word Embeddings for Query Translation for Hindi to English Cross Language Information Retrieval , 2016, Computación y Sistemas.

[199]  Wei-Ying Ma,et al.  Query Expansion by Mining User Logs , 2003, IEEE Trans. Knowl. Data Eng..

[200]  Séamus Lawless,et al.  A study of user profile representation for personalized cross-language information retrieval , 2016, Aslib J. Inf. Manag..

[201]  Alexander Mikroyannidis Toward a Social Semantic Web , 2007, Computer.

[202]  ChengXiang Zhai,et al.  A boosting approach to improving pseudo-relevance feedback , 2011, SIGIR.

[203]  Stephen E. Robertson,et al.  Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.

[204]  Min Song,et al.  Integration of association rules and ontologies for semantic query expansion , 2007, Data Knowl. Eng..

[205]  An Zeng,et al.  Behavior patterns of online users and the effect on information filtering , 2011, ArXiv.

[206]  Hakim Hacid,et al.  PerSaDoR: Personalized social document representation for improving web search , 2016, Inf. Sci..

[207]  Douglas W. Oard,et al.  Dictionary-based techniques for cross-language information retrieval , 2005, Inf. Process. Manag..

[208]  Jan Cernocký,et al.  Comparison of methods for language-dependent and language-independent query-by-example spoken term detection , 2012, TOIS.

[209]  W. Bruce Croft,et al.  Quary Expansion Using Local and Global Document Analysis , 1996, SIGIR Forum.

[210]  James Allan,et al.  Entity query feature expansion using knowledge base links , 2014, SIGIR.

[211]  Antoni B. Chan,et al.  Audio Information Retrieval using Semantic Similarity , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[212]  M. E. Maron,et al.  On Relevance, Probabilistic Indexing and Information Retrieval , 1960, JACM.

[213]  Walid Magdy,et al.  A study on query expansion methods for patent retrieval , 2011, PaIR '11.

[214]  Xiangji Huang,et al.  A Simple Enhancement for Ad-hoc Information Retrieval via Topic Modelling , 2016, SIGIR.

[215]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[216]  Lisa Ballesteros,et al.  Cross-Language Retrieval via Transitive Translation , 2002 .

[217]  Rongrong Ji,et al.  Large-scale visual sentiment ontology and detectors using adjective noun pairs , 2013, ACM Multimedia.

[218]  Iadh Ounis,et al.  Query reformulation using automatically generated query concepts from a document space , 2006, Inf. Process. Manag..

[219]  Hyo-Won Suh,et al.  A personalized query expansion approach for engineering document retrieval , 2014, Adv. Eng. Informatics.

[220]  W. Bruce Croft,et al.  Improving the effectiveness of information retrieval with local context analysis , 2000, TOIS.

[221]  Rada Mihalcea,et al.  Using WordNet and Lexical Operators to Improve Internet Searches , 2000, IEEE Internet Comput..

[222]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[223]  W. Bruce Croft,et al.  Resolving ambiguity for cross-language retrieval , 1998, SIGIR '98.

[224]  Laura Dietz,et al.  A neighborhood relevance model for entity linking , 2013, OAIR.

[225]  Howard R. Turtle Natural language vs. Boolean query evaluation: a comparison of retrieval performance , 1994, SIGIR '94.

[226]  James Allan,et al.  Incremental relevance feedback for information filtering , 1996, SIGIR '96.

[227]  Cristina V. Lopes,et al.  Thesaurus-based automatic query expansion for interface-driven code search , 2014, MSR 2014.

[228]  Hwee Tou Ng,et al.  Word Sense Disambiguation Improves Information Retrieval , 2012, ACL.

[229]  Vincent Claveau,et al.  Automatic Morphological Query Expansion Using Analogy-Based Machine Learning , 2007, ECIR.

[230]  Elena Cabrio,et al.  6th Open Challenge on Question Answering over Linked Data (QALD-6) , 2016, SemWebEval@ESWC.

[231]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[232]  Peter Willett,et al.  The limitations of term co-occurrence data for query expansion in document retrieval systems , 1991, J. Am. Soc. Inf. Sci..

[233]  W. Bruce Croft,et al.  Predicting query performance , 2002, SIGIR '02.

[234]  Zhiguo Gong,et al.  Multi-term Web Query Expansion Using WordNet , 2006, DEXA.

[235]  W. Bruce Croft,et al.  Parameterized concept weighting in verbose queries , 2011, SIGIR.

[236]  Aditi Sharan,et al.  A new fuzzy logic-based query expansion model for efficient information retrieval using relevance feedback approach , 2017, Neural Computing and Applications.

[237]  Jan Snajder,et al.  Evaluation of Manual Query Expansion Rules on a Domain Specific FAQ Collection , 2015, CLEF.

[238]  Hakim Hacid,et al.  LAICOS: an open source platform for personalized social web search , 2013, KDD.

[239]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[240]  Gerhard Weikum,et al.  Exploiting social relations for query expansion and result ranking , 2008, 2008 IEEE 24th International Conference on Data Engineering Workshop.

[241]  P. Smith,et al.  A review of ontology based query expansion , 2007, Inf. Process. Manag..

[242]  Min Xie,et al.  CCCF: Improving Collaborative Filtering via Scalable User-Item Co-Clustering , 2016, WSDM.

[243]  Yi Liu,et al.  Translating Queries into Snippets for Improved Query Expansion , 2008, COLING.

[244]  Jianfeng Gao,et al.  Extending query translation to cross-language query expansion with markov chain models , 2007, CIKM '07.

[245]  Peng Wang,et al.  Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification , 2016, Neurocomputing.

[246]  P. Jaccard THE DISTRIBUTION OF THE FLORA IN THE ALPINE ZONE.1 , 1912 .

[247]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[248]  Stephen Clark,et al.  Syntactic Processing Using the Generalized Perceptron and Beam Search , 2011, CL.

[249]  Enhong Chen,et al.  Context-aware query suggestion by mining click-through and session data , 2008, KDD.

[250]  Jian-Yun Nie,et al.  Diversified query expansion using conceptnet , 2013, CIKM.

[251]  Korris Fu-Lai Chung,et al.  Improving weak ad-hoc queries using wikipedia asexternal corpus , 2007, SIGIR.

[252]  Aviezri S. Fraenkel,et al.  Local Feedback in Full-Text Retrieval Systems , 1977, JACM.

[253]  Reed McEwan,et al.  Corpus domain effects on distributional semantic modeling of medical terms , 2016, Bioinform..

[254]  Brad A. Myers,et al.  Improving user performance on Boolean queries , 2000, CHI Extended Abstracts.

[255]  Dong Zhou,et al.  Query expansion for personalized cross-language information retrieval , 2015, 2015 10th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP).

[256]  Zhendong Niu,et al.  Concept Based Query Expansion , 2013, 2013 Ninth International Conference on Semantics, Knowledge and Grids.

[257]  Stephen E. Robertson,et al.  Selecting good expansion terms for pseudo-relevance feedback , 2008, SIGIR '08.

[258]  Eugene Agichtein,et al.  Finding the right facts in the crowd: factoid question answering over social media , 2008, WWW.

[259]  Dong Liu,et al.  Image retrieval with query-adaptive hashing , 2013, TOMCCAP.

[260]  W. Bruce Croft,et al.  Using Probabilistic Models of Document Retrieval without Relevance Information , 1979, J. Documentation.

[261]  Mounia Lalmas,et al.  A survey on the use of relevance feedback for information access systems , 2003, The Knowledge Engineering Review.

[262]  Claudio Carpineto,et al.  Improving retrieval feedback with multiple term-ranking function combination , 2002, TOIS.

[263]  Hatem Haddad,et al.  Towards an effective automatic query expansion process using an association rule mining approach , 2012, Journal of Intelligent Information Systems.

[264]  C. J. van Rijsbergen,et al.  A Non-Classical Logic for Information Retrieval , 1997, Comput. J..

[265]  Iadh Ounis,et al.  Combining fields for query expansion and adaptive query expansion , 2007, Inf. Process. Manag..

[266]  Alan F. Smeaton,et al.  TREC-4 Experiments at Dublin City University: Thresholding Posting Lists, Query Expansion with WordNet and POS Tagging of Spanish , 1995, TREC.

[267]  Fabio Crestani,et al.  The effect of citation analysis on query expansion for patent retrieval , 2013, Information Retrieval.

[268]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[269]  Swapan K. Parui,et al.  Incremental blind feedback , 2014, ACM Trans. Asian Lang. Inf. Process..

[270]  ChengXiang Zhai,et al.  Tapping into knowledge base for concept feedback: leveraging ConceptNet to improve search results for difficult queries , 2012, WSDM '12.

[271]  Turid Hedlund,et al.  Dictionary-Based Cross-Language Information Retrieval: Problems, Methods, and Research Findings , 2001, Information Retrieval.

[272]  Enhong Chen,et al.  Improving search relevance for short queries in community question answering , 2014, WSDM.

[273]  Stephen E. Robertson,et al.  Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive , 1998, TREC.

[274]  Oren Kurland,et al.  Query Expansion Using Word Embeddings , 2016, CIKM.

[275]  Vincent P. Wade,et al.  Personalised Information Retrieval: survey and classification , 2013, User Modeling and User-Adapted Interaction.

[276]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[277]  Josep-Lluís Larriba-Pey,et al.  Query Expansion via structural motifs in Wikipedia Graph , 2016, ArXiv.

[278]  A. R. Rivas,et al.  Study of Query Expansion Techniques and Their Application in the Biomedical Information Retrieval , 2014, TheScientificWorldJournal.

[279]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[280]  Yang Xu,et al.  Query dependent pseudo-relevance feedback based on Wikipedia , 2009, SIGIR.

[281]  W. Bruce Croft,et al.  Latent concept expansion using Markov random fields , 2007, SIGIR.

[282]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[283]  Stephen E. Robertson,et al.  Microsoft Cambridge at TREC-9: Filtering Track , 2000, TREC.

[284]  H. V. Jagadish,et al.  A Structured Query Model for the Deep Relational Web , 2015, CIKM.

[285]  Xianghui Zhao,et al.  Query Processing Based on Associated Semantic Context Inference , 2015, 2015 2nd International Conference on Information Science and Control Engineering.

[286]  K. Sparck Jones,et al.  General query expansion techniques for spoken document retrieval , 1999 .

[287]  W. Bruce Croft,et al.  Dictionary Methods for Cross-Lingual Information Retrieval , 1996, DEXA.

[288]  Peretz Shoval,et al.  Information Filtering: Overview of Issues, Research and Systems , 2001, User Modeling and User-Adapted Interaction.

[289]  Yongdong Zhang,et al.  Contextual Query Expansion for Image Retrieval , 2014, IEEE Transactions on Multimedia.

[290]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[291]  Ji-Rong Wen,et al.  A Proximity Probabilistic Model for Information Retrieval , 2011 .

[292]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[293]  Nicholas J. Belkin,et al.  Information filtering and information retrieval: two sides of the same coin? , 1992, CACM.

[294]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[295]  John D. Lafferty,et al.  Model-based feedback in the language modeling approach to information retrieval , 2001, CIKM '01.

[296]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[297]  James Allan,et al.  Real-time Query Expansion in Relevance Models , 2006 .

[298]  Jean-Pierre Chevallet,et al.  A Comparison of Deep Learning Based Query Expansion with Pseudo-Relevance Feedback and Mutual Information , 2016, ECIR.