Structural annotation of search queries using pseudo-relevance feedback

Marking up queries with annotations such as part-of-speech tags, capitalization, and segmentation is an important part of many approaches to query processing and understanding. Because they are brief and idiosyncratically structured, search queries pose a challenge to existing annotation tools, which are commonly trained on full-length documents. To address this challenge, we view the query as an explicit representation of a latent information need. This view allows us to use pseudo-relevance feedback, and to leverage additional information from the document corpus, in order to improve the quality of query annotation.
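To make the general idea concrete, the following is a minimal sketch of one way pseudo-relevance feedback could inform a single annotation type, capitalization. It is an illustration under stated assumptions, not the method of the paper: the toy overlap-based `retrieve` function, the majority-vote rule, and all names (`tokenize`, `retrieve`, `annotate_capitalization`, the sample corpus) are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into alphanumeric tokens, preserving their original case."""
    return re.findall(r"[A-Za-z0-9]+", text)


def retrieve(query, corpus, k=3):
    """Toy retrieval (an assumption): rank documents by how many distinct
    query terms they contain, and return the top k with non-zero overlap."""
    terms = set(query.lower().split())
    scored = []
    for doc in corpus:
        doc_terms = {t.lower() for t in tokenize(doc)}
        scored.append((len(terms & doc_terms), doc))
    # sort is stable, so ties keep corpus order
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in scored[:k] if score > 0]


def annotate_capitalization(query, corpus, k=3):
    """For each query term, pick the surface form seen most often in the
    top-k retrieved documents; fall back to the lowercased term if unseen."""
    feedback_docs = retrieve(query, corpus, k)
    annotated = []
    for term in query.lower().split():
        forms = Counter()
        for doc in feedback_docs:
            forms.update(t for t in tokenize(doc) if t.lower() == term)
        annotated.append(forms.most_common(1)[0][0] if forms else term)
    return annotated


corpus = [
    "Cheap hotels in New York",
    "The New York subway map",
    "I bought a new phone in york",
    "bananas are yellow",
]
result = annotate_capitalization("new york hotels", corpus)
print(result)
```

In this toy corpus, the capitalized forms of "new" and "york" outnumber the lowercase ones in the retrieved documents, so the vote favors them, while "hotels" stays lowercase. A realistic system would replace the overlap ranking with a proper retrieval model and would likely weight documents by their retrieval score.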
