Query Phrase Suggestion from Topically Tagged Session Logs

Searchers' difficulty in formulating effective queries for their information needs is well known. Analysis of search session logs shows that users often pose short, vague queries and then struggle with revising them. Interactive query expansion (users selecting terms to add to their queries) dramatically improves effectiveness and satisfaction. Suggesting relevant candidate expansion terms based on the initial query enables users to satisfy their information needs faster. We find that suggesting query phrases other users have found it necessary to add for a given query (mined from session logs) dramatically improves the quality of suggestions over simply using cooccurrence. However, this exacerbates the sparseness problem faced when mining short queries that lack features. To mitigate this, we tag query phrases with higher level topical categories to mine more general rules, finding that this enables us to make suggestions for approximately 10% more queries while maintaining an acceptable false positive rate.

[1]  Nicholas J. Belkin,et al.  The human element: helping people find what they don't know. , 2000 .

[2]  Alvin F. Martin,et al.  The DET curve in assessment of detection task performance , 1997, EUROSPEECH.

[3]  Pertti Vakkari,et al.  Subject Knowledge, Thesaurus-assisted Query Expansion and Search Success , 2004, RIAO.

[4]  Ji-Rong Wen,et al.  Query Clustering in the Web Context , 2003, Clustering and Information Retrieval.

[5]  Xin Fu,et al.  The loquacious user: a document-independent source of terms for query expansion , 2005, SIGIR '05.

[6]  Rosie Jones,et al.  Query word deletion prediction , 2003, SIGIR.

[7]  Ophir Frieder,et al.  Hourly analysis of a very large topically categorized web query log , 2004, SIGIR '04.

[8]  Noriaki Kawamae,et al.  Semantic log analysis based on a user query behavior model , 2003, Third IEEE International Conference on Data Mining.

[9]  Nicholas J. Belkin,et al.  Helping people find what they don't know , 2000, CACM.

[10]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[11]  David F. Gleich,et al.  SVD based term suggestion and ranking system , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[12]  Ricardo A. Baeza-Yates,et al.  Query Recommendation Using Query Logs in Search Engines , 2004, EDBT Workshops.

[13]  Xuehua Shen,et al.  Context-sensitive information retrieval using implicit feedback , 2005, SIGIR '05.

[14]  James Allan,et al.  A critical examination of TDT's cost function , 2002, SIGIR '02.

[15]  Amanda Spink,et al.  Use of query reformulation and relevance feedback by Excite users , 2000, Internet Res..

[16]  William R. Hersh,et al.  TREC 2002 Interactive Track Report , 2002, TREC.

[17]  Berthier A. Ribeiro-Neto,et al.  Concept-based interactive query expansion , 2005, CIKM '05.

[18]  Yen-Jen Oyang,et al.  Relevant term suggestion in interactive web search based on contextual information in query session logs , 2003, J. Assoc. Inf. Sci. Technol..