Intent models for contextualising and diversifying query suggestions

The query suggestion or auto-completion mechanisms help users to type less while interacting with a search engine. A basic approach that ranks suggestions according to their frequency in query logs is suboptimal. Firstly, many candidate queries with the same prefix can be removed as redundant. Secondly, the suggestions can also be personalised based on the user's context. These two directions to improve the mechanisms' quality can be in opposition: while the latter aims to promote suggestions that address search intents that a user is likely to have, the former aims to diversify the suggestions to cover as many intents as possible. We introduce a contextualisation framework that utilises a short-term context using the user's behaviour within the current search session, such as the previous query, the documents examined, and the candidate query suggestions that the user has discarded. This short-term context is used to contextualise and diversify the ranking of query suggestions, by modelling the user's information need as a mixture of intent-specific user models. The evaluation is performed offline on a set of approximately 1.0M test user sessions. Our results suggest that the proposed approach significantly improves query suggestions compared to the baseline approach.

[1]  Charles L. A. Clarke,et al.  Overview of the TREC 2011 Web Track , 2011, TREC.

[2]  ChengXiang Zhai,et al.  Statistical Language Models for Information Retrieval: A Critical Review , 2008, Found. Trends Inf. Retr..

[3]  Sreenivas Gollapudi,et al.  Diversifying search results , 2009, WSDM '09.

[4]  Michael R. Lyu,et al.  Diversifying Query Suggestion Results , 2010, AAAI.

[5]  Olivier Chapelle,et al.  A dynamic bayesian network click model for web search ranking , 2009, WWW '09.

[6]  Yong Yu,et al.  Identifying ambiguous queries in web search , 2007, WWW '07.

[7]  Anil K. Jain,et al.  Unsupervised Learning of Finite Mixture Models , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Ziv Bar-Yossef,et al.  Context-sensitive query auto-completion , 2011, WWW.

[9]  Milad Shokouhi,et al.  Time-sensitive query auto-completion , 2012, SIGIR '12.

[10]  Milad Shokouhi,et al.  Learning to personalize query auto-completion , 2013, SIGIR.

[11]  Yang Song,et al.  Post-ranking query suggestion by diversifying search results , 2011, SIGIR '11.

[12]  Naonori Ueda,et al.  Deterministic annealing EM algorithm , 1998, Neural Networks.

[13]  Pablo Castells,et al.  Personalized diversification of search results , 2012, SIGIR '12.

[14]  Emre Velipasaoglu,et al.  Intent-based diversification of web search results: metrics and algorithms , 2011, Information Retrieval.

[15]  Craig MacDonald,et al.  Exploiting query reformulations for web search result diversification , 2010, WWW '10.

[16]  Sanjay Ghemawat,et al.  MapReduce: simplified data processing on large clusters , 2008, CACM.

[17]  Craig MacDonald,et al.  User model-based metrics for offline query suggestion evaluation , 2013, SIGIR.

[18]  Enhong Chen,et al.  Towards context-aware search by learning a very large variable length hidden markov model from search logs , 2009, WWW '09.

[19]  Enhong Chen,et al.  Context-aware query suggestion by mining click-through and session data , 2008, KDD.

[20]  Xiaohui Yan,et al.  Context-aware query recommendation by learning high-order relation in query logs , 2011, CIKM '11.

[21]  Marius Pasca,et al.  Weakly-supervised discovery of named entities using web search queries , 2007, CIKM '07.

[22]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[23]  Umut Ozertem,et al.  Learning to suggest: a machine learning framework for ranking query suggestions , 2012, SIGIR '12.

[24]  Craig MacDonald,et al.  Intent-aware search result diversification , 2011, SIGIR.

[25]  Ryen W. White,et al.  Probabilistic models for personalizing web search , 2012, WSDM '12.

[26]  Pavel Serdyukov,et al.  Actualization of query suggestions using query logs , 2012, WWW.