Predicting short-term interests using activity-based search context

A query considered in isolation offers limited information about a searcher's intent. Query context that considers pre-query activity (e.g., previous queries and page visits), can provide richer information about search intentions. In this paper, we describe a study in which we developed and evaluated user interest models for the current query, its context (from pre-query session activity), and their combination, which we refer to as intent. Using large-scale logs, we evaluate how accurately each model predicts the user's short-term interests under various experimental conditions. In our study we: (i) determine the extent of opportunity for using context to model intent; (ii) compare the utility of different sources of behavioral evidence (queries, search result clicks, and Web page visits) for building predictive interest models, and; (iii) investigate optimally combining the query and its context by learning a model that predicts the context weight for each query. Our findings demonstrate significant opportunity in leveraging contextual information, show that context and source influence predictive accuracy, and show that we can learn a near-optimal combination of the query and context for each query. The findings can inform the design of search systems that leverage contextual information to better understand, model, and serve searchers' information needs.

[1]  Susan T. Dumais,et al.  Learning user interaction models for predicting web search result preferences , 2006, SIGIR.

[2]  Susan T. Dumais,et al.  Classification-enhanced ranking , 2010, WWW '10.

[3]  Alexander Pretschner,et al.  Ontology-Based User Profiles for Search and Browsing , 2002 .

[4]  Ryen W. White,et al.  Predicting user interests from contextual information , 2009, SIGIR.

[5]  Susan T. Dumais,et al.  Evaluating implicit measures to improve the search experiences , 2003 .

[6]  Ellen M. Voorhees,et al.  TREC: Experiment and Evaluation in Information Retrieval (Digital Libraries and Electronic Publishing) , 2005 .

[7]  José Luis Vicedo González,et al.  TREC: Experiment and evaluation in information retrieval , 2007, J. Assoc. Inf. Sci. Technol..

[8]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[9]  Enhong Chen,et al.  Context-aware ranking in web search , 2010, SIGIR '10.

[10]  Doug Downey,et al.  Understanding the relationship between searchers' queries and information goals , 2008, CIKM '08.

[11]  Hinrich Schütze,et al.  Personalized search , 2002, CACM.

[12]  Susan Gauch,et al.  Personalizing Search Based on User Search Histories , 2004 .

[13]  Jaime Teevan,et al.  How people recall, recognize, and reuse search results , 2008, ACM Trans. Inf. Syst..

[14]  Jennifer Widom,et al.  Scaling personalized web search , 2003, WWW '03.

[15]  Alexander Pretschner,et al.  Ontology-based personalized search and browsing , 2003, Web Intell. Agent Syst..

[16]  Benjamin Piwowarski,et al.  Predictive user click models based on click-through history , 2007, CIKM '07.

[17]  Wolfgang Nejdl,et al.  Using ODP metadata to personalize search , 2005, SIGIR '05.

[18]  Susan T. Dumais,et al.  Improving Web Search Ranking by Incorporating User Behavior Information , 2019, SIGIR Forum.

[19]  Olivia R. Liu Sheng,et al.  Interest-based personalized search , 2007, TOIS.

[20]  ChengXiang Zhai,et al.  Mining long-term search history to improve search accuracy , 2006, KDD '06.

[21]  Raymond J. Mooney,et al.  Learning to Disambiguate Search Queries from Short Sessions , 2009, ECML/PKDD.

[22]  Xin Fu,et al.  The loquacious user: a document-independent source of terms for query expansion , 2005, SIGIR '05.

[23]  Enhong Chen,et al.  Context-aware query classification , 2009, SIGIR.

[24]  Susan T. Dumais,et al.  To personalize or not to personalize: modeling queries with variation in user intent , 2008, SIGIR '08.

[25]  Peter Ingwersen,et al.  The Turn - Integration of Information Seeking and Retrieval in Context , 2005, The Kluwer International Series on Information Retrieval.

[26]  Iain Campbell,et al.  The ostensive model of developing information needs , 2000 .

[27]  Ryen W. White,et al.  Mining Historic Query Trails to Label Long and Rare Search Engine Queries , 2010, TWEB.

[28]  Thorsten Joachims,et al.  WebWatcher : A Learning Apprentice for the World Wide Web , 1995 .

[29]  Lois M. L. Delcambre,et al.  Discounted Cumulated Gain Based Evaluation of Multiple-Query IR Sessions , 2008, ECIR.

[30]  Ryen W. White,et al.  WWW 2007 / Track: Browsers and User Interfaces Session: Personalization Investigating Behavioral Variability in Web Search , 2022 .

[31]  Susan T. Dumais,et al.  Analysis of topic dynamics in web search , 2005, WWW '05.

[32]  Ahmed Hassan Awadallah,et al.  Beyond DCG: user behavior as a predictor of a successful search , 2010, WSDM '10.

[33]  Enhong Chen,et al.  Context-aware query suggestion by mining click-through and session data , 2008, KDD.

[34]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[35]  Xuehua Shen,et al.  Context-sensitive information retrieval using implicit feedback , 2005, SIGIR '05.