Enhancing personalized search by mining and modeling task behavior

Personalized search systems tailor search results to the current user intent using historic search interactions. This relies on being able to find pertinent information in that user's search history, which can be challenging for unseen queries and for new search scenarios. Building richer models of users' current and historic search tasks can help improve the likelihood of finding relevant content and enhance the relevance and coverage of personalization methods. The task-based approach can be applied to the current user's search history, or as we focus on here, all users' search histories as so-called "groupization" (a variant of personalization whereby other users' profiles can be used to personalize the search experience). We describe a method whereby we mine historic search-engine logs to find other users performing similar tasks to the current user and leverage their on-task behavior to identify Web pages to promote in the current ranking. We investigate the effectiveness of this approach versus query-based matching and finding related historic activity from the current user (i.e., group versus individual). As part of our studies we also explore the use of the on-task behavior of particular user cohorts, such as people who are expert in the topic currently being searched, rather than all other users. Our approach yields promising gains in retrieval performance, and has direct implications for improving personalization in search systems.

[1]  Doug Downey,et al.  Understanding the relationship between searchers' queries and information goals , 2008, CIKM '08.

[2]  Virgílio A. F. Almeida,et al.  A community-aware search engine , 2004, WWW '04.

[3]  Ryen W. White,et al.  Probabilistic models for personalizing web search , 2012, WSDM '12.

[4]  Ji-Rong Wen,et al.  A large-scale evaluation and analysis of personalized search strategies , 2007, WWW '07.

[5]  Susan T. Dumais,et al.  Evaluating implicit measures to improve the search experiences , 2003 .

[6]  Susan T. Dumais,et al.  Personalizing Search via Automated Analysis of Interests and Activities , 2005, SIGIR.

[7]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[8]  Kenneth Ward Church,et al.  Entropy of search logs: how hard is search? with personalization? with backoff? , 2008, WSDM '08.

[9]  Michael A. Shepherd,et al.  A Field Study Characterizing Web-based Information Seeking Tasks , 2022 .

[10]  Enhong Chen,et al.  Context-aware ranking in web search , 2010, SIGIR '10.

[11]  Filip Radlinski,et al.  Inferring and using location metadata to personalize web search , 2011, SIGIR.

[12]  Quoc V. Le,et al.  Learning to Rank with Non-Smooth Cost Functions , 2007 .

[13]  Susan T. Dumais,et al.  Classification-enhanced ranking , 2010, WWW '10.

[14]  Jianfeng Gao,et al.  Learning Lexicon Models from Search Logs for Query Expansion , 2012, EMNLP.

[15]  Ahmed Hassan Awadallah,et al.  Beyond DCG: user behavior as a predictor of a successful search , 2010, WSDM '10.

[16]  Nicholas J. Belkin,et al.  Personalizing information retrieval for multi-session tasks: the roles of task stage and task type , 2010, SIGIR '10.

[17]  John D. Lafferty,et al.  Information Retrieval as Statistical Translation , 2017 .

[18]  Wei Yuan,et al.  Smoothing clickthrough data for web search ranking , 2009, SIGIR.

[19]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[20]  Carolyn Watters,et al.  A field study characterizing Web-based information-seeking tasks , 2007 .

[21]  Huan Liu,et al.  CubeSVD: a novel approach to personalized Web search , 2005, WWW '05.

[22]  Yang Song,et al.  A task level metric for measuring web search satisfaction and its application on improving relevance estimation , 2011, CIKM '11.

[23]  Ryen W. White,et al.  Mining the search trails of surfing crowds: identifying relevant websites from user activity , 2008, WWW.

[24]  Masatoshi Yoshikawa,et al.  Adaptive web search based on user profile constructed without any effort from users , 2004, WWW '04.

[25]  Young-Jin Lee,et al.  VisSearch: A collaborative Web searching environment , 2005, Comput. Educ..

[26]  ChengXiang Zhai,et al.  Implicit user modeling for personalized search , 2005, CIKM '05.

[27]  Ryen W. White,et al.  Modeling and analysis of cross-session search tasks , 2011, SIGIR.

[28]  Wei Chu,et al.  Modeling the impact of short- and long-term behavior on search personalization , 2012, SIGIR '12.

[29]  Jacek Gwizdka,et al.  Search behaviors in different task types , 2010, JCDL '10.

[30]  Jaime Teevan,et al.  Understanding and predicting personal navigation , 2011, WSDM '11.

[31]  Filip Radlinski,et al.  Query chains: learning to rank from implicit feedback , 2005, KDD '05.

[32]  Nicholas J. Belkin,et al.  A faceted approach to conceptualizing tasks in information seeking , 2008, Inf. Process. Manag..

[33]  Ryen W. White,et al.  Predicting user interests from contextual information , 2009, SIGIR.

[34]  Barry Smyth,et al.  A Community-Based Approach to Personalizing Web Search , 2007, Computer.

[35]  Ingmar Weber,et al.  The demographics of web search , 2010, SIGIR.

[36]  Ryen W. White,et al.  Predicting short-term interests using activity-based search context , 2010, CIKM.

[37]  Yang Song,et al.  Evaluating the effectiveness of search task trails , 2012, WWW.

[38]  Meredith Ringel Morris,et al.  Discovering and using groups to improve personalized search , 2009, WSDM '09.

[39]  Filip Radlinski,et al.  Inferring query intent from reformulations and clicks , 2010, WWW '10.

[40]  Bonnie MacKay,et al.  Exploring multi-session web tasks , 2008, CHI.

[41]  Ryen W. White,et al.  Characterizing local interests and local knowledge , 2012, CHI.

[42]  Jianfeng Gao,et al.  Ranking, Boosting, and Model Adaptation , 2008 .

[43]  ChengXiang Zhai,et al.  Mining long-term search history to improve search accuracy , 2006, KDD '06.

[44]  Barry Smyth,et al.  Cooperating Search Communities , 2006, AH.

[45]  Rosie Jones,et al.  Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs , 2008, CIKM '08.

[46]  Susan T. Dumais,et al.  Improving Web Search Ranking by Incorporating User Behavior Information , 2019, SIGIR Forum.

[47]  Jianfeng Gao,et al.  Clickthrough-based translation models for web search: from word models to phrase models , 2010, CIKM.

[48]  Ahmed Hassan Awadallah A semi-supervised approach to modeling web search satisfaction , 2012 .

[49]  Xiaojun Yuan,et al.  Domain knowledge, search behaviour, and search effectiveness of engineering and science students: an exploratory study , 2005, Inf. Res..

[50]  Fabrizio Silvestri,et al.  Identifying task-based sessions in search engine query logs , 2011, WSDM '11.

[51]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.