Behavior‐based personalization in web search

Personalized search approaches tailor search results to users' current interests, so as to help improve the likelihood of a user finding relevant documents for their query. Previous work on personalized search focuses on using the content of the user's query and of the documents clicked to model the user's preference. In this paper we focus on a different type of signal: We investigate the use of behavioral information for the purpose of search personalization. That is, we consider clicks and dwell time for reranking an initially retrieved list of documents. In particular, we (i) investigate the impact of distributions of users and queries on document reranking; (ii) estimate the relevance of a document for a query at 2 levels, at the query‐level and at the word‐level, to alleviate the problem of sparseness; and (iii) perform an experimental evaluation both for users seen during the training period and for users not seen during training. For the latter, we explore the use of information from similar users who have been seen during the training period. We use the dwell time on clicked documents to estimate a document's relevance to a query, and perform Bayesian probabilistic matrix factorization to generate a relevance distribution of a document over queries. Our experiments show that: (i) for personalized ranking, behavioral information helps to improve retrieval effectiveness; and (ii) given a query, merging information inferred from behavior of a particular user and from behaviors of other users with a user‐dependent adaptive weight outperforms any combination with a fixed weight.

[1]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[2]  Bracha Shapira,et al.  Personalized search: Integrating collaboration and social networks , 2011, J. Assoc. Inf. Sci. Technol..

[3]  Raymond J. Mooney,et al.  Learning to Disambiguate Search Queries from Short Sessions , 2009, ECML/PKDD.

[4]  Filip Radlinski,et al.  Personalizing web search using long term browsing history , 2011, WSDM '11.

[5]  Radford M. Neal Probabilistic Inference Using Markov Chain Monte Carlo Methods , 2011 .

[6]  Oren Kurland,et al.  Corpus structure, language models, and ad hoc information retrieval , 2004, SIGIR '04.

[7]  Enhong Chen,et al.  Context-aware ranking in web search , 2010, SIGIR '10.

[8]  Ryen W. White,et al.  Mining the search trails of surfing crowds: identifying relevant websites from user activity , 2008, WWW.

[9]  ChengXiang Zhai,et al.  Implicit user modeling for personalized search , 2005, CIKM '05.

[10]  Wolfgang Nejdl,et al.  Using ODP metadata to personalize search , 2005, SIGIR '05.

[11]  Susan T. Dumais,et al.  Improving Web Search Ranking by Incorporating User Behavior Information , 2019, SIGIR Forum.

[12]  Suju Rajan,et al.  Beyond clicks: dwell time for personalization , 2014, RecSys '14.

[13]  Daqing He,et al.  Personalization via collaboration in Web retrieval systems: A context based approach , 2005, ASIST.

[14]  Ryen W. White,et al.  Understanding web browsing behaviors through Weibull analysis of dwell time , 2010, SIGIR.

[15]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[16]  Ryen W. White,et al.  Probabilistic models for personalizing web search , 2012, WSDM '12.

[17]  Filip Radlinski,et al.  Inferring and using location metadata to personalize web search , 2011, SIGIR.

[18]  Xuehua Shen,et al.  Context-sensitive information retrieval using implicit feedback , 2005, SIGIR '05.

[19]  Susan T. Dumais,et al.  To personalize or not to personalize: modeling queries with variation in user intent , 2008, SIGIR '08.

[20]  Hao Jiang,et al.  Mining User Dwell Time for Personalized Web Search Re-Ranking , 2011, IJCAI.

[21]  M. de Rijke,et al.  Click Models for Web Search , 2015, Click Models for Web Search.

[22]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[23]  Maksims Volkovs,et al.  Context Models For Web Search Personalization , 2015, ArXiv.

[24]  Di Jiang,et al.  Context-aware search personalization with concept preference , 2011, CIKM '11.

[25]  Douglas Farenick,et al.  Jensen's inequality relative to matrix-valued measures , 2007 .

[26]  Pavel Serdyukov,et al.  Personalization of web-search using short-term browsing context , 2013, CIKM.

[27]  Gary Marchionini,et al.  Synthesis Lectures on Information Concepts, Retrieval, and Services , 2009 .

[28]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[29]  Ji-Rong Wen,et al.  WWW 2007 / Track: Search Session: Personalization A Largescale Evaluation and Analysis of Personalized Search Strategies ABSTRACT , 2022 .

[30]  Xiaozhong Liu,et al.  Real-time user interest modeling for real-time ranking , 2013, J. Assoc. Inf. Sci. Technol..

[31]  ChengXiang Zhai,et al.  Mining long-term search history to improve search accuracy , 2006, KDD '06.

[32]  Ruslan Salakhutdinov,et al.  Bayesian probabilistic matrix factorization using Markov chain Monte Carlo , 2008, ICML '08.

[33]  Jingjing Liu,et al.  Personalizing information retrieval for multi‐session tasks: Examining the roles of task stage, task type, and topic knowledge on the interpretation of dwell time as an indicator of document usefulness , 2015, J. Assoc. Inf. Sci. Technol..

[34]  Ryen W. White,et al.  Personalizing web search results by reading level , 2011, CIKM '11.

[35]  Guocong Song songgc Point-Wise Approach for Yandex Personalized Web Search Challenge , 2014 .

[36]  Gleb Gusev,et al.  An Optimization Framework for Weighting Implicit Relevance Labels for Personalized Web Search , 2015, WWW.

[37]  Ryen W. White,et al.  Modeling dwell time to predict click-level satisfaction , 2014, WSDM.

[38]  Matthew Richardson,et al.  Predictive client-side profiles for personalized advertising , 2011, KDD.

[39]  Wei Chu,et al.  Personalized ranking model adaptation for web search , 2013, SIGIR.

[40]  Li Chen,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence GBPR: Group Preference Based Bayesian Personalized Ranking for One-Class Collaborative Filtering , 2022 .

[41]  M. de Rijke,et al.  Personalized document re-ranking based on Bayesian probabilistic matrix factorization , 2014, SIGIR.

[42]  Wei Chu,et al.  Modeling the impact of short- and long-term behavior on search personalization , 2012, SIGIR '12.

[43]  Ryen W. White,et al.  Predicting user interests from contextual information , 2009, SIGIR.

[44]  Wei Chu,et al.  Cohort modeling for enhanced personalized search , 2014, SIGIR.

[45]  Ryen W. White,et al.  Predicting short-term interests using activity-based search context , 2010, CIKM.