Cohort modeling for enhanced personalized search

Web search engines utilize behavioral signals to develop search experiences tailored to individual users. To be effective, such personalization relies on access to sufficient information about each user's interests and intentions. For new users or new queries, profile information may be sparse or non-existent. To handle these cases, and perhaps also improve personalization for those with profiles, search engines can employ signals from users who are similar along one or more dimensions, i.e., those in the same cohort. In this paper we describe a characterization and evaluation of the use of such cohort modeling to enhance search personalization. We experiment with three pre-defined cohorts-topic, location, and top-level domain preference-independently and in combination, and also evaluate methods to learn cohorts dynamically. We show via extensive experimentation with large-scale logs from a commercial search engine that leveraging cohort behavior can yield significant relevance gains when combined with a production search engine ranking algorithm that uses similar classes of personalization signal but at the individual searcher level. Additional experiments show that our gains can be extended when we dynamically learn cohorts and target easily-identifiable classes of ambiguous or unseen queries.

[1]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[2]  Jaime Teevan,et al.  Information re-retrieval: repeat queries in Yahoo's logs , 2007, SIGIR.

[3]  David M. Pennock,et al.  Categories and Subject Descriptors , 2001 .

[4]  Filip Radlinski,et al.  Inferring and using location metadata to personalize web search , 2011, SIGIR.

[5]  Barry Smyth,et al.  A Community-Based Approach to Personalizing Web Search , 2007, Computer.

[6]  Barry Smyth,et al.  Cooperating Search Communities , 2006, AH.

[7]  Bart Selman,et al.  Referral Web: combining social networks and collaborative filtering , 1997, CACM.

[8]  Yeye He,et al.  Mining acronym expansions and their meanings using query click log , 2013, WWW.

[9]  Ryen W. White,et al.  Characterizing the influence of domain expertise on web search behavior , 2009, WSDM '09.

[10]  Jianfeng Gao,et al.  Ranking, Boosting, and Model Adaptation , 2008 .

[11]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[12]  Ryen W. White,et al.  Predicting short-term interests using activity-based search context , 2010, CIKM.

[13]  Huan Liu,et al.  CubeSVD: a novel approach to personalized Web search , 2005, WWW '05.

[14]  ChengXiang Zhai,et al.  Mining long-term search history to improve search accuracy , 2006, KDD '06.

[15]  Jaime Teevan,et al.  Understanding and predicting personal navigation , 2011, WSDM '11.

[16]  Ryen W. White,et al.  Mining the search trails of surfing crowds: identifying relevant websites from user activity , 2008, WWW.

[17]  Nina Mishra,et al.  Domain bias in web search , 2012, WSDM '12.

[18]  Meredith Ringel Morris,et al.  Discovering and using groups to improve personalized search , 2009, WSDM '09.

[19]  Enhong Chen,et al.  Context-aware ranking in web search , 2010, SIGIR '10.

[20]  John D. Lafferty,et al.  Information Retrieval as Statistical Translation , 2017 .

[21]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[22]  Ingmar Weber,et al.  The demographics of web search , 2010, SIGIR.

[23]  Wei Chu,et al.  Modeling the impact of short- and long-term behavior on search personalization , 2012, SIGIR '12.

[24]  Hsuan-Tien Lin,et al.  An Ensemble Ranking Solution for the Yahoo ! Learning to Rank Challenge , 2010 .

[25]  Wei Chu,et al.  Enhancing personalized search by mining and modeling task behavior , 2013, WWW.

[26]  Ryen W. White,et al.  Predicting user interests from contextual information , 2009, SIGIR.

[27]  Virgílio A. F. Almeida,et al.  A community-aware search engine , 2004, WWW '04.

[28]  Ryen W. White,et al.  Probabilistic models for personalizing web search , 2012, WSDM '12.

[29]  Susan T. Dumais,et al.  Personalizing Search via Automated Analysis of Interests and Activities , 2005, SIGIR.

[30]  Kenneth Ward Church,et al.  Entropy of search logs: how hard is search? with personalization? with backoff? , 2008, WSDM '08.

[31]  Quoc V. Le,et al.  Learning to Rank with Non-Smooth Cost Functions , 2007 .

[32]  Erick Cantú-Paz,et al.  Personalized click prediction in sponsored search , 2010, WSDM '10.

[33]  Qiang Yang,et al.  Personalized click model through collaborative filtering , 2012, WSDM '12.

[34]  Susan T. Dumais,et al.  Improving Web Search Ranking by Incorporating User Behavior Information , 2019, SIGIR Forum.

[35]  ChengXiang Zhai,et al.  Implicit user modeling for personalized search , 2005, CIKM '05.

[36]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[37]  Ryen W. White,et al.  Characterizing local interests and local knowledge , 2012, CHI.

[38]  Masatoshi Yoshikawa,et al.  Adaptive web search based on user profile constructed without any effort from users , 2004, WWW '04.

[39]  Young-Jin Lee,et al.  VisSearch: A collaborative Web searching environment , 2005, Comput. Educ..

[40]  Susan T. Dumais,et al.  Classification-enhanced ranking , 2010, WWW '10.