Personalizing Search via Automated Analysis of Interests and Activities

We formulate and study search algorithms that consider a user's prior interactions with a wide variety of content to personalize that user's current Web search. Rather than relying on the unrealistic assumption that people will precisely specify their intent when searching, we pursue techniques that leverage implicit information about the user's interests. This information is used to re-rank Web search results within a relevance feedback framework. We explore rich models of user interests, built from both search-related information, such as previously issued queries and previously visited Web pages, and other information about the user such as documents and email the user has read and created. Our research suggests that rich representations of the user and the corpus are important for personalization, but that it is possible to approximate these representations and provide efficient client-side algorithms for personalizing search. We show that such personalization algorithms can significantly improve on current Web search.

[1]  Jaime Teevan,et al.  The Re:Search Engine Helping People Return to Information on the Web , 2005 .

[2]  Jennifer Widom,et al.  Scaling personalized web search , 2003, WWW '03.

[3]  Susan T. Dumais,et al.  Beyond the Commons: Investigating the Value of Personalizing Web Search , 2005 .

[4]  Krishna Bharat SearchPad: explicit capture of search context to support Web search , 2000, Comput. Networks.

[5]  Nicholas J. Belkin,et al.  A case for interaction: a study of interactive information retrieval behavior and effectiveness , 1996, CHI.

[6]  Clement T. Yu,et al.  Personalized web search by mapping user queries to categories , 2002, CIKM '02.

[7]  Peter G. Anick Using terminological feedback for web search refinement: a log-based study , 2003, SIGIR.

[8]  Martha Sideri,et al.  The Compass Filter: Search Engine Result Personalization Using Web Communities , 2003, ITWP.

[9]  L. Adler A Modification of Kendall's Tau for the Case of Arbitrary Ties in Both Rankings , 1957 .

[10]  Kristian J. Hammond,et al.  Watson: Anticipating and Contextualizing Information Needs , 1999 .

[11]  Jaime Teevan,et al.  Implicit feedback for inferring user preference: a bibliography , 2003, SIGF.

[12]  David Hawking,et al.  Overview of the TREC-2001 Web track , 2002 .

[13]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[14]  ChengXiang Zhai,et al.  Exploiting query history for document ranking in interactive information retrieval , 2003, SIGIR '03.

[15]  Mounia Lalmas,et al.  A survey on the use of relevance feedback for information access systems , 2003, The Knowledge Engineering Review.

[16]  Masatoshi Yoshikawa,et al.  Adaptive web search based on user profile constructed without any effort from users , 2004, WWW '04.

[17]  Yoichi Shinoda,et al.  Information filtering based on user behavior analysis and best match text retrieval , 1994, SIGIR '94.

[18]  K. Sparck Jones,et al.  A Probabilistic Model of Information Retrieval : Development and Status , 1998 .

[19]  Mark S. Ackerman,et al.  The perfect search engine is not enough: a study of orienteering behavior in directed search , 2004, CHI.

[20]  Hinrich Schütze,et al.  Personalized search , 2002, CACM.

[21]  Vasileios Hatzivassiloglou,et al.  Leveraging a common representation for personalized search and summarization in a medical digital library , 2003, 2003 Joint Conference on Digital Libraries, 2003. Proceedings..

[22]  Alexander Pretschner,et al.  Ontology-based personalized search and browsing , 2003, Web Intell. Agent Syst..