Personalizing Search Using Socially Enhanced Interest Model Built from the Stream of User's Activity

Older studies have proved that when searching information on the Web, users tend to write short queries, unconsciously trying to minimize the cognitive load. However, as these short queries are very ambiguous, search engines tend to find the most popular meaning - someone who does not know anything about cascading stylesheets might search for a music band called css and be very surprised about the results. In this paper we propose a method which can infer additional keywords for a search query by leveraging a social network context and a method to build this network from the stream of user's activity on the Web. The approach was evaluated on real users using a personalized proxy server platform. The query expansion method was integrated into Google search engine and where possible, the original query was expanded and additional search results were retrieved and displayed. 70% of the expanded results were clicked and we observed a significant increase of time that the users spent on the expanded results when compared to the time spent on standard results.

[1]  Michal Barla Towards Social-based User Modeling and Personalization , 2010 .

[2]  Meredith Ringel Morris,et al.  Discovering and using groups to improve personalized search , 2009, WSDM '09.

[3]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[4]  Hinrich Schütze,et al.  Personalized search , 2002, CACM.

[5]  R. Conejo,et al.  MEDEA: an Open Service-Based Learning Platform for Developing Intelligent Educational Systems for the Web , 2005 .

[6]  Xu Sun,et al.  A Large Scale Ranker-Based System for Search Query Spelling Correction , 2010, COLING.

[7]  Eric Horvitz,et al.  SearchTogether: an interface for collaborative web search , 2007, UIST.

[8]  Mária Bieliková,et al.  An Inquiry into the Utilization of Behavior of Users in Personalized Web , 2011, J. Univers. Comput. Sci..

[9]  Clement T. Yu,et al.  An effective approach to document retrieval via utilizing WordNet and recognizing phrases , 2004, SIGIR '04.

[10]  Rossano Schifanella,et al.  Folks in Folksonomies: social link prediction from shared metadata , 2010, WSDM '10.

[11]  Barry Smyth,et al.  A Case Study of Collaboration and Reputation in Social Web Search , 2011, TIST.

[12]  Eitan Farchi,et al.  Automatic query wefinement using lexical affinities with maximal information gain , 2002, SIGIR '02.

[13]  Mark Claypool,et al.  Implicit interest indicators , 2001, IUI '01.

[14]  Marco Padula,et al.  Personalized web browsing experience , 2009, HT '09.

[15]  Mária Bieliková,et al.  Ordinary Web pages as a source for metadata acquisition for open corpus user modeling , 2010 .

[16]  Michael A. Shepherd,et al.  Browsing and keyword-based profiles: a cautionary tale , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[17]  Mária Bieliková,et al.  On Deriving Tagsonomies: Keyword Relations Coming from Crowd , 2009, ICCCI.

[18]  Steve Lawrence,et al.  Context in Web Search , 2000, IEEE Data Eng. Bull..

[19]  Christian Bauckhage,et al.  I tag, you tag: translating tags for advanced user models , 2010, WSDM '10.

[20]  Susan T. Dumais,et al.  Characterizing the value of personalizing search , 2007, SIGIR.

[21]  Susan T. Dumais,et al.  Personalizing Search via Automated Analysis of Interests and Activities , 2005, SIGIR.

[22]  Ryen W. White,et al.  Predicting short-term interests using activity-based search context , 2010, CIKM.

[23]  Pavol Návrat,et al.  Context Search Enhanced by Readability Index , 2008, IFIP AI.

[24]  James Miller,et al.  A Survey of Cookie Technology Adoption Amongst Nations , 2009, J. Web Eng..

[25]  Yeliz Yesilada,et al.  COHSE: dynamic linking of web resources , 2007 .

[26]  Shyhtsun Felix Wu,et al.  Social Network Model Based on Keyword Categorization , 2009, 2009 International Conference on Advances in Social Network Analysis and Mining.

[27]  Erik Wilde,et al.  Web Site Metadata , 2010, J. Web Eng..

[28]  Feng Qiu,et al.  Automatic identification of user interest for personalized search , 2006, WWW '06.

[29]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[30]  Federica Cena,et al.  Towards a Tag-Based User Model: How Can User Model Benefit from Tags? , 2007, User Modeling.

[31]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[33]  Paul-Alexandru Chirita,et al.  Personalized query expansion for the web , 2007, SIGIR.

[34]  Luca Becchetti,et al.  An optimization framework for query recommendation , 2010, WSDM '10.

[35]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[36]  Mária Bieliková,et al.  Disambiguating Search by Leveraging a Social Context Based on the Stream of User's Activity , 2010, UMAP.

[37]  Barry Smyth,et al.  Are people biased in their use of search engines? , 2008, CACM.

[38]  Daniel Gayo-Avello,et al.  A survey on session detection methods in query logs and a proposal for future evaluation , 2009, Inf. Sci..

[39]  Ali A. Ghorbani,et al.  GUMSAWS: A Generic User Modeling Server for Adaptive Web Systems , 2007, Fifth Annual Conference on Communication Networks and Services Research (CNSR '07).

[40]  Gary Marchionini,et al.  Interfaces for End-User Information Seeking , 1992, J. Am. Soc. Inf. Sci..

[41]  Giorgio Maria Di Nunzio,et al.  Gathering and Mining Information from Web Log Files , 2007, DELOS.

[42]  Kjetil Nørvåg,et al.  QUEST: Query Expansion Using Synonyms over Time , 2010, ECML/PKDD.

[43]  Alessandro Micarelli,et al.  Nereau: a social approach to query expansion , 2008, WIDM '08.