Temporal Latent Topic User Profiles for Search Personalisation

The performance of search personalisation depends largely on how effectively user profiles are built. Many approaches build user profiles from the topics discussed in relevant documents, where the topics are usually taken from a human-generated online ontology such as the Open Directory Project. The limitation of these approaches is that many documents may not contain the topics covered in the ontology. Moreover, human-generated topics require expensive manual effort to determine the correct categories for each document. This paper addresses these problems by using Latent Dirichlet Allocation for unsupervised extraction of topics from documents. With the learned topics, we observe that search intent and user interests are dynamic, i.e., they change over time. To evaluate the effectiveness of temporal aspects in personalisation, we build profiles at three typical time scales: a long-term profile, a daily profile and a session profile. In the experiments, we use the profiles to re-rank search results returned by a commercial web search engine. Our experimental results demonstrate that the temporal profiles can significantly improve ranking quality. The results further show a promising effect of the temporal features in correlation with click entropy and query position in a search session.
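The idea of profiles at three time scales can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not state how clicked documents are weighted or how profiles and documents are compared, so the uniform averaging, the cosine similarity, and every name here (`Click`, `build_profile`, `temporal_profiles`, `rerank`) are assumptions made for illustration. The topic vectors are hand-written stand-ins for distributions that a trained Latent Dirichlet Allocation model would supply.

```python
from collections import namedtuple
from math import sqrt

# One clicked (assumed relevant) document: the day and session it was clicked in,
# plus its topic distribution. In practice the distribution would come from LDA.
Click = namedtuple("Click", ["day", "session", "topics"])


def build_profile(clicks):
    """Average the topic vectors of the clicks (uniform weights are an assumption)."""
    if not clicks:
        return None
    n_topics = len(clicks[0].topics)
    return [sum(c.topics[k] for c in clicks) / len(clicks) for k in range(n_topics)]


def temporal_profiles(history, day, session):
    """Build long-term, daily and session profiles from the same click history."""
    return {
        "long_term": build_profile(history),
        "daily": build_profile([c for c in history if c.day == day]),
        "session": build_profile([c for c in history if c.session == session]),
    }


def cosine(u, v):
    """Cosine similarity between two topic vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rerank(candidates, profile):
    """Sort (doc_id, topic_vector) pairs by similarity to the profile, best first."""
    return sorted(candidates, key=lambda d: cosine(d[1], profile), reverse=True)


if __name__ == "__main__":
    history = [
        Click(day=1, session="s1", topics=[0.8, 0.1, 0.1]),
        Click(day=1, session="s1", topics=[0.6, 0.3, 0.1]),
        Click(day=2, session="s2", topics=[0.1, 0.1, 0.8]),
        Click(day=2, session="s3", topics=[0.1, 0.8, 0.1]),
    ]
    candidates = [
        ("doc_a", [0.7, 0.2, 0.1]),
        ("doc_b", [0.1, 0.7, 0.2]),
        ("doc_c", [0.2, 0.1, 0.7]),
    ]
    profiles = temporal_profiles(history, day=2, session="s3")
    for name, profile in profiles.items():
        print(name, [doc_id for doc_id, _ in rerank(candidates, profile)])
```

In this toy history the user's interest shifts from the first topic on day 1 to the other two on day 2, so the three profiles order the same candidates differently. That difference is the kind of temporal effect the abstract refers to.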
