Building user profiles from topic models for personalised search

Personalisation is an important area of information retrieval (IR) that adapts ranking algorithms so that the results returned reflect the searcher's interests. In this work we use query logs to build personalised ranking models in which user profiles are constructed from the representation of clicked documents over a topic space. Instead of employing a human-generated ontology, we use novel latent topic models to determine these topics. Our experiments show that by subtly introducing user profiles as part of the ranking algorithm, rather than by re-ranking an existing list, we can provide personalised ranked lists of documents which improve significantly over a non-personalised baseline. Further examination shows that the personalised system performs particularly well in cases where prior knowledge of the search query is limited.
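
To make the idea of a topic-space user profile concrete, the following is a minimal sketch and not the model used in this work, whose details are not given in the abstract. It assumes that each document already has a topic distribution inferred by some latent topic model, that a profile is simply the mean of the topic distributions of a user's clicked documents, and that the profile enters the scoring function through a linear interpolation. The function names, the dot-product affinity and the `weight` parameter are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Sequence, Tuple


def build_profile(clicked_topic_vectors: Sequence[Sequence[float]]) -> List[float]:
    """Average the topic distributions of a user's clicked documents.

    Assumption: each vector is a per-document topic distribution produced
    beforehand by a latent topic model; all vectors have the same length.
    """
    if not clicked_topic_vectors:
        raise ValueError("need at least one clicked document")
    num_topics = len(clicked_topic_vectors[0])
    profile = [0.0] * num_topics
    for vec in clicked_topic_vectors:
        for k, p in enumerate(vec):
            profile[k] += p
    n = len(clicked_topic_vectors)
    return [p / n for p in profile]


def personalised_score(
    query_score: float,
    doc_topics: Sequence[float],
    profile: Sequence[float],
    weight: float = 0.5,
) -> float:
    """Combine a non-personalised query score with a user-document affinity.

    Assumption: affinity is the dot product of the two topic vectors and the
    combination is a linear interpolation controlled by `weight`.
    """
    affinity = sum(d * u for d, u in zip(doc_topics, profile))
    return (1.0 - weight) * query_score + weight * affinity


def rank(
    candidates: Dict[str, Tuple[float, Sequence[float]]],
    profile: Sequence[float],
    weight: float = 0.5,
) -> List[str]:
    """Return document ids ordered by personalised score, best first."""
    scored = {
        doc_id: personalised_score(query_score, topics, profile, weight)
        for doc_id, (query_score, topics) in candidates.items()
    }
    return sorted(scored, key=scored.get, reverse=True)
```

In this sketch the profile is part of the scoring function itself, so the ordering is produced in one pass instead of adjusting a list that has already been ranked. Setting `weight` to 0 recovers the non-personalised ordering, which gives a simple baseline to compare against.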
