Personalized query suggestion diversification in information retrieval

Query suggestions help users refine their queries after they input an initial query. Previous work on query suggestion has mainly concentrated on approaches that are similarity-based or context-based, developing models that either focus on adapting to a specific user (personalization) or on diversifying query aspects in order to maximize the probability of the user being satisfied (diversification). We consider the task of generating query suggestions that are both personalized and diversified. We propose a personalized query suggestion diversification (PQSD) model, where a user’s long-term search behavior is injected into a basic greedy query suggestion diversification model that considers a user’s search context in their current session. Query aspects are identified through clicked documents based on the open directory project (ODP) with a latent dirichlet allocation (LDA) topic model. We quantify the improvement of our proposed PQSD model against a state-of-the-art baseline using the public america online (AOL) query log and show that it beats the baseline in terms of metrics used in query suggestion ranking and diversification. The experimental results show that PQSD achieves its best performance when only queries with clicked documents are taken as search context rather than all queries, especially when more query suggestions are returned in the list.

[1]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[2]  Hongyan Liu,et al.  Multi-view random walk framework for search task discovery from click-through log , 2011, CIKM '11.

[3]  Starr Roxanne Hiltz,et al.  Identifying Opportunities for Valuable Encounters: Toward Context-Aware Social Matching Systems , 2015, TOIS.

[4]  M. de Rijke,et al.  Learning from homologous queries and semantically related terms for query auto completion , 2016, Inf. Process. Manag..

[5]  Pablo Castells,et al.  Personalized diversification of search results , 2012, SIGIR '12.

[6]  Kenneth Ward Church,et al.  Query suggestion using hitting time , 2008, CIKM '08.

[7]  Saurabh Sharma,et al.  Obtaining Personalized and Accurate Query Suggestion by Using Agglomerative Clustering Algorithm and P-QC Method , 2012 .

[8]  Xueqi Cheng,et al.  Intent-aware query similarity , 2011, CIKM '11.

[9]  Charles L. A. Clarke,et al.  Novelty and diversity in information retrieval evaluation , 2008, SIGIR '08.

[10]  Olivier Chapelle,et al.  Expected reciprocal rank for graded relevance , 2009, CIKM.

[11]  Shaha T. Al-Otaibi,et al.  Hybrid immunizing solution for job recommender system , 2017, Frontiers of Computer Science.

[12]  Craig MacDonald,et al.  Explicit Search Result Diversification through Sub-queries , 2010, ECIR.

[13]  Nick Craswell,et al.  Random walks on the click graph , 2007, SIGIR.

[14]  Michael R. Lyu,et al.  Learning latent semantic relations from clickthrough data for query suggestion , 2008, CIKM '08.

[15]  Maarten de Rijke,et al.  Personalized Query Suggestion Diversification , 2017, SIGIR.

[16]  Fei Cai,et al.  Prefix-Adaptive and Time-Sensitive Personalized Query Auto Completion , 2016, IEEE Transactions on Knowledge and Data Engineering.

[17]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[18]  Yang Song,et al.  Post-ranking query suggestion by diversifying search results , 2011, SIGIR '11.

[19]  Enhong Chen,et al.  Context-aware query suggestion by mining click-through and session data , 2008, KDD.

[20]  Jade Goldstein-Stewart,et al.  The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries , 1998, SIGIR Forum.

[21]  Yen-Jen Oyang,et al.  Relevant term suggestion in interactive web search based on contextual information in query session logs , 2003, J. Assoc. Inf. Sci. Technol..

[22]  Sean M. McNee,et al.  Improving recommendation lists through topic diversification , 2005, WWW '05.

[23]  Michael R. Lyu,et al.  Diversifying Query Suggestion Results , 2010, AAAI.

[24]  이주연,et al.  Latent Dirichlet Allocation (LDA) 모델 기반의 인공지능(A.I.) 기술 관련 연구 활동 및 동향 분석 , 2018 .

[25]  Danushka Bollegala,et al.  Measuring semantic similarity between words using web search engines , 2007, WWW '07.

[26]  Wessel Kraaij,et al.  User Simulations for Interactive Search: Evaluating Personalized Query Suggestion , 2015, ECIR.

[27]  Maarten de Rijke,et al.  Efficient Structured Learning for Personalized Diversification , 2016, IEEE Transactions on Knowledge and Data Engineering.

[28]  Chirag Shah,et al.  Evaluating high accuracy retrieval techniques , 2004, SIGIR '04.

[29]  M. de Rijke,et al.  A Survey of Query Auto Completion in Information Retrieval , 2016, Found. Trends Inf. Retr..

[30]  Craig MacDonald,et al.  Intent models for contextualising and diversifying query suggestions , 2013, CIKM.

[31]  Fei Cai Behavior-Based Personalization in Web Search* , 2016 .

[32]  M. de Rijke,et al.  Behavior‐based personalization in web search , 2017, J. Assoc. Inf. Sci. Technol..

[33]  Sreenivas Gollapudi,et al.  Diversifying search results , 2009, WSDM '09.

[34]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[35]  de RijkeMaarten,et al.  Learning from homologous queries and semantically related terms for query auto completion , 2016 .

[36]  Yee Whye Teh,et al.  On Smoothing and Inference for Topic Models , 2009, UAI.

[37]  Ling Liu,et al.  Query-URL Bipartite Based Approach to Personalized Query Recommendation , 2008, AAAI.

[38]  Filip Radlinski,et al.  Simple Personalized Search Based on Long-Term Behavioral Signals , 2017, ECIR.

[39]  Reynold Cheng,et al.  DQR: a probabilistic approach to diversified query recommendation , 2012, CIKM '12.

[40]  Abdur Chowdhury,et al.  A picture of search , 2006, InfoScale '06.

[41]  M. de Rijke,et al.  Diversifying Query Auto-Completion , 2016, ACM Trans. Inf. Syst..

[42]  Olfa Nasraoui,et al.  Mining search engine query logs for query recommendation , 2006, WWW '06.