Time-sensitive Personalized Query Auto-Completion

Query auto-completion (QAC) is a prominent feature of modern search engines. It is aimed at saving user's time and enhancing the search experience. Current QAC models mostly rank matching QAC candidates according to their past popularity, i.e., frequency. However, query popularity changes over time and may vary drastically across users. Hence, rankings of QAC candidates should be adjusted accordingly. In previous work time-sensitive QAC models and user-specific QAC models have been developed separately. Both types of QAC model lead to important improvements over models that are neither time-sensitive nor personalized. We propose a hybrid QAC model that considers both of these aspects: time-sensitivity and personalization. Using search logs, we return the top N QAC candidates by predicted popularity based on their recent trend and cyclic behavior. We use auto-correlation to detect query periodicity by long-term time-series analysis, and anticipate the query popularity trend based on observations within an optimal time window returned by a regression model. We rerank the returned top N candidates by integrating their similarities with a user's preceding queries (both in the current session and in previous sessions by the same user) on a character level to produce a final QAC list. Our experimental results on two real-world datasets show that our hybrid QAC model outperforms state-of-the-art time-sensitive QAC baseline, achieving total improvements of between 3% and 7% in terms of MRR.

[1]  Xueqi Cheng,et al.  Intent-aware query similarity , 2011, CIKM '11.

[2]  Craig MacDonald,et al.  Learning to rank query suggestions for adhoc and diversity search , 2012, Information Retrieval.

[3]  Surajit Chaudhuri,et al.  Extending autocompletion to tolerate errors , 2009, SIGMOD Conference.

[4]  Peter Haider,et al.  Learning to Complete Sentences , 2005, ECML.

[5]  Qi He,et al.  Web Query Recommendation via Sequential Query Prediction , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[6]  Ingmar Weber,et al.  The demographics of web search , 2010, SIGIR.

[7]  Michael Gertz,et al.  CONQUER: a system for efficient context-aware query suggestions , 2011, WWW.

[8]  M. de Rijke,et al.  Personalized document re-ranking based on Bayesian probabilistic matrix factorization , 2014, SIGIR.

[9]  Laura Hollink,et al.  Search behavior of media professionals at an audiovisual archive: A transaction log analysis , 2010 .

[10]  Yehuda Koren,et al.  Expediting search trend detection via prediction of query counts , 2013, WSDM.

[11]  Tobias Scheffer,et al.  Sentence Completion , 1921, SIGIR '04.

[12]  Milad Shokouhi,et al.  Detecting seasonal queries by time-series analysis , 2011, SIGIR.

[13]  Enhong Chen,et al.  Context-aware query suggestion by mining click-through and session data , 2008, KDD.

[14]  Milad Shokouhi,et al.  Learning to personalize query auto-completion , 2013, SIGIR.

[15]  Weiguo Fan,et al.  Web Query Prediction by Unifying Model , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[16]  Milad Shokouhi,et al.  Time-sensitive query auto-completion , 2012, SIGIR '12.

[17]  Ingmar Weber,et al.  Type less, find more: fast autocompletion search with a succinct index , 2006, SIGIR.

[18]  Ziv Bar-Yossef,et al.  Context-sensitive query auto-completion , 2011, WWW.

[19]  Pavel Serdyukov,et al.  Actualization of query suggestions using query logs , 2012, WWW.

[20]  Steve Chien,et al.  Semantic similarity between search engine queries using temporal correlation , 2005, WWW '05.

[21]  Gary Marchionini,et al.  Examining the effectiveness of real-time query expansion , 2007, Inf. Process. Manag..

[22]  Hao Wu,et al.  Suggesting Topic-Based Query Terms as You Type , 2010, 2010 12th International Asia-Pacific Web Conference.

[23]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[24]  Enhong Chen,et al.  Mining Concept Sequences from Large-Scale Search Logs for Context-Aware Query Suggestion , 2011, TIST.

[25]  Abdur Chowdhury,et al.  A picture of search , 2006, InfoScale '06.

[26]  Prasenjit Mitra,et al.  Query suggestions in the absence of query logs , 2011, SIGIR.

[27]  Tetsuya Sakai,et al.  Time-aware structured query suggestion , 2013, SIGIR.

[28]  P. A. Blight The Analysis of Time Series: An Introduction , 1991 .

[29]  Enrique Alfonseca,et al.  Gazpacho and summer rash: lexical relationships from temporal patterns of web search queries , 2009, EMNLP.

[30]  Wei Chu,et al.  Modeling the impact of short- and long-term behavior on search personalization , 2012, SIGIR '12.

[31]  Witold Litwin,et al.  Fast nGram-Based String Search Over Data Encoded Using Algebraic Signatures , 2007, VLDB.

[32]  Pablo de la Fuente,et al.  Context-Based Personalization for Mobile Web Search , 2008, PersDB.

[33]  Joemon M. Jose,et al.  Recent and robust query auto-completion , 2014, WWW.

[34]  Ingmar Weber,et al.  Efficient interactive query expansion with complete search , 2007, CIKM '07.

[35]  Dimitrios Gunopulos,et al.  Identifying similarities, periodicities and bursts for online search queries , 2004, SIGMOD '04.

[36]  Susan T. Dumais,et al.  Understanding temporal query dynamics , 2011, WSDM '11.