Does Online Evaluation Correspond to Offline Evaluation in Query Auto Completion?

Query Auto Completion is the task of suggesting queries to the users of a search engine while they are typing a query in the search box. In recent years there has been renewed research interest in improving the quality of these suggestions, and the published improvements have been assessed using offline evaluation techniques and metrics. In this paper, we compare online and offline assessments of Query Auto Completion. We show that there is a large potential for significant bias when the raw data collected during an online experiment is subsequently reused in offline experiments to evaluate new methods.