论文信息 - ’ s repository of research publications and other research outputs RGU-ISTI-Essex at TREC 2011 Session Track Conference or Workshop Item

’ s repository of research publications and other research outputs RGU-ISTI-Essex at TREC 2011 Session Track Conference or Workshop Item

Mining query recommendation from query logs has attracted a lot of attention in recent years. We propose to use query recommendations extracted from the logs of a web search engine to solve the session track tasks. The runs are obtained by using the Search Shortcuts recommender system. The Search Shortcuts technique uses an inverted index and the concept of “successful sessions” present in a web search engine’s query log to produce effective recommendations for both frequent and rare/unseen queries. We adapt the above technique as a query expansion tool and use it to expand the given queries for Session Track at TREC 2011. The expansion is generated by using a method which aims to consider all past queries in the session. The expansion terms obtained are then used to build a global, uniformly weighted, representation of the user session (RL2). Furthermore, the expansion terms are then combined with a ranked list of results in order to boost terms appearing more frequently in the final results lists (RL3). Finally, we also integrate dwell times and the weighting method obtained taking both result lists and clicks into account for assigning weights to the terms to expand the final query of the session. In addition to that, we submitted a baseline run. It is based on the observation that using the term “wikipedia” to expand the query resulted in a better retrieval performance for the tasks at last year’s session track at TREC 2010.

[1] W. Bruce Croft,et al. Relevance-Based Language Models , 2001, SIGIR '01.

[2] W. Bruce Croft,et al. Combining the language model and inference network approaches to retrieval , 2004, Inf. Process. Manag..

[3] Filip Radlinski,et al. Query chains: learning to rank from implicit feedback , 2005, KDD '05.

[4] Rosie Jones,et al. Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs , 2008, CIKM '08.

[5] Emine Yilmaz,et al. A simple and efficient sampling method for estimating AP and NDCG , 2008, SIGIR '08.

[6] Hugo Zaragoza,et al. The Probabilistic Relevance Framework: BM25 and Beyond , 2009, Found. Trends Inf. Retr..

[7] Victor Carneiro,et al. Search shortcuts: a new approach to the recommendation of queries , 2009, RecSys '09.

[8] Olivier Chapelle,et al. Expected reciprocal rank for graded relevance , 2009, CIKM.

[9] Stephen E. Robertson,et al. Extending average precision to graded relevance judgments , 2010, SIGIR.

[10] Paul D. Clough,et al. Session Track Overview ∗ , 2010 .

[11] Udo Kruschwitz,et al. The Use of Domain Modelling to Improve Performance Over a Query Session , 2011 .

[12] Charles L. A. Clarke,et al. Efficient and effective spam filtering and re-ranking for large web datasets , 2010, Information Retrieval.

[13] Fabrizio Silvestri,et al. Identifying task-based sessions in search engine query logs , 2011, WSDM '11.

[14] Fabrizio Silvestri,et al. Generating suggestions for queries in the long tail with an inverted index , 2012, Inf. Process. Manag..