论文信息 - Reducing the Uncertainty in Resource Selection

Reducing the Uncertainty in Resource Selection

The distributed retrieval process is plagued by uncertainty. Sampling, selection, merging and ranking are all based on very limited information compared to centralized retrieval. In this paper, we focus our attention on reducing the uncertainty within the resource selection phase by obtaining a number of estimates, rather than relying upon only one point estimate. We propose three methods for reducing uncertainty which are compared against state-of-the-art baselines across three distributed retrieval testbeds. Our results show that the proposed methods significantly improve baselines, reduce the uncertainty and improve robustness of resource selection.

Fabio Crestani | Leif Azzopardi | Ilya Markov

[1] Fabio Crestani,et al. Adaptive query-based sampling for distributed IR , 2006, SIGIR.

[2] Jun Wang,et al. Portfolio theory of information retrieval , 2009, SIGIR.

[3] Claudia Hauff,et al. Predicting the effectiveness of queries and retrieval systems , 2010, SIGF.

[4] Andrew Trotman,et al. Sound and complete relevance assessment for XML retrieval , 2008, TOIS.

[5] W. Bruce Croft,et al. Cluster-based language models for distributed retrieval , 1999, SIGIR '99.

[6] Milad Shokouhi,et al. Central-Rank-Based Collection Selection in Uncooperative Distributed Information Retrieval , 2007, ECIR.

[7] Avi Arampatzis,et al. On CORI Results Merging , 2013, ECIR.

[8] Fernando Diaz,et al. Sources of evidence for vertical selection , 2009, SIGIR.

[9] Milad Shokouhi,et al. Robust result merging using sample-based score estimates , 2009, TOIS.

[10] Milad Shokouhi,et al. Evaluating Server Selection for Federated Search , 2010, ECIR.

[11] James P. Callan,et al. Query-based sampling of text databases , 2001, TOIS.