Exploitation of information repositories available on the Internet requires users to separately query each repository and manually gather retrieved results. Such a solution could be simplified by using a centralized server that acts as a gateway between the user and repositories: the centralized server forwards the user query to federated repositories and fuses retrieved documents for presentation to the user. To perform these tasks efficiently, the centralized server should perform two main functions: resource selection and data fusion. The former is required to forward the user query only to the repositories that are candidate to contain relevant documents. The latter is used to gather all retrieved documents and conveniently arrange them for presentation to the user. In the case of image repositories, data fusion is particularly challenging owing to the difficulty to normalize document scores returned by different repositories. In this paper a novel solution is presented for fusion of results returned by different image repositories. Experimental results are presented that show the potential of the proposed approach.
[1]
James P. Callan,et al.
Query-based sampling of text databases
,
2001,
TOIS.
[2]
Kui-Lam Kwok,et al.
TREC-3 Ad-Hoc, Routing Retrieval and Thresholding Experiments using PIRCS
,
1994,
TREC.
[3]
Mounia Lalmas,et al.
Merging techniques for performing data fusion on the web
,
2001,
CIKM '01.
[4]
Alberto Del Bimbo,et al.
Using indexing structures for resource descriptors extraction from distributed image repositories
,
2002,
Proceedings. IEEE International Conference on Multimedia and Expo.
[5]
Ellen M. Voorhees,et al.
Learning collection fusion strategies
,
1995,
SIGIR '95.
[6]
Luo Si,et al.
Using sampled data and regression to merge search engine results
,
2002,
SIGIR '02.