Information Retrieval in a Distributed Environment

The Web and digital libraries make it possible to send natural-language queries to multiple information servers (corpora or search engines). This raises two difficult problems: selecting the best document sources and merging the results returned by the different servers. This paper describes a new approach to collection selection based on decision trees. It also evaluates several merging and selection procedures, giving an overview of the suggested approaches.
