论文信息 - Un regard statistique sur l'évaluation de performance : L'exemple de CLEF 2005

Un regard statistique sur l'évaluation de performance : L'exemple de CLEF 2005

RESUME . Cette communication evalue et compare l'efficacite du depistage de l'information de onze modeles a l'aide de quatre collections de documents rediges dans les langues francaise, portugaise- bresilienne, hongroise et bulgare. Pour les deux dernieres langues, on compare egalement l'indexation basee sur des mots a celle reposant sur des quadrigrammes (4-grams). En recourant a quatre tests statistiques et deux regles ad hoc, nous analysons les performances obtenues pour savoir si les differences de performance observees sont significatives. Enfin, nous comparons les resultats de ces differentes regles de decision afin de verifier leur degre de concordance.

Jacques Savoy | J. Savoy

[1] C. J. van Rijsbergen,et al. Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[2] M. F. Fuller,et al. Practical Nonparametric Statistics; Nonparametric Statistical Inference , 1973 .

[3] Stephen E. Robertson,et al. Experimentation as a way of life: Okapi at TREC , 2000, Inf. Process. Manag..

[4] Jacques Savoy,et al. Monolingual, Bilingual, and GIRT Information Retrieval at CLEF-2005 , 2005, CLEF.

[5] Bogdan Sacaleanu,et al. Working Notes for the CLEF 2008 Workshop , 2008 .

[6] David A. Hull. Using statistical testing in the evaluation of retrieval experiments , 1993, SIGIR.

[7] Chris Buckley,et al. New Retrieval Approaches Using SMART: TREC 4 , 1995, TREC.

[8] R. A. Groeneveld,et al. Practical Nonparametric Statistics (2nd ed). , 1981 .

[9] Ellen M. Voorhees,et al. Overview of TREC 2004 , 2004, TREC.

[10] Laurence G. Grimm,et al. Statistical Applications for the Behavioral Sciences , 1993 .

[11] Jacques Savoy,et al. Statistical inference in retrieval effectiveness evaluation , 1997, Inf. Process. Manag..

[12] Donna K. Harman,et al. Overview of the Sixth Text REtrieval Conference (TREC-6) , 1997, Inf. Process. Manag..

[13] Ellen M. Voorhees,et al. The effect of topic set size on retrieval experiment error , 2002, SIGIR '02.

[14] Mark Sanderson,et al. Information retrieval system evaluation: effort, sensitivity, and reliability , 2005, SIGIR '05.