Cheshire at GeoCLEF 2008: Text and Fusion Approaches for GIR

In this paper we will briefly describe the approaches taken by the Berkeley Cheshire group for the main GeoCLEF 2008 tasks (Mono and Bilingual retrieval), and present some analyses of the fusion approach used. This year our submissions used probabilistic text retrieval based on logistic regression and incorporating blind relevance feedback for all of the runs and in addition we ran a number of tests combining this type of search with OKAPI BM25 searches using a fusion approach. We did not, however, use any explicit geographic processing. All translation for bilingual tasks was performed using the LEC Power Translator PC-based MT system.

[1]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[2]  Fredric C. Gey,et al.  Full Text Retrieval based on Probalistic Equations with Coefficients fitted by Logistic Regression , 1993, TREC.

[3]  Carol Peters,et al.  Advances in Multilingual and Multimodal Information Retrieval, 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers , 2008, CLEF.

[4]  Fredric C. Gey,et al.  Probabilistic retrieval based on staged logistic regression , 1992, SIGIR '92.

[5]  Ray R. Larson,et al.  A Fusion Approach to XML Structured Document Retrieval , 2005, Information Retrieval.

[6]  Aitao Chen,et al.  Cross-language Retrieval Experiments at CLEF 2002 , 2002, CLEF.

[7]  Ray R. Larson,et al.  Probabilistic Retrieval, Component Fusion and Blind Feedback for XML Retrieval , 2005, INEX.

[8]  Aitao Chen Multilingual Information Retrieval using English and Chinese Queries , 2001, CLEF.

[9]  Fredric C. Gey,et al.  Accessing Multilingual Information Repositories, 6th Workshop of the Cross-Language Evalution Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005, Revised Selected Papers , 2006, CLEF.

[10]  Ray R. Larson Cheshire at GeoCLEF 2007: Retesting Text Retrieval Baselines , 2007, CLEF.

[11]  Carol Peters,et al.  Evaluation of Cross-Language Information Retrieval Systems , 2002, Lecture Notes in Computer Science.

[12]  Fredric C. Gey,et al.  Multilingual Information Retrieval Using Machine Translation, Relevance Feedback and Decompounding , 2004, Information Retrieval.

[13]  Ray R. Larson,et al.  Geographic information retrieval and spatial browsing , 1996 .

[14]  Stephen E. Robertson,et al.  On relevance weights with little relevance information , 1997, SIGIR '97.

[15]  Fredric C. Gey,et al.  Berkeley at GeoCLEF: Logistic Regression and Fusion for Geographic Information Retrieval , 2005, CLEF.

[16]  Jong-Hak Lee,et al.  Analyses of multiple evidence combination , 1997, SIGIR '97.

[17]  Ray R. Larson Experiments in Classification Clustering and Thesaurus Expansion for Domain Specific Cross-Language Retrieval , 2007, CLEF.