Exploring Probabilistic Toponym Resolution for Geographical Information Retrieval

A key problem that arises when unstructured text is being queried is that of properly recognizing and exploiting geographical terms and entities. Here we describe a mechanism for probabilistic toponym resolution, and our experiments with the new method in the setting of the 2005 GeoCLEF queries and judgments. The new method gives improved retrieval effectiveness on a subset of the topics.

[1]  Fernando Llopis,et al.  The University of Alicante at GeoCLEF 2005 , 2005, CLEF.

[2]  Sven Hartrumpf,et al.  University of Hagen at GeoCLEF 2005: Using Semantic Networks for Interpreting Geographical Queries , 2005, CLEF.

[3]  Erik Rauch,et al.  A confidence-based framework for disambiguating geographic terms , 2003, HLT-NAACL 2003.

[4]  Fredric C. Gey,et al.  Berkeley2 at GeoCLEF: Cross-Language Geographic Information Retrieval of German and English Documents , 2005, CLEF.

[5]  Stephen E. Robertson,et al.  Okapi at TREC-6 Automatic ad hoc, VLC, routing, filtering and QSDR , 1997, TREC.

[6]  Daniel Ferrés,et al.  The GeoTALP-IR System at GeoCLEF-2005: Experiments Using a QA-based IR System, Linguistic Analysis, and a Geographical Thesaurus , 2005, CLEF.

[7]  Ron Sivan,et al.  Web-a-where: geotagging web content , 2004, SIGIR '04.

[8]  Jochen L. Leidner Preliminary Experiments with Geo-Filtering Predicates for Geographic IR , 2005, CLEF.

[9]  Fredric C. Gey,et al.  GeoCLEF: the CLEF 2005 Cross-Language Geographic Information Retrieval Track , 2005, CLEF.

[10]  Mário J. Silva,et al.  The XLDB Group at GeoCLEF 2005 , 2005, CLEF.

[11]  Gideon S. Mann,et al.  Bootstrapping toponym classifiers , 2003, HLT-NAACL 2003.

[12]  Inderjeet Mani,et al.  Disambiguating Toponyms in News , 2005, HLT/EMNLP.

[13]  Hugh E. Williams,et al.  The Zettair Search Engine , 1998 .

[14]  José Carlos González,et al.  MIRACLE's 2005 Approach to Geographical Information Retrieval , 2005, CLEF.