Experiments with Geo-Temporal Expressions Filtering and Query Expansion at Document and Phrase Context Resolution

We describe an evaluation experiment on GeoTemporal Document Retrieval created for the GeoTime evaluation task of NTCIR 2010. GeoTemporal Retrieval aims at to improve retrieval results using Geographic and Temporal dimensions of relevance. To accomplish that task, systems need to extract geographic and temporal information from the documents, and then explore semantic relations among those dimensions within the documents. Since this is the first time the task is taking place our aim is to evaluate some basic techniques in order to set some research directions of our work. We aim to understand the relevance of temporal and geographic expressions for filtering purposes. The geographic expressions were extracted with Yahoo PlaceMaker and for temporal expressions we used the TIMEXTAG system. We experimented techniques using both the overall document and sentence resolutions, as also one mixed approach. We also used a query expansion mechanism in topics with no filters defined. We used the BM25 as retrieval model and preprocessed the topics with a semi-automatic methodology to create structures that let us create our filters and expansions. We learned that the sentence level is not a very good approach (but we got clues that probably the paragraph context resolution could improve the results) and the geographic and temporal expressions base filters had shown good performance.

[1]  Fredric C. Gey,et al.  An Evaluation Resource for Geographic Information Retrieval , 2008, LREC.

[2]  Rafael Muñoz,et al.  Event ordering using TERSEO system , 2004, Data Knowl. Eng..

[3]  Fredric C. Gey,et al.  GeoCLEF 2008: the CLEF 2008 Cross-Language Geographic Information Retrieval Track Overview , 2008, CLEF.

[4]  Fredric C. Gey,et al.  NTCIR-GeoTime Overview: Evaluating Geographic and Temporal Search , 2010, NTCIR.

[5]  Elisabeth Métais,et al.  Application of natural language to information systems (NLDB04) , 2006, Data Knowl. Eng..

[6]  Fredric C. Gey,et al.  GeoCLEF 2008: the CLEF 2008 Cross-Language Geographic Information Retrieval Track Overview , 2008, Conference and Labs of the Evaluation Forum.

[7]  Nuno Cardoso,et al.  The University of Lisbon at GeoCLEF 2007 , 2007, CLEF.

[8]  José Luis Borbinha,et al.  Experiments with N-Gram Prefixes on a Multinomial Language Model versus Lucene's Off-the-shelf Ranking Scheme and Rocchio Query Expansion (TEL@CLEF Monolingual Task) , 2009, CLEF.

[9]  M. de Rijke,et al.  A Cascaded Machine Learning Approach to Interpreting Temporal Expressions , 2007, NAACL.

[10]  James F. Allen Towards a General Theory of Action and Time , 1984, Artif. Intell..

[11]  Michael Gertz,et al.  On the value of temporal information in information retrieval , 2007, SIGF.

[12]  Ray R. Larson Cheshire at GeoCLEF 2008: Text and Fusion Approaches for GIR , 2008, CLEF.

[13]  Daniel Ferrés,et al.  TALP at GeoCLEF 2007: Results of a Geographical Knowledge Filtering Approach with Terrier , 2007, CLEF.