Geographic expansion of queries to improve the geographic information retrieval task

Geographic Information Retrieval (GIR) is concerned with improving the quality of geographically-specific Information Retrieval (IR), focusing on access to unstructured documents. Since GIR can be considered as an extension of IR, the application of Natural Language Processing (NLP) techniques, such as query expansion, can lead to significant improvements. In this paper we propose two NLP techniques of query expansion related to the augmentation of the geospatial part that is usually identified in a geographic query. The aim of both approaches is to retrieve possible relevant documents that are not retrieved using the original query. Then, we propose to add such new documents to the list of documents retrieved using the original query. In this way, the geo-reranking process takes into account more possible relevant documents. We have evaluated the proposed approaches using GeoCLEF as evaluation framework for GIR systems. The results obtained show that the use of proposed query expansion techniques can be a good strategy to improve the overall performance of a GIR system.

[1]  Paolo Rosso,et al.  Using the WordNet Ontology in the GeoCLEF Geographical Information Retrieval Task , 2005, CLEF.

[2]  Mário J. Silva,et al.  Query expansion through geographical feature types , 2007, GIR '07.

[3]  Alia I. Abdelmoty,et al.  Ontology-Based Spatial Query Expansion in Information Retrieval , 2005, OTM Conferences.

[4]  M. Sanderson,et al.  Analyzing geographic queries , 2004 .

[5]  Zahir Tari,et al.  On the Move to Meaningful Internet Systems 2002: CoopIS, DOA, and ODBASE , 2002, Lecture Notes in Computer Science.

[6]  Amanda Spink,et al.  Use of query reformulation and relevance feedback by Excite users , 2000, Internet Res..

[7]  Vijayan Sugumaran,et al.  Natural Language and Information Systems, 13th International Conference on Applications of Natural Language to Information Systems, NLDB 2008, London, UK, June 24-27, 2008, Proceedings , 2008, NLDB.

[8]  Yi Li,et al.  Exploring Probabilistic Toponym Resolution for Geographical Information Retrieval , 2006, GIR.

[9]  Fredric C. Gey,et al.  GeoCLEF 2008: the CLEF 2008 Cross-Language Geographic Information Retrieval Track Overview , 2008, CLEF.

[10]  Yi Li,et al.  An empirical study of the effects of NLP components on Geographic IR performance , 2008, Int. J. Geogr. Inf. Sci..

[11]  Miguel A. García-Cumbreras,et al.  Using query reformulation and keywords in the geographic information retrieval task , 2008 .

[12]  Amanda Spink,et al.  Patterns of query reformulation during Web searching , 2009 .

[13]  Xing Xie,et al.  Indexing implicit locations for geographical information retrieval , 2006, GIR.

[14]  Ray R. Larson,et al.  Geographic information retrieval and spatial browsing , 1996 .

[15]  Wei Vivian Zhang,et al.  Geographic intention and modification in web search , 2008, Int. J. Geogr. Inf. Sci..

[16]  Torsten Suel,et al.  Analysis of geographic queries in a search engine log , 2008, LocWeb.

[17]  Peter G. Anick Using terminological feedback for web search refinement: a log-based study , 2003, SIGIR.

[18]  Christopher B. Jones,et al.  Geographical information retrieval , 2008, Int. J. Geogr. Inf. Sci..

[19]  Luis Alfonso Ureña López,et al.  Geo-NER: un reconocedor de entidades geográficas para inglés basado en GeoNames y Wikipedia , 2009, Proces. del Leng. Natural.

[20]  Mário J. Silva,et al.  Relevance Ranking for Geographic IR , 2006, GIR.

[21]  Carol Peters,et al.  Evaluating Systems for Multilingual and Multimodal Information Access, 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008, Aarhus, Denmark, September 17-19, 2008, Revised Selected Papers , 2009, CLEF.

[22]  José M. Perea-Ortega,et al.  Comparing Several Textual Information Retrieval Systems for the Geographical Information Retrieval Task , 2008, NLDB 2008.

[23]  Luis Gravano,et al.  Categorizing web queries according to geographical locality , 2003, CIKM '03.

[24]  Fredric C. Gey,et al.  GeoCLEF: the CLEF 2005 Cross-Language Geographic Information Retrieval Track , 2005, CLEF.

[25]  Mor Naaman,et al.  Proceedings of the first international workshop on Location and the web , 2008, The Web Conference.

[26]  Fredric C. Gey,et al.  Accessing Multilingual Information Repositories, 6th Workshop of the Cross-Language Evalution Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005, Revised Selected Papers , 2006, CLEF.