论文信息 - Georeferencing Wikipedia pages using language models from Flickr

Georeferencing Wikipedia pages using language models from Flickr

The task of assigning geographic coordinates to web resources has recently gained in popularity. In particular, several recent initiatives have focused on the use of language models for georeferencing Flickr photos, with promising results. Such techniques, however, require the availability of large numbers of spatially grounded training data. They are therefore not directly applicable for georeferencing other types of resources, such as Wikipedia pages. As an alternative, in this paper we explore the idea of using language models that are trained on Flickr photos for finding the coordinates of Wikipedia pages. Our experimental results show that the resulting method is able to outperform popular methods that are based on gazetteer look-up.

Steven Schockaert | Bart Dhoedt | Olivier Van Laere | Chris De Rouck

[1] Mor Naaman,et al. Methods for extracting place semantics from Flickr tags , 2009, TWEB.

[2] Jochen L. Leidner. Toponym resolution in text: annotation, evaluation and applications of spatial grounding , 2007, SIGF.

[3] Pavel Serdyukov,et al. Placing flickr photos on a map , 2009, SIGIR.

[4] Paolo Rosso,et al. A comparison of methods for the automatic identification of locations in wikipedia , 2007, GIR '07.

[5] Daniel S. Weld,et al. Autonomously semantifying wikipedia , 2007, CIKM '07.

[6] M. Goodchild. Citizens as sensors: the world of volunteered geography , 2007 .

[7] Steven Schockaert,et al. Finding locations of flickr resources using language models and similarity search , 2011, ICMR.

[8] Mohammad Soleymani,et al. Automatic tagging and geotagging in video collections and communities , 2011, ICMR.

[9] Adrian Popescu,et al. Spatiotemporal mapping of Wikipedia concepts , 2010, JCDL '10.

[10] Gerhard Weikum,et al. YAGO2: exploring and querying world knowledge in time, space, context, and many languages , 2011, WWW.