Georeferencing Flickr resources based on textual meta-data

The task of automatically estimating the location of web resources is of central importance in location-based services on the Web. Much attention has been focused on Flickr photos and videos, for which it was found that language modeling approaches are particularly suitable. In particular, state-of-the art systems for georeferencing Flickr photos tend to cluster the locations on Earth in a relatively small set of disjoint regions, apply feature selection to identify location-relevant tags, then use a form of text classification to identify which area is most likely to contain the true location of the resource, and finally attempt to find an appropriate location within the identified area. In this paper, we present a systematic discussion of each of the aforementioned components, based on the lessons we have learned from participating in the 2010 and 2011 editions of MediaEval's Placing Task. Extensive experimental results allow us to analyze why certain methods work well on this task and show that a median error of just over 1km can be achieved on a standard benchmark test set.

[1]  Tat-Seng Chua,et al.  Research and applications on georeferenced multimedia: a survey , 2010, Multimedia Tools and Applications.

[2]  Steven Schockaert,et al.  Georeferencing Flickr photos using language models at different levels of granularity: An evidence based approach , 2012, J. Web Semant..

[3]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Martine De Cock,et al.  Neighborhood restrictions in geographic IR , 2007, SIGIR.

[5]  Mor Naaman,et al.  Towards automatic extraction of event and place semantics from flickr tags , 2007, SIGIR.

[6]  Jens Hartmann,et al.  Placing media items using the Xtrieval Framework , 2011, MediaEval.

[7]  Geert-Jan Houben,et al.  Placing images on the world map: a microblog-based enrichment approach , 2012, SIGIR '12.

[8]  Jaeyoung Choi,et al.  Video2GPS: a demo of multimodal location estimation on flickr videos , 2011, MM '11.

[9]  Steven Schockaert,et al.  Georeferencing Wikipedia pages using language models from Flickr , 2011, ISWC 2011.

[10]  Adam Rae,et al.  Working Notes for the Placing Task at MediaEval 2011 , 2011, MediaEval.

[11]  Steven Schockaert,et al.  Combining Multi-resolution Evidence for Georeferencing Flickr Images , 2010, SUM.

[12]  Mor Naaman,et al.  World explorer: visualizing aggregate data from unstructured text in geo-referenced collections , 2007, JCDL '07.

[13]  Gaussian Mixture Models 1.1 Random Variables , 2022 .

[14]  Alia I. Abdelmoty,et al.  The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing , 2004, GIScience.

[15]  Jurandy Almeida,et al.  RECOD Working Notes for Placing Task MediaEval 2011 , 2011, MediaEval.

[16]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[17]  Gerald Friedland,et al.  The 2010 ICSI Video Location Estimation System , 2010 .

[18]  Steven Schockaert,et al.  Towards automated georeferencing of Flickr photos , 2010, GIR.

[19]  B. S. Manjunath,et al.  Spirittagger: a geo-aware tag suggestion tool mined from flickr , 2008, MIR '08.

[20]  Geert-Jan Houben,et al.  WISTUD at MediaEval 2011: Placing Task , 2011, MediaEval.

[21]  Hanan Samet,et al.  Geotagging: using proximity, sibling, and prominence clues to understand comma groups , 2010, GIR.

[22]  Brendan T. O'Connor,et al.  A Latent Variable Model for Geographic Lexical Variation , 2010, EMNLP.

[23]  Martin Raubal,et al.  GeoSR: Geographically Explore Semantic Relations in World Knowledge , 2008, AGILE Conf..

[24]  Jon M. Kleinberg,et al.  Mapping the world's photos , 2009, WWW '09.

[25]  James Allan,et al.  An Investigation of Dirichlet Prior Smoothing's Performance Advantage , 2005 .

[26]  Steven Schockaert,et al.  Ghent University at the 2011 Placing Task , 2011, MediaEval.

[27]  Zhiguo Gong,et al.  Identifying points of interest by self-tuning clustering , 2011, SIGIR.

[28]  P. Schmitz,et al.  Inducing Ontology from Flickr Tags , 2006 .

[29]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[30]  Steven Schockaert,et al.  Finding locations of flickr resources using language models and similarity search , 2011, ICMR.

[31]  Jon M. Kleinberg,et al.  Spatial variation in search engine queries , 2008, WWW.

[32]  M. Goodchild Citizens as sensors: the world of volunteered geography , 2007 .

[33]  Kyumin Lee,et al.  You are where you tweet: a content-based approach to geo-locating twitter users , 2010, CIKM.

[34]  Adrian Popescu,et al.  Creating Visual Summaries for Geographic Regions , 2009 .

[35]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[36]  Jurandy Almeida,et al.  A visual approach for video geocoding using bag-of-scenes , 2012, ICMR.

[37]  Pavel Serdyukov,et al.  Placing flickr photos on a map , 2009, SIGIR.

[38]  Horacio Rodríguez Hontoria,et al.  TALP at MediaEval 2011 Placing Task: georeferencing Flickr videos with geographical knowledge and information retrieval , 2011 .

[39]  Mor Naaman,et al.  Generating diverse and representative image search results for landmarks , 2008, WWW.

[40]  Alexei A. Efros,et al.  IM2GPS: estimating geographic information from a single image , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Jason Baldridge,et al.  Simple supervised document geolocation with geodesic grids , 2011, ACL.

[42]  Mor Naaman,et al.  Methods for extracting place semantics from Flickr tags , 2009, TWEB.