LASOM: Location Aware Self-Organizing Map for discovering similar and unique visual features of geographical locations

Can a machine tell us if an image was taken in Beijing or New York? Automated identification of the geographical coordinates based on image content is of particular importance to data mining systems, because geolocation provides a large source of context for other useful features of an image. However, successful localization of unannotated images requires a large collection of images that cover all possible locations. Brute-force searches over the entire databases are costly in terms of computation and storage requirements, and achieve limited results. Knowing what visual features make a particular location unique or similar to other locations can be used for choosing a better match between spatially distance locations. However, doing this at global scales is a challenging problem. In this paper we propose an on-line, unsupervised, clustering algorithm called Location Aware Self-Organizing Map (LASOM), for learning the similarity graph between different regions. The goal of LASOM is to select key features in specific locations so as to increase the accuracy in geotagging untagged images, while also reducing computational and storage requirements. Different from other Self-Organizing Map algorithms, LASOM provides the means to learn a conditional distribution of visual features, conditioned on geospatial coordinates. We demonstrate that the generated map not only preserves important visual information, but provides additional context in the form of visual similarity relationships between different geographical areas. We show how this information can be used to improve geotagging results when using large databases.

[1]  Jon M. Kleinberg,et al.  Mapping the world's photos , 2009, WWW '09.

[2]  Dana H. Ballard,et al.  Novelty detection using growing neural gas for visuo-spatial memory , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Michael Gertz,et al.  Latent geographic feature extraction from social media , 2012, SIGSPATIAL/GIS.

[4]  Alexei A. Efros,et al.  Image sequence geolocation with human travel priors , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[5]  Yongdong Zhang,et al.  Web Video Geolocation by Geotagged Social Resources , 2012, IEEE Transactions on Multimedia.

[6]  Jiebo Luo,et al.  Geo-location inference on news articles via multimodal pLSA , 2012, ACM Multimedia.

[7]  Teuvo Kohonen,et al.  Self-Organizing Maps, Third Edition , 2001, Springer Series in Information Sciences.

[8]  Wen-Huang Cheng,et al.  Augmenting mobile city-view image retrieval with context-rich user-contributed photos , 2011, ACM Multimedia.

[9]  Jiebo Luo,et al.  Aworldwide tourism recommendation system based on geotaggedweb photos , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Jurandy Almeida,et al.  A visual approach for video geocoding using bag-of-scenes , 2012, ICMR.

[11]  Jiebo Luo,et al.  Geotagging in multimedia and computer vision—a survey , 2010, Multimedia Tools and Applications.

[12]  Pavel Serdyukov,et al.  Placing flickr photos on a map , 2009, SIGIR.

[13]  Jiebo Luo,et al.  A WORLDWIDE TOURISM RECOMMENDATION SYSTEM BASED ON GEOTAGGED WEB PHOTOS , 2010 .

[14]  M. V. Velzen,et al.  Self-organizing maps , 2007 .

[15]  Alexei A. Efros,et al.  IM2GPS: estimating geographic information from a single image , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Shen Furao,et al.  Self-Organizing Incremental Neural Network and Its Application , 2010, ICANN.

[17]  Noah Snavely,et al.  Graph-Based Discriminative Learning for Location Recognition , 2013, International Journal of Computer Vision.

[18]  Jiebo Luo,et al.  Exploring user image tags for geo-location inference , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19]  U. Castellani,et al.  Geo-located image categorization and location recognition , 2009, Pattern Recognition and Image Analysis.

[20]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[21]  Shen Furao,et al.  An enhanced self-organizing incremental neural network for online unsupervised learning , 2007, Neural Networks.

[22]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.