Geo-referenced Tourist Attraction Photo Tagging by Mining Community Photo Collections

The advent of photo sharing sites like Flickr has drastically increased the volume of community photo collections on the web. Also the rising popularity of the mobile devices with GPS cameras like iPhone has made most of the photos geo-tagged. These provide new opportunities for automatically tagging the geo-referenced photos such as the tourist attraction photos. In this paper, we propose a framework for automatically tagging geo-referenced tourist attraction photos through mining the community photo collections. The photos collected from social sites are first clustered by fusing several modalities such as GPS and visual features, and then the tags of each cluster are extracted via a simple TF-IDF weighted voting scheme. Finally, for a tourist attraction photo taken with a GPS camera, it is annotated by the tags of the best matched cluster. We download a lot of photos located around some places of interest in Beijing from Flickr and manually construct a geo-referenced photo dataset. Experiment on the dataset shows an overall good performance.

[1]  Noel E. O'Connor,et al.  Automated Annotation of Landmark Images Using Community Contributed Datasets and Web Resources , 2010, SAMT.

[2]  Luc Van Gool,et al.  World-scale mining of objects and events from community photo collections , 2008, CIVR '08.

[3]  Larry D. Hostetler,et al.  The estimation of the gradient of a density function, with applications in pattern recognition , 1975, IEEE Trans. Inf. Theory.

[4]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[5]  Zi Huang,et al.  Mining multi-tag association for image tagging , 2011, World Wide Web.

[6]  B. S. Manjunath,et al.  Global annotation on georeferenced photographs , 2009, CIVR '09.

[7]  Touradj Ebrahimi,et al.  Object-based tag propagation for semi-automatic annotation of images , 2010, MIR '10.

[8]  Changsheng Xu,et al.  Paint the City Colorfully: Location Visualization from Multiple Themes , 2013, MMM.

[9]  Marcel Worring,et al.  Learning Social Tag Relevance by Neighbor Voting , 2009, IEEE Transactions on Multimedia.

[10]  Alexei A. Efros,et al.  IM2GPS: estimating geographic information from a single image , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Wei-Ying Ma,et al.  AnnoSearch: Image Auto-Annotation by Search , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[12]  Dong Liu,et al.  Image retagging , 2010, ACM Multimedia.

[13]  Steffen Staab,et al.  Semantic Multimedia , 2008, Reasoning Web.

[14]  Luc Van Gool,et al.  I know what you did last summer: object-level auto-annotation of holiday snaps , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[15]  Tat-Seng Chua,et al.  Research and applications on georeferenced multimedia: a survey , 2010, Multimedia Tools and Applications.

[16]  Xiaowei Xu,et al.  SCAN: a structural clustering algorithm for networks , 2007, KDD '07.

[17]  Vladimir Pavlovic,et al.  Baselines for Image Annotation , 2010, International Journal of Computer Vision.

[18]  Jiebo Luo,et al.  Inferring generic activities and events from image content and bags of geo-tags , 2008, CIVR '08.