Identifying related landmark tags in urban scenes using spatial and semantic clustering

Abstract There is considerable interest in developing landmark saliency models as a basis for describing urban landscapes, and in constructing wayfinding instructions, for text and spoken dialogue based systems. The challenge lies in knowing the truthfulness of such models; is what the model considers salient the same as what is perceived by the user? The method developed in this research identifies related annotated tags supplied from a web based experiment in which users were asked to tag the most salient features on urban images for the purposes of navigation and exploration. The tag collections may be used to rank landmark popularity in each scene, but the challenge is in determining which tags relate to the same object (e.g. tags relating to a particular cafe). Existing clustering techniques did not perform well for this task, and it was therefore necessary to develop a new spatial-semantic clustering method which considered the proximity of nearby tags and the similarity of their label content. The annotation similarity was initially calculated using trigrams in conjunction with a synonym list, generating a set of networks formed from the links between related tags. These networks were used to build related word lists encapsulating conceptual connections (e.g. church tower related to clock) so that during a secondary pass of the data, related network segments could be merged. This approach gives interesting insight into the partonomic relationships between the constituent parts of landmarks and the range and frequency of terms used to describe them.

[1]  Michael Isard,et al.  Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[2]  Ervin H. Zube,et al.  Landscape perception: Research, application and theory , 1982 .

[3]  William A. Mackaness,et al.  Development of a Speech-Based Augmented Reality System to Support Exploration of Cityscape , 2006, Trans. GIS.

[4]  Stephan Winter,et al.  Computation of the Salience of Features , 2003 .

[5]  Yiannis Kompatsiaris,et al.  Cluster-Based Landmark and Event Detection for Tagged Photo Collections , 2011, IEEE MultiMedia.

[6]  Sung-Bae Cho,et al.  Exploiting indoor location and mobile information for context-awareness service , 2012, Inf. Process. Manag..

[7]  HuaXian-Sheng,et al.  Content-based tag processing for Internet social images , 2011 .

[8]  Martin Tomko,et al.  A dialog-driven process of generating route directions , 2008, Comput. Environ. Urban Syst..

[9]  B. Jiang The Image of the City out of the Underlying Scaling of City Artifacts or Locations , 2012, 1209.1112.

[10]  Eitan Marder-Eppstein,et al.  Project Tango , 2016, SIGGRAPH Real-Time Live!.

[11]  Karin Schweizer,et al.  Spatial Cognition: The Role of Landmark, Route, and Survey Knowledge in Human and Robot Navigation , 1997, GI Jahrestagung.

[12]  Barbara Tversky,et al.  Cognitive Maps, Cognitive Collages, and Spatial Mental Models , 1993, COSIT.

[13]  E. Shafer,et al.  How to measure preferences for photographs of natural landscapes , 1977 .

[14]  Stephan Winter,et al.  Selection of Salient Features for Route Directions , 2004, Spatial Cogn. Comput..

[15]  John Seely Brown,et al.  The Origins of Ubiquitous Computing Research at PARC in the Late 1980s , 1999, IBM Syst. J..

[16]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[17]  M. Raubal,et al.  Focalizing measures of salience for wayfinding , 2005 .

[18]  Stephan Winter,et al.  Enriching Wayfinding Instructions with Local Landmarks , 2002, GIScience.

[19]  Birgit Elias,et al.  Extracting Landmarks with Data Mining Methods , 2003, COSIT.

[20]  Tom Gaertner,et al.  Behavior And Environment Psychological And Geographical Approaches , 2016 .

[21]  A. Siegel,et al.  The development of spatial representations of large-scale environments. , 1975, Advances in child development and behavior.

[22]  Sue Long,et al.  Cyberguide: prototyping context-aware mobile applications , 1996, CHI 1996.

[23]  Jude W. Shavlik,et al.  Machine Learning: Proceedings of the Fifteenth International Conference , 1998 .

[24]  Dong Liu,et al.  Content-based tag processing for Internet social images , 2010, Multimedia Tools and Applications.

[25]  Stéphane Herbin,et al.  Semantic hierarchies for image annotation: A survey , 2012, Pattern Recognit..

[26]  Oliver Lemon,et al.  Talk the Walk and Walk the talk: Design, Implementation and Evaluation of a Spoken Dialogue System for Route Following and City Learning , 2014 .

[27]  Daniel R. Montello,et al.  Elements of Good Route Directions in Familiar and Unfamiliar Environments , 1999, COSIT.

[28]  Guanling Chen,et al.  A Survey of Context-Aware Mobile Computing Research , 2000 .

[29]  Alan T. Murray,et al.  Spatial Clustering Overview and Comparison: Accuracy, Sensitivity, and Computational Expense , 2014 .

[30]  Stephan Winter,et al.  Including landmarks in routing instructions , 2010, J. Locat. Based Serv..

[31]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[32]  Martin Tomko,et al.  Landmark Hierarchies in Context , 2008 .

[33]  Claus Brenner,et al.  Automatic Generation and Application of Landmarks in Navigation Data Sets , 2004, SDH.

[34]  T. Daniel,et al.  Methodological Issues in the Assessment of Landscape Quality , 1983 .

[35]  J. Jonides,et al.  Evidence of hierarchies in cognitive maps , 1985, Memory & cognition.

[36]  Hiroshi Ishii,et al.  Iterative design of seamless collaboration media , 1994, CACM.

[37]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[38]  Siobhan Chapman Logic and Conversation , 2005 .

[39]  W. Bruce Croft,et al.  Quary Expansion Using Local and Global Document Analysis , 1996, SIGIR Forum.

[40]  D. L. Linton,et al.  The assessment of scenery as a natural resource , 1968 .

[41]  Qi Tian,et al.  Intelligent photo clustering with user interaction and distance metric learning , 2012, Pattern Recognit. Lett..

[42]  Sabine Timpf,et al.  On the assessment of landmark salience for human navigation , 2007, Cognitive Processing.

[43]  Mor Naaman,et al.  World explorer: visualizing aggregate data from unstructured text in geo-referenced collections , 2007, JCDL '07.

[44]  Bernard W. Silverman,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[45]  Stefan Winkler,et al.  PhotoCluster a multi-clustering technique for near-duplicate detection in personal photo collections , 2015, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[46]  Elena M. Zamora,et al.  The use of trigram analysis for spelling error detection , 1981, Inf. Process. Manag..

[47]  P. Heidorn,et al.  Chapter 7 The Structure of Cognitive Maps: Representations and Processes , 1993 .