Recognizing City Identity via Attribute Analysis of Geo-tagged Images

After hundreds of years of human settlement, each city has formed a distinct identity, distinguishing itself from other cities. In this work, we propose to characterize the identity of a city via an attribute analysis of 2 million geo-tagged images from 21 cities over 3 continents. First, we estimate the scene attributes of these images and use this representation to build a higher-level set of 7 city attributes, tailored to the form and function of cities. Then, we conduct the city identity recognition experiments on the geo-tagged images and identify images with salient city identity on each city attribute. Based on the misclassification rate of the city identity recognition, we analyze the visual similarity among different cities. Finally, we discuss the potential application of computer vision to urban planning.

[1]  Justin Cranshaw,et al.  Exploring venue-based city-to-city similarity measures , 2013, UrbComp '13.

[2]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[3]  César A. Hidalgo,et al.  The Collaborative Image of The City: Mapping the Inequality of Urban Perception , 2013, PloS one.

[4]  Zhe Jiang,et al.  Spatial Statistics , 2013 .

[5]  Alexei A. Efros,et al.  IM2GPS: estimating geographic information from a single image , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Jon M. Kleinberg,et al.  Mapping the world's photos , 2009, WWW '09.

[7]  Xin Chen,et al.  City-scale landmark identification on mobile devices , 2011, CVPR 2011.

[8]  Slava Kisilevich,et al.  Event-Based Analysis of People's Activities and Behavior Using Flickr and Panoramio Geotagged Photo Collections , 2010, 2010 14th International Conference Information Visualisation.

[9]  Thomas Deselaers,et al.  ClassCut for Unsupervised Class Segmentation , 2010, ECCV.

[10]  Jan-Michael Frahm,et al.  Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs , 2008, International Journal of Computer Vision.

[11]  James Hays,et al.  SUN attribute database: Discovering, annotating, and recognizing scene attributes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Yang Song,et al.  Tour the world: Building a web-scale landmark recognition engine , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Jack L. Nasar,et al.  The evaluative image of the city , 1997 .

[15]  Alexei A. Efros,et al.  What makes Paris look like Paris? , 2015, Commun. ACM.

[16]  Yong Jae Lee,et al.  Style-Aware Mid-level Representation for Discovering Visual Connections in Space and Time , 2013, 2013 IEEE International Conference on Computer Vision.

[17]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[18]  H. Proshansky,et al.  Place-identity: Physical world socialization of the self , 1983 .

[19]  Marco Brambilla,et al.  A revenue sharing mechanism for federated search and advertising , 2012, WWW.

[20]  Daniel P. Huttenlocher,et al.  Landmark classification in large-scale image collections , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[21]  Pietro Perona,et al.  Visual Recognition with Humans in the Loop , 2010, ECCV.

[22]  D. Sivakumar,et al.  A Tale of Two (Similar) Cities - Inferring City Similarity through Geo-spatial Query Log Analysis , 2011, KDIR.

[23]  Serge J. Belongie,et al.  Cross-View Image Geolocalization , 2013, CVPR.

[24]  Kevin Lynch,et al.  The Image of the City , 1960 .

[25]  David J. Crandall,et al.  Mining photo-sharing websites to study ecological phenomena , 2012, WWW.

[26]  Kristen Grauman,et al.  Relative attributes , 2011, 2011 International Conference on Computer Vision.

[27]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[28]  Byoungkwon An,et al.  Looking Beyond the Visible Scene , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Andrew J. Davison,et al.  Active Matching , 2008, ECCV.

[30]  N. Stanietsky,et al.  The interaction of TIGIT with PVR and PVRL2 inhibits human NK cell cytotoxicity , 2009, Proceedings of the National Academy of Sciences.

[31]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.