Paper Information - Discovering Geographic Regions in the City Using Social Multimedia and Open Data

In this paper we investigate the potential of social multimedia and open data for automatically identifying regions within the city. We conjecture that the regions may be characterized by specific patterns related to their visual appearance, the manner in which social media users describe them, and human mobility patterns. Therefore, we collect a dataset of Foursquare venues with their associated images and users, which we further enrich with a collection of city-specific Flickr images, annotations and users. Additionally, we collect a large number of neighbourhood statistics related to, for example, demographics, housing and services. We then represent the visual content of the images using a large set of semantic concepts output by a convolutional neural network and extract latent Dirichlet topics from their annotations. User, text and visual information, as well as the neighbourhood statistics, are further aggregated at the level of postal code regions, which we use as the basis for detecting larger regions in the city. To identify those regions, we perform clustering based on the individual modalities as well as on their ensemble. The experimental analysis shows that the automatically detected regions are meaningful and have the potential to improve our understanding of the dynamics and complexity of a city.

Marcel Worring | Stevan Rudinac | Jan Zahálka
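
The abstract describes clustering postal code regions per modality and then combining the results into an ensemble. The following is a minimal sketch of that general idea, not the authors' implementation: it assumes k-means for the per-modality clustering and a majority-vote co-association matrix for the ensemble, neither of which is specified in the abstract. All function names and the toy feature values are hypothetical.

```python
from math import dist  # Euclidean distance, Python 3.8+

def kmeans(points, k, n_iter=20):
    """Plain Lloyd's k-means with deterministic farthest-point initialisation."""
    centers = [points[0]]
    while len(centers) < k:
        # Next centre: the point farthest from all centres chosen so far.
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    labels = []
    for _ in range(n_iter):
        # Assign every point to its nearest centre, then recompute the centres.
        labels = [min(range(k), key=lambda j: dist(p, centers[j])) for p in points]
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centre if a cluster becomes empty
                centers[j] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels

def co_association(labelings):
    """Fraction of labelings in which each pair of items shares a cluster."""
    n = len(labelings[0])
    coassoc = [[0.0] * n for _ in range(n)]
    for labels in labelings:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    coassoc[i][j] += 1.0 / len(labelings)
    return coassoc

def consensus(coassoc, threshold=0.5):
    """Connected components of the graph linking pairs co-clustered in more
    than `threshold` of the labelings (an assumed consensus rule)."""
    n = len(coassoc)
    region = [-1] * n
    next_id = 0
    for start in range(n):
        if region[start] != -1:
            continue
        region[start] = next_id
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if region[j] == -1 and coassoc[i][j] > threshold:
                    region[j] = next_id
                    stack.append(j)
        next_id += 1
    return region

# Toy, made-up features for six postal code regions in three modalities.
modalities = {
    "visual": [(0.0, 0.1), (0.1, 0.0), (0.0, 0.0), (5.0, 5.1), (5.1, 5.0), (5.0, 5.0)],
    "text": [(1.0,), (1.1,), (0.9,), (9.0,), (9.1,), (8.9,)],
    # In this modality the third region resembles the second group instead.
    "mobility": [(0.0, 0.0), (0.2, 0.1), (4.0, 4.0), (4.1, 4.2), (4.0, 4.1), (4.2, 4.0)],
}

labelings = [kmeans(features, k=2) for features in modalities.values()]
coassoc = co_association(labelings)
regions = consensus(coassoc)
print(regions)  # [0, 0, 0, 1, 1, 1]
```

In this toy setup the mobility modality disagrees with the other two about the third region, but the majority vote in the co-association matrix still assigns it to the first group. The example only illustrates how an ensemble can smooth over a disagreeing modality; the threshold and the consensus rule are design choices made for the sketch.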
