Hoodsquare: Modeling and Recommending Neighborhoods in Location-Based Social Networks

Information garnered from activity on location-based social networks can be harnessed to characterize urban spaces and organize them into neighborhoods. We represent geographic points in the city using spatio-temporal information about Foursquare user check-ins and semantic information about places, with the goal of developing features to input into a novel neighborhood detection algorithm. The algorithm first employs a similarity metric that assesses the homogeneity of a geographic area, and then with a simple mechanism of geographic navigation, it detects the boundaries of a city's neighborhoods. The models and algorithms devised are subsequently integrated into a publicly available, map-based tool named Hood square that allows users to explore activities and neighborhoods in cities around the world. Finally, we evaluate Hood square in the context of are commendation application where user profiles are matched to urban neighborhoods. By comparing with a number of baselines, we demonstrate how Hood square can be used to accurately predict the home neighborhood of Twitter users. We also show that we are able to suggest neighborhoods geographically constrained in size, a desirable property in mobile recommendation scenarios for which geographical precision is key.

[1]  Cecilia Mascolo,et al.  An Empirical Study of Geographic User Activity Patterns in Foursquare , 2011, ICWSM.

[2]  Tim Butler,et al.  Social Capital, Gentrification and Neighbourhood Change in London: A Comparison of Three South London Neighbourhoods , 2001 .

[3]  Michael Mehaffy,et al.  Urban nuclei and the geometry of streets: The ‘emergent neighborhoods’ model , 2009 .

[4]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[5]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[6]  Eric Sun,et al.  Location3: How Users Share and Respond to Location-Based Data on Social , 2011, ICWSM.

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  H. Hotelling Stability in Competition , 1929 .

[9]  Norman M. Sadeh,et al.  The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City , 2012, ICWSM.

[10]  Anne R. Pebley,et al.  Neighborhood Definitions and the Spatial Dimension of Daily Life in Los Angeles , 2002 .

[11]  Robert J. Chaskin,et al.  Perspectives on Neighborhood and Community: A Review of the Literature , 1997, Social Service Review.

[12]  Justin Cranshaw Seeing a home away from the home : Distilling proto-neighborhoods from incidental data with Latent Topic Modeling , 2010 .

[13]  A. D. Diez Roux,et al.  Investigating neighborhood and area effects on health. , 2001, American journal of public health.

[14]  S. Gosling,et al.  A Theory of the Emergence, Persistence, and Expression of Geographic Variation in Psychological Characteristics , 2008, Perspectives on psychological science : a journal of the Association for Psychological Science.

[15]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[16]  Sandro Galea,et al.  Defining neighborhood boundaries for urban health research. , 2007, American journal of preventive medicine.

[17]  Charles M. Tiebout A Pure Theory of Local Expenditures , 1956, Journal of Political Economy.

[18]  Virgílio A. F. Almeida,et al.  Psychological maps 2.0: a web engagement enterprise starting in London , 2013, WWW.

[19]  Zhiyong Lu,et al.  Automatic Extraction of Clusters from Hierarchical Clustering Representations , 2003, PAKDD.