Beyond "local", "categories" and "friends": clustering foursquare users with latent "topics"

In this work, we use foursquare check-ins to cluster users via topic modeling, a technique commonly used to classify text documents according to latent "themes". Here, however, the latent variables which group users can be thought of not as themes but rather as factors which drive check in behaviors, allowing for a qualitative understanding of influences on user check ins. Our model is agnostic of geo-spatial location, time, users' friends on social networking sites and the venue categories-we treat the existence of and intricate interactions between these factors as being latent, allowing them to emerge entirely from the data. We instantiate our model on data from New York and the San Francisco Bay Area and find evidence that the model is able to identify groups of people which are of different types (e.g. tourists), communities (e.g. users tightly clustered in space) and interests (e.g. people who enjoy athletics).

[1]  R. Park,et al.  Introduction to the Science of Sociology , 1921 .

[2]  S. Milgram A Psychological Map of New York City , 1972 .

[3]  P. Blau Inequality and Heterogeneity: A Primitive Theory of Social Structure , 1978 .

[4]  S. Raudenbush,et al.  Neighborhoods and violent crime: a multilevel study of collective efficacy. , 1997, Science.

[5]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Xing Xie,et al.  Mining user similarity based on location history , 2008, GIS '08.

[8]  Daniel Barbará,et al.  On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[9]  T. Jebara,et al.  CitySenseTM : multiscale space time clustering of GPS points and trajectories , 2009 .

[10]  Sara B. Soderstrom,et al.  Dynamics of Dyads in Social Networks: Assortative, Relational, and Proximity Mechanisms , 2010 .

[11]  Justin Cranshaw Seeing a home away from the home : Distilling proto-neighborhoods from incidental data with Latent Topic Modeling , 2010 .

[12]  John Zimmerman,et al.  I'm the mayor of my house: examining why people use foursquare - a social-driven location sharing application , 2011, CHI.

[13]  Kazutoshi Sumiya,et al.  Crowd-based urban characterization: extracting crowd behavioral patterns in urban areas from Twitter , 2011, LBSN '11.

[14]  Cecilia Mascolo,et al.  Exploiting place features in link prediction on location-based social networks , 2011, KDD.

[15]  Kyumin Lee,et al.  Exploring Millions of Footprints in Location Sharing Services , 2011, ICWSM.

[16]  Yu Zheng,et al.  Location-Based Social Networks: Users , 2011, Computing with Spatial Trajectories.

[17]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[18]  Wei-Ying Ma,et al.  Recommending friends and locations based on individual location history , 2011, ACM Trans. Web.

[19]  Chihiro Sato,et al.  Experience-Based Curiosity Model: Curiosity Extracting Model Regarding Individual Experiences of Urban Spaces , 2011, HCI.

[20]  Jiawei Han,et al.  LPTA: A Probabilistic Model for Latent Periodic Topic Analysis , 2011, 2011 IEEE 11th International Conference on Data Mining.

[21]  Cecilia Mascolo,et al.  An Empirical Study of Geographic User Activity Patterns in Foursquare , 2011, ICWSM.

[22]  Norman M. Sadeh,et al.  The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City , 2012, ICWSM.

[23]  jimi adams,et al.  Capturing context: Integrating spatial and social network analyses , 2012, Soc. Networks.

[24]  Robert W. Faris,et al.  Measuring 'neighborhood': Constructing network neighborhoods , 2012, Soc. Networks.

[25]  Henry A. Kautz,et al.  Finding your friends and following them to where you are , 2012, WSDM '12.

[26]  Cecilia Mascolo,et al.  A Tale of Many Cities: Universal Patterns in Human Urban Mobility , 2011, PloS one.