Who, Where, When, and What

Micro-blogging services and location-based social networks, such as Twitter, Weibo, and Foursquare, enable users to post short messages with timestamps and geographical annotations. The rich spatial-temporal-semantic information of individuals embedded in these geo-annotated short messages provides exciting opportunity to develop many context-aware applications in ubiquitous computing environments. Example applications include contextual recommendation and contextual search. To obtain accurate recommendations and most relevant search results, it is important to capture users’ contextual information (e.g., time and location) and to understand users’ topical interests and intentions. While time and location can be readily captured by smartphones, understanding user’s interests and intentions calls for effective methods in modeling user mobility behavior. Here, user mobility refers to who visits which place at what time for what activity. That is, user mobility behavior modeling must consider user (Who), spatial (Where), temporal (When), and activity (What) aspects. Unfortunately, no previous studies on user mobility behavior modeling have considered all of the four aspects jointly, which have complex interdependencies. In our preliminary study, we propose the first solution named W4 (short for Who, Where, When, and What) to discover user mobility behavior from the four aspects. In this article, we further enhance W4 and propose a nonparametric Bayesian model named EW4 (short for Enhanced W4). EW4 requires no parameter tuning and achieves better results over W4 in our experiments. Given some of the four aspects of a user (e.g., time), our model is able to infer information of the other aspects (e.g., location and topical words). Thus, our model has a variety of context-aware applications, particularly in contextual search and recommendation. Experimental results on two real-world datasets show that the proposed model is effective in discovering users’ spatial-temporal topics. The model also significantly outperforms state-of-the-art baselines for various tasks including location prediction for tweets and requirement-aware location recommendation.

[1]  Jason Baldridge,et al.  Simple supervised document geolocation with geodesic grids , 2011, ACL.

[2]  Changhu Wang,et al.  Equip tourists with knowledge mined from travelogues , 2010, WWW '10.

[3]  Brendan T. O'Connor,et al.  A Latent Variable Model for Geographic Lexical Variation , 2010, EMNLP.

[4]  Sheila Kinsella,et al.  "I'm eating a sandwich in Glasgow": modeling locations with tweets , 2011, SMUC '11.

[5]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[6]  Alexander J. Smola,et al.  Discovering geographical topics in the twitter stream , 2012, WWW.

[7]  Martin Ester,et al.  Spatial topic modeling in online social media for location recommendation , 2013, RecSys.

[8]  Cecilia Mascolo,et al.  Talking Places: Modelling and Analysing Linguistic Content in Foursquare , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[9]  Nadia Magnenat-Thalmann,et al.  Time-aware point-of-interest recommendation , 2013, SIGIR.

[10]  Kyumin Lee,et al.  Exploring Millions of Footprints in Location Sharing Services , 2011, ICWSM.

[11]  Beng Chin Ooi,et al.  Big data: the driver for innovation in databases , 2014 .

[12]  Carl E. Rasmussen,et al.  The Infinite Gaussian Mixture Model , 1999, NIPS.

[13]  Chong Wang,et al.  Mining geographic knowledge using location aware topic model , 2007, GIR '07.

[14]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[15]  T. Geisel,et al.  The scaling laws of human travel , 2006, Nature.

[16]  Nadia Magnenat-Thalmann,et al.  Who, where, when and what: discover spatio-temporal topics for twitter users , 2013, KDD.

[17]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[18]  Alexander J. Smola,et al.  Hierarchical geographical modeling of user locations from social media posts , 2013, WWW.

[19]  Jiawei Han,et al.  Geographical topic discovery and comparison , 2011, WWW.

[20]  Martha Larson,et al.  The where in the tweet , 2011, CIKM '11.

[21]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[22]  Chenliang Li,et al.  Twevent: segment-based event detection from tweets , 2012, CIKM.

[23]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[24]  Gregor Heinrich “ Infinite LDA ” – Implementing the HDP with minimum code complexity , 2011 .

[25]  Gao Cong,et al.  Graph-based Point-of-interest Recommendation with Geographical and Temporal Influences , 2014, CIKM.

[26]  Sergej Sizov,et al.  GeoFolk: latent spatial semantics in web 2.0 social media , 2010, WSDM '10.

[27]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[28]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[29]  Filip Radlinski,et al.  Inferring and using location metadata to personalize web search , 2011, SIGIR.

[30]  Chaoming Song,et al.  Modelling the scaling properties of human mobility , 2010, 1010.0436.

[31]  Oren Etzioni,et al.  Open domain event extraction from twitter , 2012, KDD.

[32]  Chao Liu,et al.  A probabilistic approach to spatiotemporal theme pattern mining on weblogs , 2006, WWW '06.

[33]  Steffen Staab,et al.  Detecting non-gaussian geographical topics in tagged photo collections , 2014, WSDM.

[34]  Mao Ye,et al.  Exploiting geographical influence for collaborative point-of-interest recommendation , 2011, SIGIR.

[35]  Fernando Diaz,et al.  Geographic features in web search retrieval , 2008, GIR '08.

[36]  Wei Chu,et al.  Cohort modeling for enhanced personalized search , 2014, SIGIR.