Mining Mobility Evolution from Check-In Datasets

The advances in location-acquisition and smart phone technologies have led to a myriad of location-based social media. Therefore, analyzing the increasing amount of spatio-temporal data emerges as an important topic. Most studies on geographic data mining focus on exploring static mobility patterns. As the amount of incoming data streams increases, revealing the temporal aspect of user mobility patterns is worth investigating. This paper targets on mining user mobility patterns over time (referred to as mobility evolution) from streams of check-in records. Intuitively, at each time slot, a mobility pattern indicates spatial regions where users stay. Therefore, given a set of time slots, mobility evolution refers a sequence of spatial regions at each time slot. Note that nearby time slots may have similar spatial region distribution. Thus, given check-in datasets, we use the idea of data compression to obtain a sequence of representative segments, where each representative segment captures spatial region distribution at the corresponding time interval. To measure the quality of a segmentation result, we propose a representation cost function based on the Minimum Description Length (MDL) principle. In addition, because deriving the sequence of segments incurs expensive computational cost, we propose a family of greedy algorithms for segmentation to serve diverse requirements: efficient compression, informative compression, and cost-effective compression. Besides, to handle the massive amount of incoming check-in data, we also propose an incremental compression approach to incrementally update the mobility evolution. We conduct experiments on Foursquare datasets to demonstrate both the effectiveness and efficiency of our proposed algorithms.

[1]  Cecilia Mascolo,et al.  Mining User Mobility Features for Next Place Prediction in Location-Based Services , 2012, 2012 IEEE 12th International Conference on Data Mining.

[2]  Christian S. Jensen,et al.  Path prediction and predictive range querying in road network databases , 2010, The VLDB Journal.

[3]  Shou-De Lin,et al.  Exploiting large-scale check-in data to recommend time-sensitive routes , 2012, UrbComp '12.

[4]  Kyumin Lee,et al.  Exploring Millions of Footprints in Location Sharing Services , 2011, ICWSM.

[5]  Changhu Wang,et al.  Equip tourists with knowledge mined from travelogues , 2010, WWW '10.

[6]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[7]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[8]  Christian S. Jensen,et al.  Retrieving top-k prestige-based relevant spatial web objects , 2010, Proc. VLDB Endow..

[9]  Christian S. Jensen,et al.  Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects , 2009, Proc. VLDB Endow..

[10]  Jae-Gil Lee,et al.  MoveMine: Mining moving object data for discovery of animal movement patterns , 2011, TIST.

[11]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[12]  Mao Ye,et al.  Exploiting geographical influence for collaborative point-of-interest recommendation , 2011, SIGIR.

[13]  V. Lakshmanan Automating the Analysis of Spatial Grids , 2012 .

[14]  Norman M. Sadeh,et al.  The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City , 2012, ICWSM.

[15]  Qing Liu,et al.  A Hybrid Prediction Model for Moving Objects , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[16]  V. Lakshmanan Automating the Analysis of Spatial Grids: A Practical Guide to Data Mining Geospatial Images for Human & Environmental Applications , 2012 .

[17]  Xing Xie,et al.  Collaborative location and activity recommendations with GPS history data , 2010, WWW '10.

[18]  Christian S. Jensen,et al.  Joint Top-K Spatial Keyword Query Processing , 2012, IEEE Transactions on Knowledge and Data Engineering.

[19]  Haixun Wang,et al.  An algorithmic approach to event summarization , 2010, SIGMOD Conference.

[20]  Anna Monreale,et al.  WhereNext: a location predictor on trajectory pattern mining , 2009, KDD.

[21]  Yu-Ru Lin,et al.  Mesoscopic Structure and Social Aspects of Human Mobility , 2012, PloS one.

[22]  Philip S. Yu,et al.  GraphScope: parameter-free mining of large time-evolving graphs , 2007, KDD '07.

[23]  Beng Chin Ooi,et al.  Collective spatial keyword querying , 2011, SIGMOD '11.

[24]  Xing Xie,et al.  Learning transportation mode from raw gps data for geographic applications on the web , 2008, WWW.