Mining online footprints to predict user’s next location

ABSTRACT Social media applications are widely deployed in mobile platforms equipped with built-in GPS tracking devices, and these devices have led to an unprecedented collection of geolocated data (geo-tags). Geo-tags, along with place names, offer new opportunities to explore the trajectory and mobility patterns of social media users. However, trajectory data captured by social media are sparsely and irregularly spaced and therefore have varying degrees of resolution in both space and time. Previous studies on next location prediction are mostly applicable for detecting the upcoming location of a moving object using dense GPS trajectories where locations are recorded at regular time intervals (e.g., 1 minute). Additionally, point features are commonly used to represent the locations of visits, but using point features cannot capture the variability of human mobility. This article introduces a new methodology to predict an individual’s next location based on sparse footprints accumulated over a long time period using social networks, and uses polygons to represent the location corresponding to the physical activity area of individuals. First, the density-based spatial clustering algorithm is employed to discover the most representative activity zones that an individual frequently visits on a daily basis, and a polygon-based region is then derived for each representative activity zone. A sparse mobility Markov chain model considering both the movements and online behaviors of the social media user is trained and used to predict the user’s next location. Initial experiments with a group of Washington DC Twitter users demonstrate that the proposed methodology successfully discovers the activity regions and predicts the user’s next location with accuracy approaching 78.94%.

[1]  Qunying Huang,et al.  Activity patterns, socioeconomic status and urban spatial structure: what can social media data tell us? , 2016, Int. J. Geogr. Inf. Sci..

[2]  Shashi Shekhar,et al.  Discovering personal gazetteers: an interactive clustering approach , 2004, GIS '04.

[3]  Qunying Huang,et al.  From where do tweets originate?: a GIS approach for user location inference , 2014, LBSN '14.

[4]  Cecilia Mascolo,et al.  Exploiting place features in link prediction on location-based social networks , 2011, KDD.

[5]  Thad Starner,et al.  Using GPS to learn significant locations and predict movement across multiple users , 2003, Personal and Ubiquitous Computing.

[6]  Jiawei Han,et al.  Geographic Data Mining and Knowledge Discovery , 2001 .

[7]  Özgür Ulusoy,et al.  A data mining approach for location prediction in mobile environments , 2005, Data Knowl. Eng..

[8]  David W. S. Wong,et al.  Modeling and Visualizing Regular Human Mobility Patterns with Uncertainty: An Example Using Twitter Data , 2015 .

[9]  Bart Kuijpers,et al.  Towards Semantic Trajectory Knowledge Discovery , 2007 .

[10]  Ryosuke Shibasaki,et al.  Activity-Aware Map: Identifying Human Daily Activity Pattern Using Mobile Phone Data , 2010, HBU.

[11]  Akinori Asahara,et al.  Pedestrian-movement prediction based on mixed Markov-chain model , 2011, GIS.

[12]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[13]  Kristina Chodorow,et al.  MongoDB: The Definitive Guide , 2010 .

[14]  Daqiang Zhang,et al.  Predicting Mobile Phone User Locations by Exploiting Collective Behavioral Patterns , 2012, 2012 9th International Conference on Ubiquitous Intelligence and Computing and 9th International Conference on Autonomic and Trusted Computing.

[15]  Chun How Tan,et al.  Beyond "local", "categories" and "friends": clustering foursquare users with latent "topics" , 2012, UbiComp.

[16]  Marc-Olivier Killijian,et al.  Next place prediction using mobility Markov chains , 2012, MPM '12.

[17]  Wang-Chien Lee,et al.  Semantic trajectory mining for location prediction , 2011, GIS.

[18]  Shih-Lung Shaw,et al.  Exploratory data analysis of activity diary data: a space-time GIS approach , 2011 .

[19]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[20]  Qing Liu,et al.  A Hybrid Prediction Model for Moving Objects , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[21]  Sébastien Gambs,et al.  Show me how you move and I will tell you who you are , 2010, SPRINGL '10.

[22]  Mikolaj Morzy,et al.  Mining Frequent Trajectories of Moving Objects for Location Prediction , 2007, MLDM.

[23]  Jeremy Mennis,et al.  Spatial data mining and geographic knowledge discovery - An introduction , 2009, Comput. Environ. Urban Syst..

[24]  Anna Monreale,et al.  WhereNext: a location predictor on trajectory pattern mining , 2009, KDD.

[25]  Sechang Oh,et al.  Using an Adaptive Search Tree to Predict User Location , 2012, J. Inf. Process. Syst..

[26]  Xing Xie,et al.  GeoLife: A Collaborative Social Networking Service among User, Location and Trajectory , 2010, IEEE Data Eng. Bull..

[27]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[28]  Cecilia Mascolo,et al.  A Tale of Many Cities: Universal Patterns in Human Urban Mobility , 2011, PloS one.

[29]  Qunying Huang,et al.  A data-driven framework for archiving and exploring social media data , 2014, Ann. GIS.