Predict User In-World Activity via Integration of Map Query and Mobility Trace

People often resort to map search engine or other locationbased services for location information when planning long trips or local navigation, and their map queries as well as mobility trace will be accumulated and stored in user log. These data offers valuable information for studying the mechanism of human mobility pattern, furthermore, map query data enable us to sense users’ real-time interests towards locations, and even to forecast their in-world activity in the near future. In this paper, we unveil the connection between users’ map queries and their in-world explorations, and prove the predictability of query-activity formation using two large-scale datasets, the complete map query log and mobility traces of 4 million Baidu map users, which comprise of 118 million map queries and 6.5 billion GPS location records during consecutive 3 months. To the best of our knowledge, it is the first attempt to extensively assess the unique qualities of map query data and predict whether queries about one location would actually lead to in-world visits using heterogeneous data sources. We first characterize the properties of these two datasets, then extract interesting features to quantify their correlation, finally we construct gradient boosting model for prediction, and describe applications empowered by our findings, such as mobility modeling and urban flow estimation.

[1]  Jaime Teevan,et al.  Understanding the importance of location, time, and people in mobile local search behavior , 2011, Mobile HCI.

[2]  Carlo Ratti,et al.  The City Browser: Utilizing Massive Call Data to Infer City Mobility Dynamics , 2014 .

[3]  H. Varian,et al.  Predicting the Present with Google Trends , 2009 .

[4]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[5]  Xing Xie,et al.  Mining Individual Life Pattern Based on Location History , 2009, 2009 Tenth International Conference on Mobile Data Management: Systems, Services and Middleware.

[6]  Torsten Suel,et al.  Analysis of geographic queries in a search engine log , 2008, LocWeb.

[7]  Ling Chen,et al.  A context-aware personalized travel recommendation system based on geotagged social media data mining , 2013, Int. J. Geogr. Inf. Sci..

[8]  Marc-Olivier Killijian,et al.  Next place prediction using mobility Markov chains , 2012, MPM '12.

[9]  Jaime Teevan,et al.  Information re-retrieval: repeat queries in Yahoo's logs , 2007, SIGIR.

[10]  Ryen W. White,et al.  Here and there: goals, activities, and predictions about location from geotagged queries , 2013, SIGIR.

[11]  Norman M. Sadeh,et al.  The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City , 2012, ICWSM.

[12]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[13]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[14]  Andrew McCallum,et al.  Chinese Segmentation and New Word Detection using Conditional Random Fields , 2004, COLING.

[15]  林茂喜 Location-based service system and method , 2011 .

[16]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[17]  Ryen W. White,et al.  Pursuing insights about healthcare utilization via geocoded search queries , 2013, SIGIR.

[18]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Nicholas Jing Yuan,et al.  We know how you live: exploring the spectrum of urban lifestyles , 2013, COSN '13.