Selecting Individual and Population Models for Predicting Human Mobility

A large plethora of models to predict human mobility exists in the literature. The problem of how to select the most appropriate model to solve a specific mobility prediction task has however received only little attention. Yet, a wrong model choice may lead to severe performance losses. In this paper, we address the model selection problem in human mobility prediction and make the following contributions. We present SELECTOR, a generic framework to explore human mobility data and compute both population models and individual models to predict human mobility. The former are models that are adapted to the characteristics of an entire population of users and can be used to overcome the cold-start problem. The latter are prediction models optimized for individual users. We present and analyze the results obtainable using SELECTOR on the Nokia data set, which is one of the largest and richest, publicly available data sets of human mobility data. We show that for many users, generic population models can be used in place of individual models with negligible performance losses. Yet for about 25 percent of the users, individual models perform at least three percentage points better than population models. Thereby, we show that the use of phone context data does not lead to significantly better performance of human mobility predictors with respect to the case in which only temporal and spatial features are used. We further observe that the population models we derive are robust against the demographics of the users and that building different population models for different periods of the day leads to performance improvements. We make SELECTOR publicly available to allow other researchers and practitioners to explore further mobility data sets and to embed the code base of SELECTOR in their applications.

[1]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[2]  Gaetano Borriello,et al.  Extracting places from traces of locations , 2004, MOCO.

[3]  Marta C. González,et al.  Understanding individual human mobility patterns , 2008, Nature.

[4]  Cecilia Mascolo,et al.  A Tale of Many Cities: Universal Patterns in Human Urban Mobility , 2011, PloS one.

[5]  Deborah Estrin,et al.  Discovering semantically meaningful places from pervasive RF-beacons , 2009, UbiComp.

[6]  Cecilia Mascolo,et al.  NextPlace: A Spatio-temporal Prediction Framework for Pervasive Systems , 2011, Pervasive.

[7]  D. Gática-Pérez,et al.  Towards rich mobile phone datasets: Lausanne data collection campaign , 2010 .

[8]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[9]  Q. Vuong Likelihood Ratio Tests for Model Selection and Non-Nested Hypotheses , 1989 .

[10]  Timo Ojala,et al.  Bluetooth and WAP push based location-aware mobile advertising system , 2004, MobiSys '04.

[11]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[12]  Francesc J. Ferri,et al.  Comparative study of techniques for large-scale feature selection* *This work was suported by a SERC grant GR/E 97549. The first author was also supported by a FPI grant from the Spanish MEC, PF92 73546684 , 1994 .

[13]  Sunny Consolvo,et al.  Learning and Recognizing the Places We Go , 2005, UbiComp.

[14]  Saeed Moghaddam,et al.  MobileMiner: mining your frequent patterns on your phone , 2014, UbiComp.

[15]  Lionel M. Ni,et al.  An unsupervised framework for sensing individual and cluster behavior patterns from human mobile data , 2012, UbiComp.

[16]  Silvia Santini,et al.  How long are you staying?: predicting residence time from human mobility traces , 2013, MobiCom.

[17]  W. Bischof,et al.  Daily time spent indoors in German homes--baseline data for the assessment of indoor exposure of German occupants. , 2005, International journal of hygiene and environmental health.

[18]  David Kotz,et al.  Extracting a Mobility Model from Real User Traces , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[19]  Gavin Smith,et al.  A refined limit on the predictability of human mobility , 2014, 2014 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[20]  Anna Monreale,et al.  WhereNext: a location predictor on trajectory pattern mining , 2009, KDD.

[21]  Roy D. Yates,et al.  EdgeBuffer: Caching and prefetching content at the edge in the MobilityFirst future Internet architecture , 2015, 2015 IEEE 16th International Symposium on A World of Wireless, Mobile and Multimedia Networks (WoWMoM).

[22]  A. Wayne Whitney,et al.  A Direct Method of Nonparametric Measurement Selection , 1971, IEEE Transactions on Computers.

[23]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[24]  Imad Aad,et al.  The Mobile Data Challenge: Big Data for Mobile Computing Research , 2012 .

[25]  A. Rogers,et al.  Exploring Periods of Low Predictability in Daily Life Mobility , 2012 .

[26]  Mingyan Liu,et al.  Building realistic mobility models from coarse-grained traces , 2006, MobiSys '06.

[27]  Silvia Santini,et al.  Predicting household occupancy for smart heating control: A comparative performance analysis of state-of-the-art approaches , 2014 .

[28]  Ravi Jain,et al.  Predictability of WLAN Mobility and Its Effects on Bandwidth Provisioning , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[29]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[30]  Daniel Gatica-Perez,et al.  Where and what: Using smartphones to predict next locations and applications in daily life , 2014, Pervasive Mob. Comput..

[31]  Katarzyna Wac,et al.  Getting closer: an empirical investigation of the proximity of user to their smart phones , 2011, UbiComp '11.

[32]  John Krumm,et al.  Placer: semantic place labels from diary data , 2013, UbiComp.

[33]  Silvia Santini,et al.  The influence of temporal and spatial features on the performance of next-place prediction algorithms , 2013, UbiComp.

[34]  Antonio Lima,et al.  Interdependence and predictability of human mobility and social interactions , 2012, Pervasive Mob. Comput..

[35]  John Krumm,et al.  PreHeat: controlling home heating using occupancy prediction , 2011, UbiComp '11.

[36]  Daniel Gatica-Perez,et al.  Discovering places of interest in everyday life from smartphone data , 2011, Multimedia Tools and Applications.

[37]  Mirco Musolesi,et al.  InterruptMe: designing intelligent prompting mechanisms for pervasive applications , 2014, UbiComp.

[38]  Jin-Hyuk Hong,et al.  Understanding and prediction of mobile application usage for smart phones , 2012, UbiComp.

[39]  Eric Horvitz,et al.  Predestination: Inferring Destinations from Partial Trajectories , 2006, UbiComp.

[40]  Xing Xie,et al.  Mining Individual Life Pattern Based on Location History , 2009, 2009 Tenth International Conference on Mobile Data Management: Systems, Services and Middleware.

[41]  Ian Oakley,et al.  Indoor-ALPS: an adaptive indoor location prediction system , 2014, UbiComp.

[42]  Hojung Cha,et al.  Evaluating mobility models for temporal prediction with high-granularity mobility data , 2012, 2012 IEEE International Conference on Pervasive Computing and Communications.

[43]  Silvia Santini,et al.  A population model for predicting human mobility , 2015, UbiComp/ISWC Adjunct.

[44]  Nicholas R. Jennings,et al.  Modelling heterogeneous location habits in human populations for location prediction under data sparsity , 2013, UbiComp.

[45]  David M. Pennock,et al.  Categories and Subject Descriptors , 2001 .

[46]  Thad Starner,et al.  Using GPS to learn significant locations and predict movement across multiple users , 2003, Personal and Ubiquitous Computing.

[47]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[48]  Alastair R. Beresford,et al.  Device Analyzer: Understanding Smartphone Usage , 2013, MobiQuitous.

[49]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[50]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[51]  E. M. Wright,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[52]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[53]  Wen-Jing Hsu,et al.  Predictability of individuals' mobility with high-resolution positioning data , 2012, UbiComp.

[54]  Olivier Dousse,et al.  Place Learning via Direct WiFi Fingerprint Clustering , 2012, 2012 IEEE 13th International Conference on Mobile Data Management.

[55]  Silvia Santini,et al.  Analyzing the Influence of Phone Context Data on the Performance of Human Mobility Predictors , 2015 .

[56]  Daniel Gatica-Perez,et al.  Contextual conditional models for smartphone-based human mobility prediction , 2012, UbiComp.