Reconstructing Activity Location Sequences From Incomplete Check-In Data: A Semi-Markov Continuous-Time Bayesian Network Model

Geo-location data from the check-ins made in online social media offers us information, in new ways, to understand activity-location choices of a large number of people. However, one of the major challenges of using check-in data is that it has missing activities, since users share their activities voluntarily. In this paper, we present a probabilistic modeling approach to reconstruct user activity-location sequences from this incomplete activity participation information. Specifically, we answer the question of how to predict an individual’s next activity, its duration and location given the incomplete trajectory data. The model describes the dynamics of individual activity participation behavior evolving over continuous time. A semi-Markov modeling approach is used to capture the stochastic processes involved in the activity generation mechanism. We present a particle-based Markov chain Monte Carlo sampler to run inference over the model. We further develop an expectation-maximization algorithm to learn the unknown parameters of the model from incomplete trajectory data. Finally, the method is applied to synthetically generated activity-location sequences and a data set of Foursquare check-ins of the users from New York City. Our experiments show that this method can successfully extract the true transition and duration distributions given the incomplete trajectory information. The proposed approach can help building many intelligent transportation applications using check-in data.

[1]  Satish V. Ukkusuri,et al.  Urban activity pattern classification using topic models from online geo-location data , 2014 .

[2]  Ryuichi Kitamura,et al.  Recurrence of Daily Travel Patterns , 2007 .

[3]  Jing Xu,et al.  Continuous Time Bayesian Networks for Host Level Network Intrusion Detection , 2008, ECML/PKDD.

[4]  Fabio Pianese,et al.  Discovering and predicting user routines by differential analysis of social network traces , 2013, 2013 IEEE 14th International Symposium on "A World of Wireless, Mobile and Multimedia Networks" (WoWMoM).

[5]  Vinayak A. Rao,et al.  Markov-modulated Marked Poisson Processes for Check-in Data , 2016, ICML.

[6]  Pu Wang,et al.  Development of origin–destination matrices using mobile phone call data , 2014 .

[7]  Carlo Ratti,et al.  Understanding individual mobility patterns from urban sensing data: A mobile phone trace example , 2013 .

[8]  Jeff A. Bilmes,et al.  A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .

[9]  Christian P. Robert,et al.  The Bayesian choice : from decision-theoretic foundations to computational implementation , 2007 .

[10]  Steven R. Lerman,et al.  The Use of Disaggregate Choice Models in Semi-Markov Process Models of Trip Chaining Behavior , 1979 .

[11]  Ryuichi Kitamura,et al.  Micro-simulation of daily activity-travel patterns for travel demand forecasting , 2000 .

[12]  Cecilia Mascolo,et al.  A Tale of Many Cities: Universal Patterns in Human Urban Mobility , 2011, PloS one.

[13]  Kay W. Axhausen,et al.  Location choice modeling for shopping and leisure activities with MATSim , 2009 .

[14]  Yee Whye Teh,et al.  MCMC for continuous-time discrete-state systems , 2012, NIPS.

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  Hjp Harry Timmermans,et al.  A learning-based transportation oriented simulation system , 2004 .

[17]  Henry A. Kautz,et al.  Extending Continuous Time Bayesian Networks , 2005, AAAI.

[18]  Daphne Koller,et al.  Expectation Maximization and Complex Duration Distributions for Continuous Time Bayesian Networks , 2005, UAI.

[19]  S. Chib,et al.  Understanding the Metropolis-Hastings Algorithm , 1995 .

[20]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[21]  Torsten Hägerstraand WHAT ABOUT PEOPLE IN REGIONAL SCIENCE , 1970 .

[22]  Fred Mannering,et al.  Modeling Travelers' Postwork Activity Involvement: Toward a New Methodology , 1993, Transp. Sci..

[23]  Eric Sun,et al.  Location 3 : How Users Share and Respond to Location-Based Data on Social Networking Sites , 2011 .

[24]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[25]  Jing Xu,et al.  Importance Sampling for Continuous Time Bayesian Networks , 2010, J. Mach. Learn. Res..

[26]  Cecilia Mascolo,et al.  An Empirical Study of Geographic User Activity Patterns in Foursquare , 2011, ICWSM.

[27]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[28]  Matthew J. Roorda,et al.  Validation of TASHA: A 24-Hour Activity Scheduling Microsimulation Model , 2008 .

[29]  Ben Taskar,et al.  Graphical Models in a Nutshell , 2007 .

[30]  Lun Wu,et al.  Intra-Urban Human Mobility and Activity Transition: Evidence from Social Media Check-In Data , 2014, PloS one.

[31]  Samiul Hasan Modeling urban mobility dynamics using geo-location data , 2013 .

[32]  Norman M. Sadeh,et al.  The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City , 2012, ICWSM.

[33]  Harry J. P. Timmermans,et al.  Unconditional and conditional competing risk models of activity duration and activity sequencing decisions: An empirical comparison , 2002, J. Geogr. Syst..

[34]  A. Doucet,et al.  Particle Markov chain Monte Carlo methods , 2010 .

[35]  Daphne Koller,et al.  Continuous Time Bayesian Networks , 2012, UAI.

[36]  Yong Gao,et al.  Uncovering Patterns of Inter-Urban Trip and Spatial Interaction from Social Media Check-In Data , 2013, PloS one.

[37]  Santi Phithakkitnukoon,et al.  Sensing Urban Social Geography Using Online Social Networking Data , 2011, The Social Mobile Web.

[38]  Xing Xie,et al.  Collaborative activity recognition via check-in history , 2011, LBSN '11.

[39]  Ryuichi Kitamura,et al.  Generation of Synthetic Daily Activity-Travel Patterns , 1997 .

[40]  Carlo Ratti,et al.  Real-Time Urban Monitoring Using Cell Phones: A Case Study in Rome , 2011, IEEE Transactions on Intelligent Transportation Systems.

[41]  Satish V. Ukkusuri,et al.  Understanding urban human activity and mobility patterns using large-scale location-based data from online social media , 2013, UrbComp '13.

[42]  Chandra R. Bhat,et al.  Activity-based Travel Demand Analysis , 2011 .

[43]  Kyumin Lee,et al.  Exploring Millions of Footprints in Location Sharing Services , 2011, ICWSM.

[44]  Will Recker,et al.  Activity Pattern Recognition by Using Support Vector Machines with Multiple Classes , 2013 .

[45]  Christian Schneider,et al.  Spatiotemporal Patterns of Urban Human Mobility , 2012, Journal of Statistical Physics.

[46]  Henry A. Kautz,et al.  Learning and inferring transportation routines , 2004, Artif. Intell..

[47]  Chandra R. Bhat,et al.  Comprehensive Econometric Microsimulator for Daily Activity-Travel Patterns , 2004 .

[48]  A. Cohen,et al.  Maximum Likelihood Estimation in the Weibull Distribution Based On Complete and On Censored Samples , 1965 .

[49]  Toshiyuki Yamamoto,et al.  Florida activity mobility simulator - Overview and preliminary validation results , 2005 .

[50]  Zoubin Ghahramani,et al.  Learning Dynamic Bayesian Networks , 1997, Summer School on Neural Networks.

[51]  Satish V. Ukkusuri,et al.  Location Contexts of User Check-Ins to Model Urban Geo Life-Style Patterns , 2015, PloS one.

[52]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.