Fusing Mobile Phone and Travel Survey Data to Model Urban Activity Dynamics

A key issue to understand urban system is to characterize the activity dynamics in a city—when, where, what, and how activities happen in a city. To better understand the urban activity dynamics, city-wide and multiday activity participation sequence data, namely, activity chain as well as suitable spatiotemporal models, are needed. The commonly used household travel survey data in activity analysis suffers from limited sample size and temporal coverage. The emergence of large-scale spatiotemporal data in urban areas, such as mobile phone data, provides a new opportunity to infer urban activities and the underlying dynamics. However, the challenge is the absence of labeled activity information in mobile phone data. Consequently, how to fuse the useful information in household survey data and mobile phone data to build city-wide, multiday, and all-time activity chains becomes an important research question. Moreover, the multidimension structure of the activity data (e.g., location, start time, duration, type) makes the extraction of spatiotemporal activity patterns another difficult problem. In this study, the authors first introduce an activity chain inference model based on tensor decomposition to infer the missing activity labels in large-scale and multiday activity data, and then develop a spatiotemporal event clustering model based on DBSCAN, called STE-DBSCAN, to identify the spatiotemporal activity patterns. The proposed approaches achieved good accuracy and produced patterns with a high level of interpretability.

[1]  Chenghu Zhou,et al.  Windowed nearest neighbour method for mining spatio-temporal clusters in the presence of noise , 2010, Int. J. Geogr. Inf. Sci..

[2]  Peter R. Stopher,et al.  A process for trip purpose imputation from Global Positioning System data , 2013 .

[3]  Dino Pedreschi,et al.  Time-focused clustering of trajectories of moving objects , 2006, Journal of Intelligent Information Systems.

[4]  Yasuo Asakura,et al.  Behavioural data mining of transit smart card data: A data fusion approach , 2014 .

[5]  Panagiotis Papaioannou,et al.  Utilizing Social Media in Transport Planning and Public Transit Quality: Survey of Literature , 2018 .

[6]  Cao Jing,et al.  Approaches for scaling DBSCAN algorithm to large spatial databases , 2000 .

[7]  Rasmus Bro,et al.  Improving the speed of multi-way algorithms:: Part I. Tucker3 , 1998 .

[8]  Morten Mørup,et al.  Applications of tensor (multiway array) factorizations and decompositions in data mining , 2011, WIREs Data Mining Knowl. Discov..

[9]  Ahmad Tavassoli,et al.  Public transport trip purpose inference using smart card fare data , 2018 .

[10]  Chao Chen,et al.  TripImputor: Real-Time Imputing Taxi Trip Purpose Leveraging Multi-Sourced Urban Data , 2018, IEEE Transactions on Intelligent Transportation Systems.

[11]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[12]  Yee Leung,et al.  Applying mobile phone data to travel behaviour research: A literature review , 2017 .

[13]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[14]  Esteban Moro,et al.  Impact of human activity patterns on the dynamics of information diffusion. , 2009, Physical review letters.

[15]  Will Recker,et al.  Mining activity pattern trajectories and allocating activities in the network , 2015 .

[16]  Kate Revoredo,et al.  A Combined Solution for Real-Time Travel Mode Detection and Trip Purpose Prediction , 2019, IEEE Transactions on Intelligent Transportation Systems.

[17]  H. Kiers Weighted least squares fitting using ordinary least squares algorithms , 1997 .

[18]  Lelitha Vanajakshi,et al.  Pattern-Based Time-Discretized Method for Bus Travel Time Prediction , 2017 .

[19]  Christian Schneider,et al.  Spatiotemporal Patterns of Urban Human Mobility , 2012, Journal of Statistical Physics.

[20]  Xiaolei Ma,et al.  Mining smart card data for transit riders’ travel patterns , 2013 .

[21]  Bin Ran,et al.  Tensor based missing traffic data completion with spatial–temporal correlation , 2016 .

[22]  Rasmus Bro,et al.  The N-way Toolbox for MATLAB , 2000 .

[23]  Chenghu Zhou,et al.  Density-based clustering for data containing two types of points , 2015, Int. J. Geogr. Inf. Sci..

[24]  Ryuichi Kitamura,et al.  Panel Analysis in Transportation Planning: An Overview , 1990 .

[25]  Bin Ran,et al.  Trip-Chain-Based Travel-Mode-Shares-Driven Framework using Cellular Signaling Data and Web-Based Mapping Service Data , 2019 .

[26]  Yan Huang,et al.  Correlation Analysis of Spatial Time Series Datasets: A Filter-and-Refine Approach , 2003, PAKDD.

[27]  Lijun Sun,et al.  A Bayesian tensor decomposition approach for spatiotemporal traffic data imputation , 2019, Transportation Research Part C: Emerging Technologies.

[28]  Kees Maat,et al.  Deriving and validating trip purposes and travel modes for multi-day GPS-based travel surveys: A large-scale application in the Netherlands , 2009 .

[29]  Yingling Fan,et al.  Trip chain extraction using smartphone-collected trajectory data , 2019 .

[30]  Hong Yang,et al.  Modeling and Analysis of Daily Driving Patterns of Taxis in Reshuffled Ride-Hailing Service Market , 2019, Journal of Transportation Engineering, Part A: Systems.

[31]  Haris N. Koutsopoulos,et al.  Inferring patterns in the multi-week activity sequences of public transport users , 2016 .