Open PFLOW: Creation and evaluation of an open dataset for typical people mass movement in urban areas

Abstract Understanding people flow at a citywide level is critical for urban planning and commercial development. Thanks to the ubiquity of human location tracking devices, many studies on people mass movement with mobility logs have been conducted. However, high cost and severe privacy policy constraints still complicate utilization of these data in practice. There is no dataset that anyone can freely access, use, modify, and share for any purpose. To tackle this problem, we propose a novel dataset creation approach (called Open PFLOW) that continuously reports the spatiotemporal positions of all individual’s in urban areas based on open data. With fully consideration of the privacy protection, each entity in our dataset does not match the actual movement of any real person, so that the dataset can be totally open to public as part of data infrastructure. Because the result is shown at a disaggregate level, users can freely modify, process, and visualize the dataset for any purpose. We evaluate the accuracy of the dataset by comparing it with commercial datasets and traffic census indicates that it has a high correlation with mesh population and link-based traffic volume.

[1]  Marco Fiore,et al.  Estimating urban mobility with open data: A case study in Bologna , 2016, 2016 IEEE International Smart Cities Conference (ISC2).

[2]  Jean-François Paiement,et al.  A Generative Model of Urban Activities from Cellular Data , 2018, IEEE Transactions on Intelligent Transportation Systems.

[3]  Thomas Engel,et al.  Luxembourg SUMO Traffic (LuST) Scenario: 24 hours of mobility for vehicular networking research , 2015, 2015 IEEE Vehicular Networking Conference (VNC).

[4]  L. Mihaylova,et al.  A particle filter for freeway traffic estimation , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[5]  Yoshihide Sekimoto,et al.  PFlow: Reconstructing People Flow Recycling Large-Scale Social Survey Data , 2011, IEEE Pervasive Computing.

[6]  Akshay Vij,et al.  When is big data big enough? Implications of using GPS-based surveys for travel demand analysis , 2015 .

[7]  Antonio Lima,et al.  Personalized routing for multitudes in smart cities , 2015, EPJ Data Science.

[8]  T. Toivonen,et al.  Modelling the potential effect of shared bicycles on public transport travel times in Greater Helsinki: An open data approach☆ , 2013 .

[9]  Joseph Ferreira,et al.  Activity-Based Human Mobility Patterns Inferred from Mobile Phone Data: A Case Study of Singapore , 2017, IEEE Transactions on Big Data.

[10]  Guangnian Xiao,et al.  Detecting trip purposes from smartphone-based travel surveys with artificial neural networks and particle swarm optimization , 2016 .

[11]  Dinesh Manocha,et al.  Virtualized Traffic: Reconstructing Traffic Flows from Discrete Spatio-Temporal Data , 2009, 2009 IEEE Virtual Reality Conference.

[12]  Gang Pan,et al.  Bike sharing station placement leveraging heterogeneous urban open data , 2015, UbiComp.

[13]  Marco Fiore,et al.  Population estimation from mobile network traffic metadata , 2016, 2016 IEEE 17th International Symposium on A World of Wireless, Mobile and Multimedia Networks (WoWMoM).

[14]  Kay W. Axhausen,et al.  Agent-Based Demand-Modeling Framework for Large-Scale Microsimulations , 2006 .

[15]  Moshe Ben-Akiva,et al.  PII: S0965-8564(99)00043-9 , 2000 .

[16]  Yoshihide Sekimoto,et al.  Transportation melting pot Dhaka: road-link based traffic volume estimation from sparse CDR data , 2014, Urb-IoT.

[17]  Carlo Ratti,et al.  Mobile Landscapes: Using Location Data from Cell Phones for Urban Analysis , 2006 .

[18]  Ana L. C. Bazzan,et al.  A review on agent-based technology for traffic and transportation , 2013, The Knowledge Engineering Review.

[19]  Charles M. Macal,et al.  Managing Business Complexity: Discovering Strategic Solutions with Agent-Based Modeling and Simulation , 2007 .

[20]  Carlo Ratti,et al.  Does Urban Mobility Have a Daily Routine? Learning from the Aggregate Data of Mobile Networks , 2010 .

[21]  Pamela Murray-Tuite,et al.  An agent-based modeling system for travel demand simulation for hurricane evacuation , 2014 .

[22]  Nicholas G. Polson,et al.  Deep learning for short-term traffic flow prediction , 2016, 1604.04527.

[23]  Francesco Ciari,et al.  Large-scale agent-based travel demand optimization applied to Switzerland, including mode choice , 2010 .

[24]  Toyotaro Suzumura,et al.  A high performance multi-modal traffic simulation platform and its case study with the Dublin city , 2015, 2015 Winter Simulation Conference (WSC).

[25]  R. Cervero,et al.  TRAVEL DEMAND AND THE 3DS: DENSITY, DIVERSITY, AND DESIGN , 1997 .

[26]  Dongjin Song,et al.  High resolution population estimates from telecommunications data , 2015, EPJ Data Science.

[27]  Marco Fiore,et al.  Generation and Analysis of a Large-Scale Urban Vehicular Mobility Dataset , 2014, IEEE Transactions on Mobile Computing.

[28]  Daisuke Fukuda,et al.  Updating origin–destination matrices with aggregated data of GPS traces , 2016 .

[29]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[30]  Carlo Ratti,et al.  Cellular Census: Explorations in Urban Data Collection , 2007, IEEE Pervasive Computing.

[31]  Gavin McArdle,et al.  City-scale traffic simulation from digital footprints , 2012, UrbComp '12.

[32]  Ryuichi Kitamura,et al.  Micro-simulation of daily activity-travel patterns for travel demand forecasting , 2000 .

[33]  Liang Liu,et al.  Estimating Origin-Destination Flows Using Mobile Phone Location Data , 2011, IEEE Pervasive Computing.

[34]  Valeria V. Krzhizhanovskaya,et al.  Data-driven Travel Demand Modelling and Agent-based Traffic Simulation in Amsterdam Urban Area , 2016, ICCS.

[35]  Carlo Ratti,et al.  Computing Urban Mobile Landscapes Through Monitoring Population Density Based On Cell-phone Chatting , 2008 .

[36]  Marta C. González,et al.  The path most traveled: Travel demand estimation using big data resources , 2015, Transportation Research Part C: Emerging Technologies.

[37]  Luis Miguel Romero Pérez,et al.  Traffic Flow Estimation Models Using Cellular Phone Data , 2012, IEEE Transactions on Intelligent Transportation Systems.

[38]  Peter R. Stopher,et al.  Search for a global positioning system device to measure person travel , 2008 .

[39]  Xuan Song,et al.  Particle filter for real-time human mobility prediction following unprecedented disaster , 2016, SIGSPATIAL/GIS.

[40]  Yu Liu,et al.  The promises of big data and small data for travel behavior (aka human mobility) analysis , 2016, Transportation research. Part C, Emerging technologies.

[41]  Xuan Song,et al.  Modeling and probabilistic reasoning of population evacuation during large-scale disaster , 2013, KDD.

[42]  Kay W. Axhausen,et al.  An event-driven queue-based microsimulation of traffic flow , 2006 .