论文信息 - Intercity Simulation of Human Mobility at Rare Events via Reinforcement Learning

Intercity Simulation of Human Mobility at Rare Events via Reinforcement Learning

Agent-based simulations, combined with large scale mobility data, have been an effective method for understanding urban scale human dynamics. However, collecting such large scale human mobility datasets are especially difficult during rare events (e.g., natural disasters), reducing the performance of agent-based simulations. To tackle this problem, we develop an agent-based model that can simulate urban dynamics during rare events by learning from other cities using inverse reinforcement learning. More specifically, in our framework, agents imitate real human-beings' travel behavior from areas where rare events have occurred in the past (source area) and produce synthetic people movement in different cities where such rare events have never occurred (target area). Our framework contains three main stages: 1) recovering the reward function, where the people's travel patterns and preferences are learned from the source areas; 2) transferring the model of the source area to the target areas; 3) simulating the people movement based on learned model in the target area. We apply our approach in various cities for both normal and rare situations using real-world GPS data collected from more than 1 million people in Japan, and show higher simulation performance than previous models.

[1] Xing Xie,et al. Mining interesting locations and travel sequences from GPS trajectories , 2009, WWW '09.

[2] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.

[3] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.

[4] Davide Anguita,et al. Human Activity Recognition on Smartphones Using a Multiclass Hardware-Friendly Support Vector Machine , 2012, IWAAL.

[5] Yoshihide Sekimoto,et al. Trip reconstruction and transportation mode extraction on low data rate GPS data from mobile phone , 2013 .

[6] Yoshihide Sekimoto,et al. Replicating urban dynamics by generating human-like agents from smartphone GPS data , 2018, SIGSPATIAL/GIS.

[7] Davy Janssens,et al. Modeling Context-Sensitive Dynamic Activity-Travel Behavior Under Conditions of Uncertainty Incorporating Reinforcement Learning, Habit Formation, and Behavioral and Cognitive Adaptation Strategies , 2008 .

[8] Li Song,et al. What is the Human Mobility in a New City: Transfer Mobility Knowledge Across Cities , 2020, WWW.

[9] Xuan Song,et al. CityCoupling: bridging intercity human mobility , 2016, UbiComp.

[10] Albert-László Barabási,et al. Understanding individual human mobility patterns , 2008, Nature.

[11] Pratap S. Prasad,et al. Movement Prediction in Wireless Networks Using Mobility Traces , 2010, 2010 7th IEEE Consumer Communications and Networking Conference.

[12] Qiang Yang,et al. Transfer Knowledge between Cities , 2016, KDD.

[13] Bernhard Schölkopf,et al. Domain Generalization via Invariant Feature Representation , 2013, ICML.

[14] Albert-László Barabási,et al. Limits of Predictability in Human Mobility , 2010, Science.

[15] Jan Peters,et al. Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[16] Yoshihide Sekimoto,et al. City2City: Translating Place Representations across Cities , 2019, SIGSPATIAL/GIS.

[17] Jean-François Paiement,et al. A Generative Model of Urban Activities from Cellular Data , 2018, IEEE Transactions on Intelligent Transportation Systems.

[18] Xuan Song,et al. DeepUrbanMomentum: An Online Deep-Learning System for Short-Term Urban Mobility Prediction , 2018, AAAI.

[19] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[20] Yoshihide Sekimoto,et al. Predicting irregular individual movement following frequent mid-level disasters using location data from smartphones , 2016, SIGSPATIAL/GIS.

[21] Xin Wu,et al. Hierarchical travel demand estimation using multiple data sources: A forward and backward propagation algorithmic framework on a layered computational graph , 2018, Transportation Research Part C: Emerging Technologies.

[22] Marta C. González,et al. The path most traveled: Travel demand estimation using big data resources , 2015, Transportation Research Part C: Emerging Technologies.

[23] Xuan Song,et al. Prediction of human emergency behavior and their mobility following large-scale disaster , 2014, KDD.

[24] Yoshihide Sekimoto,et al. CityFlowFragility: Measuring the Fragility of People Flow in Cities to Disasters using GPS Data Collected from Smartphones , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[25] Xuan Song,et al. DeepTransport: Prediction and Simulation of Human Mobility and Transportation Mode at a Citywide Level , 2016, IJCAI.

[26] Chao Zhang,et al. DeepMove: Predicting Human Mobility with Attentional Recurrent Networks , 2018, WWW.

[27] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.

[28] Yoshihide Sekimoto,et al. Open PFLOW: Creation and evaluation of an open dataset for typical people mass movement in urban areas , 2017 .

[29] Feng Liu,et al. Cross-City Transfer Learning for Deep Spatio-Temporal Prediction , 2018, IJCAI.

[30] Yoshihide Sekimoto,et al. Development of people mass movement simulation framework based on reinforcement learning , 2020 .

[31] Tie-Yan Liu,et al. Sequential Click Prediction for Sponsored Search with Recurrent Neural Networks , 2014, AAAI.

[32] Xing Xie,et al. Learning transportation mode from raw gps data for geographic applications on the web , 2008, WWW.

[33] Yiliang Xiong. Modelling individual and household activity : travel scheduling behaviours in stochastic transportation networks , 2014 .