Ambulance Dispatch via Deep Reinforcement Learning

In this paper, we solve the ambulance dispatch problem with a reinforcement learning oriented strategy. The ambulance dispatch problem is defined as deciding which ambulance to pick up which patient. Traditional studies on ambulance dispatch mainly focus on predefined protocols and are verified on simple simulation data, which are not flexible enough when facing the dynamically changing real-world cases. In this paper, we propose an efficient ambulance dispatch method based on the reinforcement learning framework, i.e., Multi-Agent Q-Network with Experience Replay(MAQR). Specifically, we firstly reformulate the ambulance dispatch problem with a multi-agent reinforcement learning framework, and then design the state, action, and reward function correspondingly for the framework. Thirdly, we design a simulator that controls ambulance status, generates patient requests and interacts with ambulances. Finally, we design extensive experiments to demonstrate the superiority of the proposed method.

[1]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[2]  Yanjie Fu,et al.  Modeling the Interaction Coupling of Multi-View Spatiotemporal Contexts for Destination Prediction , 2018, SDM.

[3]  Ming Zhou,et al.  Mean Field Multi-Agent Reinforcement Learning , 2018, ICML.

[4]  R. Miglio,et al.  Emergency ambulance dispatches and apparent temperature: a time series analysis in Emilia-Romagna, Italy. , 2011, Environmental research.

[5]  Zhe Xu,et al.  Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning , 2018, KDD.

[6]  Zhenhui Li,et al.  IntelliLight: A Reinforcement Learning Approach for Intelligent Traffic Light Control , 2018, KDD.

[7]  Antonio Gasparrini,et al.  EMERGENCY AMBULANCE DISPATCHES AND APPARENT TEMPERATURE: A TIME-SERIES ANALYSIS WITH DISTRIBUTED LAG NONLINEAR MODELS , 2011 .

[8]  B. L. William Wong,et al.  Ambulance Dispatch Complexity and Dispatcher Decision Strategies: Implications for Interface Design , 2004, APCHI.

[9]  M. Stanković Multi-agent reinforcement learning , 2016 .

[10]  Michel Gendreau,et al.  A dynamic model and parallel tabu search heuristic for real-time ambulance relocation , 2001, Parallel Comput..

[11]  G Laporte,et al.  An emergency vehicle dispatching system for an electric utility in Chile , 1999, J. Oper. Res. Soc..

[12]  Richard L. Church,et al.  The maximal covering location problem , 1974 .

[13]  S. Bhulai,et al.  A dynamic ambulance management model for rural areas , 2017, Health care management science.

[14]  J. Goldberg Operations Research Models for the Deployment of Emergency Services Vehicles , 2004 .

[15]  Peng Peng,et al.  Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games , 2017, 1703.10069.

[16]  John J. Bernardo,et al.  Developing and validating a decision support system for locating emergency medical vehicles in Louisville, Kentucky , 1994 .

[17]  Zati Aqmar Zaharudin,et al.  Finding shortest path of the ambulance routing: Interface of A∗ algorithm using C# programming , 2012, 2012 IEEE Symposium on Humanities, Science and Engineering Research.

[18]  Gilbert Laporte,et al.  Ambulance location and relocation models , 2000, Eur. J. Oper. Res..

[19]  Weinan Zhang,et al.  MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence , 2017, AAAI.

[20]  Q. Henry Wu,et al.  Multi-objective optimization by reinforcement learning for power system dispatch and voltage stability , 2010, 2010 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT Europe).

[21]  M. Morabito,et al.  Urban morbidity in summer: ambulance dispatch data, periodicity and weather , 2012 .

[22]  Hao Liu,et al.  AutoFS: Automated Feature Selection via Diversity-aware Interactive Reinforcement Learning , 2020, ArXiv.

[23]  David B. Shmoys,et al.  Mathematical Programming Guides Air-Ambulance Routing at Ornge , 2013, Interfaces.

[24]  Dorian Kodelja,et al.  Multiagent cooperation and competition with deep reinforcement learning , 2015, PloS one.

[25]  Yanjie Fu,et al.  Automating Feature Subspace Exploration via Multi-Agent Reinforcement Learning , 2019, KDD.