Reinforcement Learning Approach for Coordinated Passenger Inflow Control of Urban Rail Transit in Peak Hours

Abstract In peak hours, when the limited transportation capacity of urban rail transit is not adequate enough to meet the travel demands, the density of the passengers waiting at the platform can exceed the critical density of the platform. Coordinated passenger inflow control strategy is required to adjust/meter the inflow volume and relieve some of the demand pressure at crowded metro stations so as to ensure both operational efficiency and safety at such stations for all passengers. However, such strategy is usually developed by the operation staff at each station based on their practical working experience. As such, the best strategy/decision cannot always be made and sometimes can even be highly undesirable due to their inability to account for the dynamic performance of all metro stations in the entire rail transit network. In this paper, a new reinforcement learning-based method is developed to optimize the inflow volume during a certain period of time at each station with the aim of minimizing the safety risks imposed on passengers at the metro stations. Basic principles and fundamental components of the reinforcement learning, as well as the reinforcement learning-based problem-specific algorithm are presented. The simulation experiment carried out on a real-world metro line in Shanghai is constructed to test the performance of the approach. Simulation results show that the reinforcement learning-based inflow volume control strategy is highly effective in minimizing the safety risks by reducing the frequency of passengers being stranded. Additionally, the strategy also helps to relieve the passenger congestion at certain stations.

[1]  Ricardo Giesen,et al.  How much can holding and/or limiting boarding improve transit performance? , 2012 .

[2]  Haiying Li,et al.  Metro passenger flow control with station-to-station cooperation based on stop-skipping and boarding limiting , 2017 .

[3]  Peng Gao,et al.  A simulation model for estimating train and passenger delays in large-scale rail transit networks , 2012 .

[4]  Ning Ding,et al.  Simulation of pedestrian flow based on cellular automata: A case of pedestrian crossing street at section in China , 2013 .

[5]  Mohamed A. Khamis,et al.  Adaptive multi-objective reinforcement learning with hybrid exploration for traffic signal control based on cooperative multi-agent framework , 2014, Eng. Appl. Artif. Intell..

[6]  Fernando Fernández,et al.  MARL-Ped: A multi-agent reinforcement learning based framework to simulate pedestrian groups , 2014, Simul. Model. Pract. Theory.

[7]  Dietmar Bauer,et al.  Macroscopic pedestrian flow simulation for designing crowd control measures in public transport after special events , 2007, SCSC.

[8]  Abhijit Gosavi,et al.  Reinforcement Learning: A Tutorial Survey and Recent Advances , 2009, INFORMS J. Comput..

[9]  Matthijs T. J. Spaan,et al.  Traffic flow optimization: A reinforcement learning approach , 2016, Eng. Appl. Artif. Intell..

[10]  Haiying Li,et al.  Capacity-oriented passenger flow control under uncertain demand: Algorithm development and real-world case study , 2016 .

[11]  Jun Liu,et al.  Analysis of subway station capacity with the use of queueing theory , 2014 .

[12]  Antonio Placido,et al.  Methodology for Determining Dwell Times Consistent with Passenger Flows in the Case of Metro Services , 2017 .

[13]  D. S. Berry,et al.  PEAK-PERIOD CONTROL OF A FREEWAY SYSTEM-SOME THEORETICAL INVESTIGATIONS , 1965 .

[14]  Ali Selamat,et al.  Modeling of route planning system based on Q value-based dynamic programming with multi-agent reinforcement learning algorithms , 2014, Eng. Appl. Artif. Intell..

[15]  Marijan Žura,et al.  Reinforcement learning approach for train rescheduling on a single-track railway , 2016 .

[16]  Zhibin Jiang,et al.  A Turn-back Track Constraint Train Scheduling Algorithm on a Multi-interval Rail Transit Line , 2014 .

[17]  T. Urbanik,et al.  Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .

[18]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[19]  Satish V. Ukkusuri,et al.  Accounting for dynamic speed limit control in a stochastic traffic environment: a reinforcement learning approach , 2014 .

[20]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[21]  Yorgos J. Stephanedes,et al.  COMPARATIVE EVALUATION OF ADAPTIVE AND NEURAL-NETWORK EXIT DEMAND PREDICTION FOR FREEWAY CONTROL , 1994 .

[22]  Ching-Hsien Hsu,et al.  Evaluating rail transit timetable using big passengers' data , 2016, J. Comput. Syst. Sci..

[23]  Xi Jiang,et al.  A multiagent-based model for pedestrian simulation in subway stations , 2017, Simul. Model. Pract. Theory.

[24]  Jia Wan,et al.  Research on evacuation in the subway station in China based on the Combined Social Force Model , 2014 .