论文信息 - Traffic Signal Control Using Deep Reinforcement Learning with Multiple Resources of Rewards

Traffic Signal Control Using Deep Reinforcement Learning with Multiple Resources of Rewards

Intelligent traffic signal control is an effective way to solve the traffic congestion problem in the real world. One trend is to use Deep Reinforcement Learning (DRL) to control traffic signals based on the snapshots of traffic states. While most of the research used single numeric reward to frame multiple objectives, such as minimizing waiting time and waiting queue length, they overlooked that one reward for multiple objectives misleads agents taking wrong actions in certain states, which causes following traffic fluctuation. In this paper, we propose a DRL-based framework that uses multiple rewards for multiple objectives. Our framework aims to solve the difficulty of assessing behaviours by single numeric reward and control traffic flows more steadily. We evaluated our approach on both synthetic traffic scenarios and a real-world traffic dataset in Toronto. The results show that our approach outperformed single reward-based approaches.

Azzedine Boukerche | Dunhao Zhong | A. Boukerche | Dunhao Zhong

[1] T. Urbanik,et al. Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .

[2] Zhenhui Li,et al. IntelliLight: A Reinforcement Learning Approach for Intelligent Traffic Light Control , 2018, KDD.

[3] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4] Baher Abdulhai,et al. An agent-based learning towards decentralized and coordinated traffic signal control , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[5] Walid Gomaa,et al. Multi-objective traffic light control system based on Bayesian probability interpretation , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.

[6] Zhang Yi,et al. Multiobjective Reinforcement Learning for Traffic Signal Control Using Vehicular Ad Hoc Network , 2010, EURASIP J. Adv. Signal Process..

[7] Yun-Pang Flötteröd,et al. Microscopic Traffic Simulation using SUMO , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[8] Mee Hong Ling,et al. A Survey on Reinforcement Learning Models and Algorithms for Traffic Signal Control , 2017, ACM Comput. Surv..

[9] G. Dimitrakopoulos,et al. Intelligent Transportation Systems , 2010, IEEE Vehicular Technology Magazine.

[10] Trevor Reed,et al. INRIX Global Traffic Scorecard , 2019 .

[11] Marco Wiering,et al. Multi-Agent Reinforcement Learning for Traffic Light control , 2000 .

[12] Mohamed A. Khamis,et al. Adaptive multi-objective reinforcement learning with hybrid exploration for traffic signal control based on cooperative multi-agent framework , 2014, Eng. Appl. Artif. Intell..

[13] C Ajluni. INTELLIGENT TRANSPORTATION SYSTEMS HIT THE ROAD , 1997 .

[14] Kunihiko Fukushima,et al. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[15] Frans A. Oliehoek,et al. Coordinated Deep Reinforcement Learners for Traffic Light Control , 2016 .

[16] Matthew E. Taylor,et al. Distributed learning and multi-objectivity in traffic light control , 2014, Connect. Sci..

[17] Jim Duggan,et al. An Experimental Review of Reinforcement Learning Algorithms for Adaptive Traffic Signal Control , 2016, Autonomic Road Transport Support Systems.

[18] Saiedeh N. Razavi,et al. Using a Deep Reinforcement Learning Agent for Traffic Signal Control , 2016, ArXiv.