论文信息 - Reinforcement learning-based multi-agent system for network traffic signal control

Reinforcement learning-based multi-agent system for network traffic signal control

A challenging application of artificial intelligence systems involves the scheduling of traffic signals in multi-intersection vehicular networks. This paper introduces a novel use of a multi-agent system and reinforcement learning (RL) framework to obtain an efficient traffic signal control policy. The latter is aimed at minimising the average delay, congestion and likelihood of intersection cross-blocking. A five-intersection traffic network has been studied in which each intersection is governed by an autonomous intelligent agent. Two types of agents, a central agent and an outbound agent, were employed. The outbound agents schedule traffic signals by following the longest-queue-first (LQF) algorithm, which has been proved to guarantee stability and fairness, and collaborate with the central agent by providing it local traffic statistics. The central agent learns a value function driven by its local and neighbours' traffic conditions. The novel methodology proposed here utilises the Q-Learning algorithm with a feedforward neural network for value function approximation. Experimental results clearly demonstrate the advantages of multi-agent RL-based control over LQF governed isolated single-intersection control, thus paving the way for efficient distributed traffic signal control in complex settings.

[1] Brian Wolshon,et al. Analysis of intersection delay under real-time adaptive signal control , 1999 .

[2] Vinny Cahill,et al. A Collaborative Reinforcement Learning Approach to Urban Traffic Control Optimization , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[3] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[4] Baher Abdulhai,et al. Reinforcement learning for true adaptive traffic signal control , 2003 .

[5] Dipti Srinivasan,et al. Cooperative multi-agent system for coordinated traffic signal control , 2006 .

[6] Simon Haykin,et al. Neural Networks: A Comprehensive Foundation , 1998 .

[7] Wilfred W. Recker,et al. Stochastic adaptive control model for traffic signal systems , 2006 .

[8] J. Albus. A Theory of Cerebellar Function , 1971 .

[9] Lucas Barcelos de Oliveira,et al. Multi-agent Model Predictive Control of Signaling Split in Urban Traffic Networks ∗ , 2010 .

[10] Baher Abdulhai,et al. Automated Adaptive Traffic Corridor Control Using Reinforcement Learning , 2006 .

[11] Mike McDonald,et al. ITS and Traffic Management , 2007 .

[12] Markos Papageorgiou,et al. A multivariable regulator approach to traffic-responsive network-wide signal control , 2000 .

[13] Itamar Elhanany,et al. A Novel Signal-Scheduling Algorithm With Quality-of-Service Provisioning for an Isolated Intersection , 2008, IEEE Transactions on Intelligent Transportation Systems.

[14] Kevin Fehon. Adaptive Traffic Signals Are we missing the boat , 2004 .

[15] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[16] Wang,et al. Review of road traffic control strategies , 2003, Proceedings of the IEEE.

[17] B. Yegnanarayana,et al. Artificial Neural Networks , 2004 .

[18] Marco Wiering,et al. Multi-Agent Reinforcement Learning for Traffic Light control , 2000 .

[19] Chen Cai,et al. Adaptive traffic signal control using approximate dynamic programming , 2009 .

[20] A. Roli. Artificial Neural Networks , 2012, Lecture Notes in Computer Science.

[21] Chris Watkins,et al. Learning from delayed rewards , 1989 .

[22] David S. Broomhead,et al. Multivariable Functional Interpolation and Adaptive Networks , 1988, Complex Syst..

[23] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[24] Carlos Gershenson,et al. Self-organizing Traffic Lights , 2004, Complex Syst..

[25] S. Hyakin,et al. Neural Networks: A Comprehensive Foundation , 1994 .

[26] Stephen P. Mattingly,et al. PERFORMANCE STUDY OF SCOOT TRAFFIC CONTROL SYSTEM WITH NON-IDEAL DETECTORIZATION : FIELD OPERATIONAL TEST IN THE CITY OF ANAHEIM , 2001 .

[27] Markos Papageorgiou,et al. Chapter 11 ITS and Traffic Management , 2007, Transportation.