Reinforcement learning-based multi-agent system for network traffic signal control

Scheduling traffic signals in multi-intersection vehicular networks is a challenging application of artificial intelligence. This paper introduces a novel use of a multi-agent system and reinforcement learning (RL) framework to obtain an efficient traffic signal control policy. The policy aims to minimise the average delay, congestion and the likelihood of intersection cross-blocking. A five-intersection traffic network is studied in which each intersection is governed by an autonomous intelligent agent. Two types of agents are employed: a central agent and outbound agents. The outbound agents schedule traffic signals by following the longest-queue-first (LQF) algorithm, which has been proved to guarantee stability and fairness, and collaborate with the central agent by providing it with local traffic statistics. The central agent learns a value function driven by its own local traffic conditions and those of its neighbours. The proposed methodology uses the Q-learning algorithm with a feedforward neural network for value function approximation. Experimental results demonstrate the advantages of multi-agent RL-based control over LQF-governed isolated single-intersection control, paving the way for efficient distributed traffic signal control in complex settings.
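The two ingredients named above, LQF scheduling at the outbound agents and Q-learning with a feedforward approximator at the central agent, can be illustrated with a minimal sketch. It is not the authors' implementation: the state encoding (a vector of queue lengths), the reward (negative total queue length), the network size, and the names `lqf_phase` and `QNetwork` are all assumptions made for illustration.

```python
import numpy as np

def lqf_phase(queue_lengths):
    """Longest-queue-first: serve the phase whose queue is longest.
    Ties go to the lowest phase index (np.argmax behaviour)."""
    return int(np.argmax(queue_lengths))

class QNetwork:
    """Hypothetical one-hidden-layer feedforward net: state -> Q-value per action."""

    def __init__(self, n_inputs, n_hidden, n_actions, lr=0.01, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.1, (n_hidden, n_inputs))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 0.1, (n_actions, n_hidden))
        self.b2 = np.zeros(n_actions)
        self.lr = lr

    def forward(self, state):
        hidden = np.tanh(self.W1 @ state + self.b1)
        return self.W2 @ hidden + self.b2, hidden

    def update(self, state, action, reward, next_state, gamma=0.9):
        """One Q-learning step; the bootstrapped target is treated as a constant."""
        q, hidden = self.forward(state)
        q_next, _ = self.forward(next_state)
        td_error = reward + gamma * np.max(q_next) - q[action]
        # Backpropagate through the chosen action's output only.
        grad_hidden = td_error * self.W2[action] * (1.0 - hidden ** 2)
        self.W2[action] += self.lr * td_error * hidden
        self.b2[action] += self.lr * td_error
        self.W1 += self.lr * np.outer(grad_hidden, state)
        self.b1 += self.lr * grad_hidden
        return float(td_error)

# Assumed state: normalised queue lengths on four approaches of the central
# intersection plus one summary value from each of four outbound agents.
state = np.array([0.3, 0.7, 0.2, 0.5, 0.4, 0.1, 0.6, 0.2])
next_state = np.array([0.3, 0.4, 0.3, 0.5, 0.4, 0.2, 0.5, 0.2])
net = QNetwork(n_inputs=8, n_hidden=16, n_actions=4)
action = lqf_phase(state[:4])            # e.g. start from the LQF choice
reward = -float(np.sum(next_state[:4]))  # assumed reward: negative total queue
td = net.update(state, action, reward, next_state)
```

In this sketch the outbound agents would simply call `lqf_phase` on their own queues each cycle, while the central agent would pick actions from `net.forward(state)` (for instance epsilon-greedily) and call `update` after observing each transition.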
