PressLight: Learning Max Pressure Control to Coordinate Traffic Signals in Arterial Network

Traffic signal control is essential for transportation efficiency in road networks. It has been a challenging problem because of the complexity in traffic dynamics. Conventional transportation research suffers from the incompetency to adapt to dynamic traffic situations. Recent studies propose to use reinforcement learning (RL) to search for more efficient traffic signal plans. However, most existing RL-based studies design the key elements - reward and state - in a heuristic way. This results in highly sensitive performances and a long learning process. To avoid the heuristic design of RL elements, we propose to connect RL with recent studies in transportation research. Our method is inspired by the state-of-the-art method max pressure (MP) in the transportation field. The reward design of our method is well supported by the theory in MP, which can be proved to be maximizing the throughput of the traffic network, i.e., minimizing the overall network travel time. We also show that our concise state representation can fully support the optimization of the proposed reward function. Through comprehensive experiments, we demonstrate that our method outperforms both conventional transportation approaches and existing learning-based methods.

[1]  Zhenhui Li,et al.  IntelliLight: A Reinforcement Learning Approach for Intelligent Traffic Light Control , 2018, KDD.

[2]  Baher Abdulhai,et al.  An agent-based learning towards decentralized and coordinated traffic signal control , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[3]  Marco Wiering,et al.  Multi-Agent Reinforcement Learning for Traffic Light control , 2000 .

[4]  Zihan Zhou,et al.  CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario , 2019, WWW.

[5]  Kevin Lee,et al.  Signal Timing Manual , 2015 .

[6]  Yong Li,et al.  Learning Phase Competition for Traffic Signal Control , 2019, CIKM.

[7]  S. Spraggs,et al.  Traffic Engineering , 2000 .

[8]  Thomas Urbanik,et al.  Signal Timing Manual - Second Edition , 2015 .

[9]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[10]  Sophie Midenet,et al.  The real-time urban traffic control system CRONOS: Algorithm and experiments , 2006 .

[11]  Matthew E. Taylor,et al.  Distributed learning and multi-objectivity in traffic light control , 2014, Connect. Sci..

[12]  Jennie Lioris,et al.  Adaptive Max Pressure Control of Network of Signalized Intersections , 2016 .

[13]  Frans A. Oliehoek,et al.  Coordinated Deep Reinforcement Learners for Traffic Light Control , 2016 .

[14]  Li Li,et al.  Traffic signal timing via deep reinforcement learning , 2016, IEEE/CAA Journal of Automatica Sinica.

[15]  F. Webster TRAFFIC SIGNAL SETTINGS , 1958 .

[16]  Roger P. Roess Traffic engineering / Roger P. Roess, Elena S. Prassas, William R. McShane , 2004 .

[17]  John D. C. Little,et al.  MAXBAND : a versatile program for setting signals on arteries and triangular networks , 1981 .

[18]  Nathan H. Gartner,et al.  OPAC: A DEMAND-RESPONSIVE STRATEGY FOR TRAFFIC SIGNAL CONTROL , 1983 .

[19]  Noe Casas,et al.  Deep Deterministic Policy Gradient for Urban Traffic Light Control , 2017, ArXiv.

[20]  Peter Corcoran,et al.  Traffic Light Control Using Deep Policy-Gradient and Value-Function Based Reinforcement Learning , 2017, ArXiv.

[21]  Pravin Varaiya,et al.  The Max-Pressure Controller for Arbitrary Networks of Signalized Intersections , 2013 .

[22]  Suvrajeet Sen,et al.  Controlled Optimization of Phases at an Intersection , 1997, Transp. Sci..

[23]  Baher Abdulhai,et al.  Reinforcement learning for true adaptive traffic signal control , 2003 .

[24]  D I Robertson,et al.  TRANSYT: A TRAFFIC NETWORK STUDY TOOL , 1969 .

[25]  Pravin Varaiya,et al.  Max pressure control of a network of signalized intersections , 2013 .

[26]  Jean-Loup Farges,et al.  THE PRODYN REAL TIME TRAFFIC ALGORITHM , 1983 .

[27]  Vikash V. Gayah,et al.  A Survey on Traffic Signal Control Methods , 2019, ArXiv.

[28]  Baher Abdulhai,et al.  Multiagent Reinforcement Learning for Integrated Network of Adaptive Traffic Signal Controllers (MARLIN-ATSC): Methodology and Large-Scale Application on Downtown Toronto , 2013, IEEE Transactions on Intelligent Transportation Systems.

[29]  Nan Xu,et al.  Diagnosing Reinforcement Learning for Traffic Signal Control , 2019, ArXiv.

[30]  Monireh Abdoos,et al.  Holonic multi-agent system for traffic signals control , 2013, Eng. Appl. Artif. Intell..

[31]  T. Urbanik,et al.  Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .

[32]  Jean-Loup Farges,et al.  THE PRODYN REAL TIME TRAFFIC ALGORITHM , 1983 .

[33]  Shimon Whiteson,et al.  Multiagent Reinforcement Learning for Urban Traffic Control Using Coordination Graphs , 2008, ECML/PKDD.

[34]  Jim Duggan,et al.  An Experimental Review of Reinforcement Learning Algorithms for Adaptive Traffic Signal Control , 2016, Autonomic Road Transport Support Systems.