论文信息 - Extended Variable Speed Limit control using Multi-agent Reinforcement Learning

Extended Variable Speed Limit control using Multi-agent Reinforcement Learning

Variable Speed Limit (VSL) is a traffic control approach that optimises the mainstream traffic on motorways. Reinforcement Learning approach to VSL has been shown to achieve improvements in controlling the mainstream traffic bottleneck on motorways. However, single-agent VSL, applied to a shorter motorway segment, can produce a discontinuity in traffic flow by causing the significant differences in speeds between the uncontrolled upstream flow and the flow affected by VSL. A multi-agent control strategy can be used to overcome these problems by assigning speed limits in multiple upstream motorway sections enabling smoother speed transition. In this paper, we proposed a novel approach to set up multi-agent RLbased VSL by using the W-Learning algorithm (WL-VSL), in which two agents control two segments in the lead up to the congested area. The reward function for each agent is based on the agent’s local performance as well as the downstream bottleneck. WL-VSL is evaluated in a microscopic simulation on two traffic scenarios using dynamic and static traffic demand. We show that WL-VSL outperforms base cases (no control, single agent, and two independent agents) with the improvement of traffic parameters up to 18 %.

[1] Bin Ran,et al. A New Solution for Freeway Congestion: Cooperative Speed Limit Control Using Distributed Reinforcement Learning , 2019, IEEE Access.

[2] Stef Smulders,et al. Control of freeway traffic flow by variable speed signs , 1990 .

[3] R. Noland. Relationships between highway capacity and induced vehicle travel , 2001 .

[4] Satish V. Ukkusuri,et al. Accounting for dynamic speed limit control in a stochastic traffic environment: a reinforcement learning approach , 2014 .

[5] Vinny Cahill,et al. Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems , 2009, 2009 Third IEEE International Conference on Self-Adaptive and Self-Organizing Systems.

[6] Maxim Raya,et al. TraCI: an interface for coupling road traffic and network simulators , 2008, CNS '08.

[7] E. van den Hoogen,et al. Control by variable speed signs: results of the Dutch experiment , 1994 .

[8] T Schmidt-Dumont,et al. A case for the adoption of decentralised reinforcement learning for the control of traffic flow on South African highways , 2019 .

[9] M. Papageorgiou,et al. Effects of Variable Speed Limits on Motorway Traffic Flow , 2008 .

[10] Markos Papageorgiou,et al. Feedback-Based Mainstream Traffic Flow Control for Multiple Bottlenecks on Motorways , 2015, IEEE Transactions on Intelligent Transportation Systems.

[11] Wei Wang,et al. Reinforcement Learning-Based Variable Speed Limit Control Strategy to Reduce Traffic Congestion at Freeway Recurrent Bottlenecks , 2017, IEEE Transactions on Intelligent Transportation Systems.

[12] Bart De Schutter,et al. Optimal coordination of variable speed limits to suppress shock waves , 2005, IEEE Transactions on Intelligent Transportation Systems.

[13] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14] Markos Papageorgiou,et al. Local Feedback-Based Mainstream Traffic Flow Control on Motorways Using Variable Speed Limits , 2011, IEEE Transactions on Intelligent Transportation Systems.

[15] Markos Papageorgiou,et al. Microsimulation Analysis of Practical Aspects of Traffic Control With Variable Speed Limits , 2015, IEEE Transactions on Intelligent Transportation Systems.

[16] Pravin Varaiya,et al. A new approach for combined freeway Variable Speed Limits and Coordinated Ramp Metering , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[17] Mark Humphreys,et al. Action selection methods using reinforcement learning , 1997 .

[18] Robert Ziolkowski,et al. Effectiveness of Automatic Section Speed Control System Operating on National Roads in Poland , 2019, PROMET - Traffic&Transportation.

[19] Matthijs T. J. Spaan,et al. Traffic flow optimization: A reinforcement learning approach , 2016, Eng. Appl. Artif. Intell..

[20] Martin Gregurić,et al. Simulational analysis of two controllers for variable speed limit control , 2019 .

[21] Edouard Ivanjko,et al. A Comparison of Different State Representations for Reinforcement Learning Based Variable Speed Limit Control , 2018, 2018 26th Mediterranean Conference on Control and Automation (MED).

[22] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.

[23] Chris Nash,et al. Policy instruments for reducing greenhouse gas emissions from transport in Europe , 2010 .

[24] Andreas Hegyi,et al. Distributed Controller Design Approach to Dynamic Speed Limit Control against Shockwaves on Freeways , 2008 .

[25] Alexandre M. Bayen,et al. Lagrangian Control through Deep-RL: Applications to Bottleneck Decongestion , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[26] Stijn Daniels,et al. Safety effects of dynamic speed limits on motorways. , 2018, Accident; analysis and prevention.

[27] Per Strömgren,et al. Harmonization with Variable Speed Limits on Motorways , 2016 .

[28] Bruce Hellinga,et al. Variable Speed Limits: Safety and Operational Impacts of a Candidate Control Strategy for Freeway Applications , 2007, IEEE Transactions on Intelligent Transportation Systems.

[29] Bart De Schutter,et al. A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).