Integrated optimal control strategies for freeway traffic mixed with connected automated vehicles: A model-based reinforcement learning approach

Abstract This paper proposes an integrated freeway traffic flow control framework that aims to minimize the total travel cost, improve greenness and safety for freeway traffic mixed with connected automated vehicles (CAVs) and regular human-piloted vehicles (RHVs). The proposed framework devises an integrated action of several control strategies such as ramp metering, lane changing control (LCC) for CAVs and lane changing recommendation (LCR) for RHVs, variable speed limit control (VSLC) for CAVs and variable speed limit recommendation (VSLR) for RHVs with minimum safety gap control measures for lane changing and merging maneuvers. The CAVs are assumed to follow the system control instructions fully and immediately. In contrast, the RHVs would make decisions in response to the recommendations disseminated and also the behaviors of CAVs. The compliance rate of drivers to the LCR is captured by the underlying traffic flow model. A set of constraints is imposed to restrict VSLC/VSLR and LCC/LCR measures from changing too frequently or too sharply on both temporal and spatial dimensions to avoid excessive nuisance to passengers and traffic flow instability. A reinforcement learning based solution algorithm is proposed. First, a control parameterization technique is adopted to reduce the dimension of the original optimal control problem to increase computational efficiency. Then, a gradient-free Cross-Entropy-Method based algorithm is used to search the optimal parameters to circumvent the non-differentiability of the traffic flow model. The feasibility and effectiveness of the proposed framework are illustrated via numerical examples for a variety of penetration rates of CAVs under various traffic conditions. A sensitivity analysis is conducted to demonstrate the impacts of several important parameters such as the reaction time of the CAVs. It is found that the integrated control strategy can reduce the total travel cost by reducing the lane changing maneuvers and vehicles queuing at the bottleneck meanwhile smooth the traffic flow and suppress the adverse impact of shockwaves. The effect of ramp metering is not significant when the penetration rate of CAVs is high enough. Speed harmonization (with minimum gap control) in conjunction with LCC/LCR would be a better integrated control strategy under high penetration rate of CAVs.

[1]  Soyoung Ahn,et al.  Variable Speed Limit Control at Fixed Freeway Bottlenecks Using Connected Vehicles , 2017 .

[2]  Markos Papageorgiou,et al.  Overview and analysis of Vehicle Automation and Communication Systems from a motorway traffic management perspective , 2015 .

[3]  Marcel Sala,et al.  Effects of low speed limits on freeway traffic flow , 2017 .

[4]  Meng Wang,et al.  Connected variable speed limits control and car-following control with vehicle-infrastructure communication to resolve stop-and-go waves , 2016, J. Intell. Transp. Syst..

[5]  Markos Papageorgiou,et al.  Freeway ramp metering: an overview , 2002, IEEE Trans. Intell. Transp. Syst..

[6]  Zhong-Ping Jiang,et al.  Data-Driven Adaptive Optimal Control of Connected Vehicles , 2017, IEEE Transactions on Intelligent Transportation Systems.

[7]  Soyoung Ahn,et al.  A behavioural car-following model that captures traffic oscillations , 2012 .

[8]  Lina Kattan,et al.  Variable speed limit: A microscopic analysis in a connected vehicle environment , 2015 .

[9]  Peter Hidas,et al.  Modelling vehicle interactions in microscopic simulation of merging and weaving , 2005 .

[10]  Zhongzhen Yang,et al.  Linear complementarity system approach to macroscopic freeway traffic modelling: uniqueness and convexity , 2016 .

[11]  Soyoung Ahn,et al.  The effects of lane-changing on the immediate follower : anticipation, relaxation, and change in driver characteristics , 2013 .

[12]  Zhong-Ping Jiang,et al.  Predictive cruise control of connected and autonomous vehicles via reinforcement learning , 2019, IET Control Theory & Applications.

[13]  Meng Wang,et al.  Cooperative Car-Following Control: Distributed Algorithm and Impact on Moving Jam Features , 2016, IEEE Transactions on Intelligent Transportation Systems.

[14]  Stephen D. Boyles,et al.  A cell transmission model for dynamic lane reversal with autonomous vehicles , 2016 .

[15]  Stephen D. Boyles,et al.  A multiclass cell transmission model for shared human and autonomous vehicle roads , 2016 .

[16]  Markos Papageorgiou,et al.  Hierarchical model predictive control for multi-lane motorways in presence of Vehicle Automation and Communication Systems , 2016 .

[17]  Bart De Schutter,et al.  Model predictive control for optimal coordination of ramp metering and variable speed limits , 2005 .

[18]  Agachai Sumalee,et al.  Modeling the impacts of mandatory and discretionary lane-changing maneuvers , 2016 .

[19]  Agachai Sumalee,et al.  A cross-entropy method and probabilistic sensitivity analysis framework for calibrating microscopic traffic models , 2016 .

[20]  M Jepsen ON THE SPEED-FLOW RELATIONSHIPS IN ROAD TRAFFIC: A MODEL OF DRIVER BEHAVIOUR , 1998 .

[21]  Peng Hao,et al.  Connected Vehicle-Based Lane Selection Assistance Application , 2019, IEEE Transactions on Intelligent Transportation Systems.

[22]  Bruno Scherrer,et al.  Improvements on Learning Tetris with Cross Entropy , 2009, J. Int. Comput. Games Assoc..

[23]  Markos Papageorgiou,et al.  Highway traffic state estimation with mixed connected and conventional vehicles: Microscopic simulation-based testing , 2016 .

[24]  Dong Ngoduy,et al.  Enhanced cooperative car-following traffic model with the combination of V2V and V2I communication , 2016 .

[25]  Markos Papageorgiou,et al.  Local Feedback-Based Mainstream Traffic Flow Control on Motorways Using Variable Speed Limits , 2011, IEEE Transactions on Intelligent Transportation Systems.

[26]  Feng Zhu,et al.  Modeling the Proactive Driving Behavior of Connected Vehicles: A Cell‐Based Simulation Approach , 2018, Comput. Aided Civ. Infrastructure Eng..

[27]  Martin Treiber,et al.  Special issue on connected and automated traffic systems , 2020 .

[28]  Ning Zhu,et al.  A Jam-Absorption Driving Strategy for Mitigating Traffic Oscillations , 2017, IEEE Transactions on Intelligent Transportation Systems.

[29]  Soyoung Ahn,et al.  Towards vehicle automation: Roadway capacity formulation for traffic mixed with regular and automated vehicles , 2017 .

[30]  M. Papageorgiou,et al.  Effects of Variable Speed Limits on Motorway Traffic Flow , 2008 .

[31]  Mohsen Ramezani,et al.  Mixed flow of autonomous and human-driven vehicles: Analytical headway modeling and optimal lane management , 2019 .

[32]  Tie-Qiao Tang,et al.  Impacts of energy consumption and emissions on the trip cost without late arrival at the equilibrium state , 2017 .

[33]  Warren B. Powell,et al.  “Approximate dynamic programming: Solving the curses of dimensionality” by Warren B. Powell , 2007, Wiley Series in Probability and Statistics.

[34]  Markos Papageorgiou,et al.  Highway Traffic State Estimation With Mixed Connected and Conventional Vehicles , 2015, IEEE Transactions on Intelligent Transportation Systems.

[35]  Soyoung Ahn,et al.  Freeway Traffic Oscillations and Vehicle Lane-Change Maneuvers , 2007 .

[36]  Bart De Schutter,et al.  Optimal coordination of variable speed limits to suppress shock waves , 2005, IEEE Transactions on Intelligent Transportation Systems.

[37]  Dong Ngoduy,et al.  Platoon based cooperative driving model with consideration of realistic inter-vehicle communication , 2016 .

[38]  Bruno Scherrer,et al.  Approximate Dynamic Programming Finally Performs Well in the Game of Tetris , 2013, NIPS.

[39]  Wei Wang,et al.  Development of a Control Strategy of Variable Speed Limits to Reduce Rear-End Collision Risks Near Freeway Recurrent Bottlenecks , 2014, IEEE Transactions on Intelligent Transportation Systems.

[40]  Pravin Varaiya,et al.  Smart cars on smart roads: problems of control , 1991, IEEE Trans. Autom. Control..

[41]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[42]  Hai Le Vu,et al.  A multiclass microscopic model for heterogeneous platoon with vehicle-to-vehicle communication , 2019 .

[43]  Satish V. Ukkusuri,et al.  A linear programming formulation for autonomous intersection control within a dynamic traffic assignment and connected vehicle environment , 2015 .

[44]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[45]  Maria Laura Delle Monache,et al.  Dissipation of stop-and-go waves via control of autonomous vehicles: Field experiments , 2017, ArXiv.

[46]  Dirk P. Kroese,et al.  The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation and Machine Learning , 2004 .

[47]  Alireza Talebpour,et al.  Influence of connected and autonomous vehicles on traffic flow stability and throughput , 2016 .

[48]  Markos Papageorgiou,et al.  Traffic flow optimisation in presence of vehicle automation and communication systems – Part I: A first-order multi-lane model for motorway traffic , 2015 .

[49]  Agachai Sumalee,et al.  Optimal and robust strategies for freeway traffic management under demand and supply uncertainties: an overview and general theory , 2014 .

[50]  Petros A. Ioannou,et al.  Combined Variable Speed Limit and Lane Change Control for Highway Traffic , 2017, IEEE Transactions on Intelligent Transportation Systems.

[51]  Yougang Bian,et al.  A Survey on Cooperative Longitudinal Motion Control of Multiple Connected and Automated Vehicles , 2020, IEEE Intelligent Transportation Systems Magazine.

[52]  Bart De Schutter,et al.  Model Predictive Control for Freeway Networks Based on Multi-Class Traffic Flow and Emission Models , 2017, IEEE Transactions on Intelligent Transportation Systems.

[53]  Xiaobo Qu,et al.  On the Impact of Cooperative Autonomous Vehicles in Improving Freeway Merging: A Modified Intelligent Driver Model-Based Approach , 2017, IEEE Transactions on Intelligent Transportation Systems.

[54]  Markos Papageorgiou,et al.  Optimal Motorway Traffic Flow Control Involving Variable Speed Limits and Ramp Metering , 2010, Transp. Sci..

[55]  Markos Papageorgiou,et al.  Traffic flow optimisation in presence of vehicle automation and communication systems – Part II: Optimal control for multi-lane motorways , 2015 .

[56]  Bart De Schutter,et al.  Reinforcement Learning and Dynamic Programming Using Function Approximators , 2010 .

[57]  Wei Wang,et al.  Optimal Mainline Variable Speed Limit Control to Improve Safety on Large‐Scale Freeway Segments , 2016, Comput. Aided Civ. Infrastructure Eng..

[58]  Agachai Sumalee,et al.  Multiclass multilane model for freeway traffic mixed with connected automated vehicles and regular human-piloted vehicles , 2019, Transportmetrica A: Transport Science.

[59]  Andreas A. Malikopoulos,et al.  Optimal Control for Speed Harmonization of Automated Vehicles , 2016, IEEE Transactions on Intelligent Transportation Systems.

[60]  András Lörincz,et al.  Learning Tetris Using the Noisy Cross-Entropy Method , 2006, Neural Computation.

[61]  Markos Papageorgiou,et al.  Microsimulation Analysis of Practical Aspects of Traffic Control With Variable Speed Limits , 2015, IEEE Transactions on Intelligent Transportation Systems.