A Decision-Making Strategy for Vehicle Autonomous Braking in Emergency via Deep Reinforcement Learning

Autonomous braking through vehicle precise decision-making and control to reduce accidents is a key issue, especially in the early diffusion phase of autonomous vehicle development. This paper proposes a deep reinforcement learning (DRL)-based autonomous braking decision-making strategy in an emergency situation. Three key influencing factors, including efficiency, accuracy and passengers’ comfort, are fully considered and satisfied by the proposed strategy. First, the vehicle lane-changing process and the braking process are analyzed in detail, which include the critical factors in the design of the autonomous braking strategy. Second, we propose a DRL process that determines the optimal strategy for autonomous braking. Particularly, a multi-objective reward function is designed, which can compromise the rewards achieved of different brake moments, the degree of the accident, and the comfort of the passenger. Third, a typical actor-critic (AC) algorithm named deep deterministic policy gradient (DDPG) is adopted for solving the autonomous braking problem, which can improve the efficiency of the optimal strategy and be stable in continuous control tasks. Once the strategy is well trained, the vehicle can automatically take optimal braking behavior in an emergency to improve driving safety. Extensive simulations validate the effectiveness and efficiency of our proposal in terms of learning effectiveness, decision-making accuracy and driving safety.

[1]  Ge Guo,et al.  Autonomous Platoon Control Allowing Range-Limited Sensors , 2012, IEEE Transactions on Vehicular Technology.

[2]  Jianliang Wang,et al.  Distributed Adaptive Integrated-Sliding-Mode Controller Synthesis for String Stability of Vehicle Platoons , 2016, IEEE Transactions on Intelligent Transportation Systems.

[3]  Francesco Borrelli,et al.  Control of Connected and Automated Vehicles: State of the Art and Future Challenges , 2018, Annu. Rev. Control..

[4]  Mehrdad Dianati,et al.  A Survey of the State-of-the-Art Localization Techniques and Their Potentials for Autonomous Vehicle Applications , 2018, IEEE Internet of Things Journal.

[5]  Tao Zhang,et al.  Design and Experimental Validation of a Cooperative Adaptive Cruise Control System Based on Supervised Reinforcement Learning , 2018, Applied Sciences.

[6]  Charles Desjardins,et al.  Cooperative Adaptive Cruise Control: A Reinforcement Learning Approach , 2011, IEEE Transactions on Intelligent Transportation Systems.

[7]  John N. Tsitsiklis,et al.  Actor-Critic Algorithms , 1999, NIPS.

[8]  Kun Cao,et al.  A dynamic automated lane change maneuver based on vehicle-to-vehicle communication , 2016 .

[9]  Tom H. Luan,et al.  Graded Warning for Rear-End Collision: An Artificial Intelligence-Aided Algorithm , 2020, IEEE Transactions on Intelligent Transportation Systems.

[10]  Xin Wang,et al.  Distributed Multiagent Coordinated Learning for Autonomous Driving in Highways Based on Dynamic Coordination Graphs , 2020, IEEE Transactions on Intelligent Transportation Systems.

[11]  Kyongsu Yi,et al.  Human driving data-based design of a vehicle adaptive cruise control algorithm , 2008 .

[12]  Vicente Milanés Montero,et al.  Autonomous Pedestrian Collision Avoidance Using a Fuzzy Steering Controller , 2011, IEEE Transactions on Intelligent Transportation Systems.

[13]  Fang Zhang,et al.  Combining Deep Reinforcement Learning and Safety Based Control for Autonomous Driving , 2016, ArXiv.

[14]  Christopher M. Bingham,et al.  Application of fuzzy control algorithms for electric vehicle antilock braking/traction control systems , 2003, IEEE Trans. Veh. Technol..

[15]  Peter Stone,et al.  Reinforcement learning , 2019, Scholarpedia.

[16]  José Eugenio Naranjo,et al.  Cooperative Throttle and Brake Fuzzy Control for ACC$+$ Stop&Go Maneuvers , 2007, IEEE Transactions on Vehicular Technology.

[17]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[18]  Petros A. Ioannou,et al.  Longitudinal control of heavy trucks in mixed traffic: environmental and fuel economy considerations , 2006, IEEE Transactions on Intelligent Transportation Systems.

[19]  Brahim Chaib-draa,et al.  Partial Local FriendQ Multiagent Learning: Application to Team Automobile Coordination Problem , 2006, Canadian Conference on AI.

[20]  Mario di Bernardo,et al.  Design, Analysis, and Experimental Validation of a Distributed Protocol for Platooning in the Presence of Time-Varying Heterogeneous Delays , 2016, IEEE Transactions on Control Systems Technology.

[21]  Kyongsu Yi,et al.  Nonlinear brake control for vehicle CW/CA systems , 2001 .

[22]  S. Schot,et al.  Jerk: The time rate of change of acceleration , 1978 .

[23]  Ye-Hwa Chen,et al.  A Fuzzy Approach for Optimal Robust Control Design of an Automotive Electronic Throttle System , 2018, IEEE Transactions on Fuzzy Systems.

[24]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[25]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[26]  Chung Choo Chung,et al.  Autonomous braking system via deep reinforcement learning , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).

[27]  Yuval Tassa,et al.  Learning Continuous Control Policies by Stochastic Value Gradients , 2015, NIPS.

[28]  Wenhu Qin,et al.  Research on a DSRC-Based Rear-End Collision Warning Model , 2014, IEEE Transactions on Intelligent Transportation Systems.

[29]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[30]  Elias B. Kosmatopoulos,et al.  Collision avoidance analysis for lane changing and merging , 1999, IEEE Trans. Veh. Technol..