论文信息 - Freeway Merging in Congested Traffic based on Multipolicy Decision Making with Passive Actor Critic

Freeway Merging in Congested Traffic based on Multipolicy Decision Making with Passive Actor Critic

Freeway merging in congested traffic is a significant challenge toward fully automated driving. Merging vehicles need to decide not only how to merge into a spot, but also where to merge. We present a method for the freeway merging based on multi-policy decision making with a reinforcement learning method called {\em passive actor-critic} (pAC), which learns with less knowledge of the system and without active exploration. The method selects a merging spot candidate by using the state value learned with pAC. We evaluate our method using real traffic data. Our experiments show that pAC achieves 92\% success rate to merge into a freeway, which is comparable to human decision making.

Prashant Doshi | Danil V. Prokhorov | Tomoki Nishi

[1] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.

[2] Emanuel Todorov,et al. Efficient computation of optimal actions , 2009, Proceedings of the National Academy of Sciences.

[3] Emanuel Todorov,et al. Eigenfunction approximation methods for linearly-solvable optimal control problems , 2009, 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning.

[4] Robert Babuska,et al. A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[5] Matthew Derry,et al. Challenges in Perception and Decision Making for Intelligent Automotive Vehicles: A Case Study , 2016, IEEE Transactions on Intelligent Vehicles.

[6] Vassili Alexiadis,et al. Video -Based Vehicle Trajectory Data Collection , 2007 .

[7] Edwin Olson,et al. Multipolicy decision-making for autonomous driving via changepoint-based behavior prediction: Theory and experiment , 2015, Autonomous Robots.