暂无分享,去创建一个
[1] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.
[2] Emanuel Todorov,et al. Efficient computation of optimal actions , 2009, Proceedings of the National Academy of Sciences.
[3] Emanuel Todorov,et al. Eigenfunction approximation methods for linearly-solvable optimal control problems , 2009, 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning.
[4] Robert Babuska,et al. A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).
[5] Matthew Derry,et al. Challenges in Perception and Decision Making for Intelligent Automotive Vehicles: A Case Study , 2016, IEEE Transactions on Intelligent Vehicles.
[6] Vassili Alexiadis,et al. Video -Based Vehicle Trajectory Data Collection , 2007 .
[7] Edwin Olson,et al. Multipolicy decision-making for autonomous driving via changepoint-based behavior prediction: Theory and experiment , 2015, Autonomous Robots.