Amnon Shashua | Shai Shalev-Shwartz | Shaked Shammah
[1] Amnon Shashua, et al. Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving, 2016, ArXiv.
[2] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[3] Siuming Lo, et al. Discrete Element Crowd Model for Pedestrian Evacuation Through an Exit, 2015.
[4] Richard Bellman, et al. Introduction to the mathematical theory of control processes, 1967.
[5] L. C. Baird, et al. Reinforcement learning in continuous time: advantage updating, 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).
[6] Leslie G. Valiant, et al. A theory of the learnable, 1984, STOC '84.
[7] R. Bellman, et al. Dynamic Programming and Lagrange Multipliers, 1956, Proceedings of the National Academy of Sciences of the United States of America.
[8] Pete L. Clark, et al. The Instructor’s Guide to Real Induction, 2012, Mathematics Magazine.
[9] John L. Casti. Introduction to the Mathematical Theory of Control Processes, Volume I: Linear Equations and Quadratic Criteria, Volume II: Nonlinear Processes, 1978, IEEE Transactions on Systems, Man, and Cybernetics.
[10] Doina Precup, et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning, 1999, Artif. Intell.