Continuity and Smoothness Analysis and Possible Improvement of Traditional Reinforcement Learning Methods
暂无分享,去创建一个
Jianjun Yuan | Tianhao Chen | Wenchuan Jia | Shugen Ma | Limei Cheng | Shugen Ma | Jianjun Yuan | Tianhao Chen | Wenchuan Jia | Limei Cheng
[1] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[2] Sergey Levine,et al. Residual Reinforcement Learning for Robot Control , 2018, 2019 International Conference on Robotics and Automation (ICRA).
[3] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[4] Sergey Levine,et al. Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[5] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[6] Yang Liu,et al. Incremental Reinforcement Learning - a New Continuous Reinforcement Learning Frame Based on Stochastic Differential Equation methods , 2019, ArXiv.
[7] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[8] Kamyar Azizzadenesheli,et al. Reinforcement Learning of POMDPs using Spectral Methods , 2016, COLT.
[9] B. Øksendal. Stochastic differential equations : an introduction with applications , 1987 .
[10] Takashi Komeda,et al. REINFORCEMENT LEARNING FOR POMDP USING STATE CLASSIFICATION , 2008, MLMTA.
[11] Stephen Tyree,et al. Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU , 2016, ICLR.
[12] Yuichiro Yoshikawa,et al. Robot gains social intelligence through multimodal deep reinforcement learning , 2016, 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids).
[13] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[14] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.