On the use of Deep Autoencoders for Efficient Embedded Reinforcement Learning
暂无分享,去创建一个
Tim Oates | Tinoosh Mohsenin | Nicholas R. Waytowich | Mark Horton | Bharat Prakash | William David Hairston | T. Oates | W. Hairston | T. Mohsenin | Bharat Prakash | Mark Horton
[1] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[2] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[3] Peter Stone,et al. Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces , 2017, AAAI.
[4] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[5] Tim Oates,et al. SensorNet: A Scalable and Low-Power Deep Convolutional Neural Network for Multimodal Data Classification , 2019, IEEE Transactions on Circuits and Systems I: Regular Papers.
[6] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[7] Vinicius G. Goecks,et al. Cycle-of-Learning for Autonomous Systems from Human Interaction , 2018, ArXiv.
[8] Owain Evans,et al. Trial without Error: Towards Safe Reinforcement Learning via Human Intervention , 2017, AAMAS.
[9] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.