Towards Hardware Accelerated Reinforcement Learning for Application-Specific Robotic Control
暂无分享,去创建一个
Wayne Luk | Jason Tsai | Thomas C. P. Chau | Shengjia Shao | Alexander Warren | Michal Mysior | Ben Jeppesen | W. Luk | T. Chau | B. Jeppesen | Alexander Warren | Shengjia Shao | Jason Tsai | Michal Mysior
[1] Jan Peters,et al. Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..
[2] Barak A. Pearlmutter. Fast Exact Multiplication by the Hessian , 1994, Neural Computation.
[3] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[4] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[5] Sergey Levine,et al. Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[6] Federico Baronti,et al. An FPGA-based controller for collaborative robotics , 2017, 2017 IEEE 26th International Symposium on Industrial Electronics (ISIE).
[7] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[8] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[9] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[10] David B. Thomas,et al. Neural Network Based Reinforcement Learning Acceleration on FPGA Platforms , 2017, CARN.
[11] Wayne Luk,et al. Customised pearlmutter propagation: A hardware architecture for trust region policy optimisation , 2017, 2017 27th International Conference on Field Programmable Logic and Applications (FPL).