Reinforcement Learning Policy with Proportional-Integral Control
[1] Marcin Andrychowicz, et al. Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research, 2018, ArXiv.
[2] Elman Mansimov, et al. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation, 2017, NIPS.
[3] Alex Graves, et al. Asynchronous Methods for Deep Reinforcement Learning, 2016, ICML.
[4] Sergey Levine, et al. Trust Region Policy Optimization, 2015, ICML.
[5] Andrew W. Moore, et al. Reinforcement Learning: A Survey, 1996, J. Artif. Intell. Res.
[6] Zachary Chase Lipton. A Critical Review of Recurrent Neural Networks for Sequence Learning, 2015, ArXiv.
[7] Peter Dayan, et al. Technical Note: Q-Learning, 2004, Machine Learning.
[8] Sergey Levine, et al. Composable Deep Reinforcement Learning for Robotic Manipulation, 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[9] Yuval Tassa, et al. Continuous control with deep reinforcement learning, 2015, ICLR.
[10] Yishay Mansour, et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation, 1999, NIPS.
[11] M. Moghavvemi, et al. Modelling and PID controller design for a quadrotor unmanned air vehicle, 2010, 2010 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR).
[12] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[13] Demis Hassabis, et al. Mastering the game of Go with deep neural networks and tree search, 2016, Nature.
[14] Jimmy Ba, et al. Adam: A Method for Stochastic Optimization, 2014, ICLR.
[15] Bram Bakker, et al. Reinforcement Learning with Long Short-Term Memory, 2001, NIPS.
[16] Yun Li, et al. PID control system analysis, design, and technology, 2005, IEEE Transactions on Control Systems Technology.
[17] Sham M. Kakade, et al. Towards Generalization and Simplicity in Continuous Control, 2017, NIPS.
[18] F.L. Lewis, et al. Reinforcement learning and adaptive dynamic programming for feedback control, 2009, IEEE Circuits and Systems Magazine.
[19] Sergey Levine, et al. End-to-End Training of Deep Visuomotor Policies, 2015, J. Mach. Learn. Res.