Optimal Output-Feedback Control of Unknown Continuous-Time Linear Systems Using Off-policy Reinforcement Learning
[1] Warren E. Dixon,et al. Concurrent learning-based network synchronization , 2014, 2014 American Control Conference.
[2] Warren B. Powell,et al. Approximate Dynamic Programming - Solving the Curses of Dimensionality , 2007 .
[3] Frank L. Lewis,et al. Adaptive Suboptimal Output-Feedback Control for Linear Systems Using Integral Reinforcement Learning , 2015, IEEE Transactions on Control Systems Technology.
[4] Bijnan Bandyopadhyay,et al. Output Feedback Sliding-Mode Control for Uncertain Systems Using Fast Output Sampling Technique , 2006, IEEE Transactions on Industrial Electronics.
[5] Frank L. Lewis,et al. Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Automatica.
[6] Frank L. Lewis,et al. Optimal Control, 3rd ed. , 2012 .
[7] Tingwen Huang,et al. Off-Policy Reinforcement Learning for $H_\infty$ Control Design , 2013, IEEE Transactions on Cybernetics.
[8] Frank L. Lewis,et al. Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems , 2014, Automatica.
[9] Frank L. Lewis,et al. Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics , 2014, Automatica.
[10] Derong Liu,et al. Neural Network $H_\infty$ Tracking Control of Nonlinear Systems Using GHJI Method , 2013, ISNN.
[11] Derong Liu,et al. Optimal Tracking Control Scheme for Discrete-Time Nonlinear Systems with Approximation Errors , 2013, ISNN.
[12] Yu Jiang,et al. Robust Adaptive Dynamic Programming and Feedback Stabilization of Nonlinear Systems , 2014, IEEE Transactions on Neural Networks and Learning Systems.
[13] Jae Young Lee,et al. Integral Reinforcement Learning for Continuous-Time Input-Affine Nonlinear Systems With Simultaneous Invariant Explorations , 2015, IEEE Transactions on Neural Networks and Learning Systems.
[14] Herbert Werner. Robust control of a laboratory flight simulator by nondynamic multirate output feedback , 1996, Proceedings of 35th IEEE Conference on Decision and Control.
[15] Le Yi Wang,et al. State Observability and Observers of Linear-Time-Invariant Systems Under Irregular Sampling and Sensor Limitations , 2011, IEEE Transactions on Automatic Control.
[16] Zhong-Ping Jiang,et al. Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics , 2012, Automatica.
[17] Huaguang Zhang,et al. Online optimal tracking control of continuous-time linear systems with unknown dynamics by using adaptive dynamic programming , 2014, Int. J. Control.
[18] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[19] Zhong-Ping Jiang,et al. Adaptive and optimal output feedback control of linear systems: An adaptive dynamic programming approach , 2014, Proceedings of the 11th World Congress on Intelligent Control and Automation.
[20] D. Kleinman. On an iterative technique for Riccati equation computations , 1968 .
[21] Frank L. Lewis,et al. 2009 Special Issue: Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems , 2009 .
[22] Frank L. Lewis,et al. Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning , 2014, IEEE Transactions on Automatic Control.
[23] Approximate dynamic programming for output feedback control , 2010, Proceedings of the 29th Chinese Control Conference.
[24] Frank L. Lewis,et al. Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[25] Frank L. Lewis,et al. Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning , 2014, Automatica.
[26] Huaguang Zhang,et al. Optimal Tracking Control for a Class of Nonlinear Discrete-Time Systems With Time Delays Based on Heuristic Dynamic Programming , 2011, IEEE Transactions on Neural Networks.