A boundedness result for the direct heuristic dynamic programming
暂无分享,去创建一个
Feng Liu | Shengwei Mei | Jian Sun | Jennie Si | Wentao Guo | J. Si | Jian Sun | S. Mei | Wentao Guo | Feng Liu
[1] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[2] and Charles K. Taft Reswick,et al. Introduction to Dynamic Systems , 1967 .
[3] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[4] Chao Lu,et al. Direct Heuristic Dynamic Programming for Damping Oscillations in a Large Power System , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[5] Frank L. Lewis,et al. Policy Iterations on the Hamilton–Jacobi–Isaacs Equation for $H_{\infty}$ State Feedback Control With Input Saturation , 2006, IEEE Transactions on Automatic Control.
[6] Frank L. Lewis,et al. Neurodynamic Programming and Zero-Sum Games for Constrained Control Systems , 2008, IEEE Transactions on Neural Networks.
[7] Yoh-Han Pao,et al. Stochastic choice of basis functions in adaptive function approximation and the functional-link net , 1995, IEEE Trans. Neural Networks.
[8] Frank L. Lewis,et al. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..
[9] Feng Liu,et al. Direct heuristic dynamic programming with augmented states , 2011, The 2011 International Joint Conference on Neural Networks.
[10] Lei Yang,et al. Direct Heuristic Dynamic Programming for Nonlinear Tracking Control With Filtered Tracking Error , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[11] D. Ernst,et al. Power systems stability control: reinforcement learning framework , 2004, IEEE Transactions on Power Systems.
[12] Paul J. Werbos,et al. Neural networks for control and system identification , 1989, Proceedings of the 28th IEEE Conference on Decision and Control,.
[13] Jennie Si,et al. Helicopter trimming and tracking control using direct neural dynamic programming , 2003, IEEE Trans. Neural Networks.
[14] Tomas Landelius,et al. Reinforcement Learning and Distributed Local Model Synthesis , 1997 .
[15] Jennie Si,et al. Online learning control by association and reinforcement. , 2001, IEEE transactions on neural networks.
[16] Stuart E. Dreyfus,et al. Applied Dynamic Programming , 1965 .
[17] Jennie Si,et al. Helicopter Flight-Control Reconfiguration for Main Rotor Actuator Failures , 2003 .
[18] Jennie Si,et al. Apache Helicopter Stabilization Using Neural Dynamic Programming , 2002 .
[19] Andrew G. Barto,et al. Adaptive linear quadratic control using policy iteration , 1994, Proceedings of 1994 American Control Conference - ACC '94.
[20] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[21] Frank L. Lewis,et al. Generalized Policy Iteration for continuous-time systems , 2009, 2009 International Joint Conference on Neural Networks.
[22] Roberto A. Santiago,et al. Adaptive critic designs: A case study for neurocontrol , 1995, Neural Networks.
[23] Frank L. Lewis,et al. Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.