Reinforcement Learning and Approximate Dynamic Programming for Feedback Control
暂无分享,去创建一个
[1] Frank L. Lewis,et al. Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[2] S. N. Balakrishnan,et al. Adaptive-critic based neural networks for aircraft optimal control , 1996 .
[3] Frank L. Lewis,et al. Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2010, Autom..
[4] Frank L. Lewis,et al. A Cost Function Based Single Network Adaptive Critic architecture for optimal control synthesis for a class of nonlinear systems , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).
[5] Lyle Noakes,et al. Continuous-Time Adaptive Critics , 2007, IEEE Transactions on Neural Networks.
[6] Radhakant Padhi,et al. A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems , 2006, Neural Networks.
[7] S. N. Balakrishnan,et al. State-constrained agile missile control with adaptive-critic-based neural networks , 2002, IEEE Trans. Control. Syst. Technol..
[8] P.J. Werbos,et al. Using ADP to Understand and Replicate Brain Intelligence: the Next Level Design , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.
[9] Radhakant Padhi,et al. Robust/optimal temperature profile control using neural networks , 2006, 2006 IEEE Conference on Computer Aided Control System Design, 2006 IEEE International Conference on Control Applications, 2006 IEEE International Symposium on Intelligent Control.
[10] Frank L. Lewis,et al. Applied Optimal Control and Estimation , 1992 .
[11] J. Si,et al. Robust Dynamic Programming for Discounted Infinite-Horizon Markov Decision Processes with Uncertain Stationary Transition Matrice , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.
[12] S.N. Balakrishnan,et al. Optimal beaver population management using reduced order distributed parameter model and single network adaptive critics , 2004, Proceedings of the 2004 American Control Conference.
[13] Robert F. Stengel,et al. Online Adaptive Critic Flight Control , 2004 .
[14] Radhakant Padhi,et al. Proper orthogonal decomposition based optimal neurocontrol synthesis of a chemical reactor process using approximate dynamic programming , 2003, Neural Networks.
[15] A. Heydari,et al. Finite-horizon input-constrained nonlinear optimal control using single network adaptive critics , 2011, Proceedings of the 2011 American Control Conference.
[16] Daniel J. Scheeres,et al. Solving Optimal Continuous Thrust Rendezvous Problems with Generating Functions , 2005 .
[17] George G. Lendaris,et al. Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.