Handbook of Learning and Approximate Dynamic Programming
暂无分享,去创建一个
Warren B. Powell | Jennie Si | Donald C. Wunsch | Andrew G. Barto | A. Barto | Warrren B Powell | J. Si | D. Wunsch
[1] Bernard Widrow,et al. Punish/Reward: Learning with a Critic in Adaptive Threshold Systems , 1973, IEEE Trans. Syst. Man Cybern..
[2] Derrick H. Nguyen,et al. Truck backer-upper: an example of self-learning in neural networks , 1990, Defense, Security, and Sensing.
[3] Richard S. Sutton,et al. Neural networks for control , 1990 .
[4] R.J. Williams,et al. Reinforcement learning is direct adaptive optimal control , 1991, IEEE Control Systems.
[5] Donald A. Sofge,et al. Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , 1992 .
[6] Eduardo D. Sontag,et al. Neural Networks for Control , 1993 .
[7] Snehasis Mukhopadhyay,et al. Adaptive control of nonlinear multivariable systems using neural networks , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.
[8] Richard S. Sutton,et al. A Menu of Designs for Reinforcement Learning Over Time , 1995 .
[9] Roberto A. Santiago,et al. Adaptive critic designs: A case study for neurocontrol , 1995, Neural Networks.
[10] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[11] Li-Xin Wang,et al. A Course In Fuzzy Systems and Control , 1996 .
[12] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[13] George G. Lendaris,et al. More on training strategies for critic and action neural networks in dual heuristic programming method , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.
[14] A. Rantzer,et al. System Analysis via Integral Quadratic Constraints. Part II , 1997 .
[15] Nikita A. Visnevski. Control of a nonlinear multivariable system with adaptive critic designs , 1997 .
[16] Donald C. Wunsch,et al. Adaptive critic designs and their applications , 1997 .
[17] George G. Lendaris,et al. Training strategies for critic and action neural networks in dual heuristic programming method , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).
[18] James C. Neidhoefer,et al. Immunized adaptive critics , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).
[19] J. Yen,et al. Fuzzy Logic: Intelligence, Control, and Information , 1998 .
[20] T. T. Shannon,et al. Application considerations for the DHP methodology , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).
[21] Christopher Macleod,et al. INTELLIGENT SIGNAL PROCESSING , 1999 .
[22] T. Shannon,et al. Qualitative models for adaptive critic neurocontrol , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).
[23] Thaddeus T. Shannon. Partial, noisy and qualitative models for adaptive critic based neurocontrol , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).
[24] George G. Lendaris,et al. A comparison of training algorithms for DHP adaptive critic neurocontrol , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).
[25] R. Saeks,et al. Adaptive critic control of the power train in a hybrid electric vehicle , 1999, SMCia/99 Proceedings of the 1999 IEEE Midnight - Sun Workshop on Soft Computing Methods in Industrial Applications (Cat. No.99EX269).
[26] J. Neidhoefer,et al. Immunized Adaptive Critic for an Autonomous Aircraft Control Application , 1999 .
[27] James Christian Neidhoefer. Intelligent control for autonomous aircraft missions , 1999 .
[28] Rein Luus,et al. Iterative dynamic programming , 2019, Iterative Dynamic Programming.
[29] George G. Lendaris,et al. Adaptive critic based approximate dynamic programming for tuning fuzzy controllers , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).
[30] Charles W. Anderson,et al. Approximating a Policy Can be Easier Than Approximating a Value Function , 2000 .
[31] George G. Lendaris,et al. A New Hybrid Critic-Training Method for Approximate Dynamic Programming , 2000 .
[32] George G. Lendaris,et al. Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.
[33] George G. Lendaris,et al. Controller design (from scratch) using approximate dynamic programming , 2000, Proceedings of the 2000 IEEE International Symposium on Intelligent Control. Held jointly with the 8th IEEE Mediterranean Conference on Control and Automation (Cat. No.00CH37147).
[34] Donald C. Wunsch,et al. Neurocontroller alternatives for "fuzzy" ball-and-beam systems with nonuniform nonlinear friction , 2000, IEEE Trans. Neural Networks Learn. Syst..
[35] Andrew G. Barto,et al. Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density , 2001, ICML.
[36] Douglas C. Hittle,et al. Robust reinforcement learning control with static and dynamic stability , 2001 .
[37] Radhakant Padhi,et al. A systematic synthesis of optimal process control with neural networks , 2001, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).
[38] Zhongwu Huang,et al. ROBUST ADAPTIVE CRITIC BASED NEUROCONTROLLERS FOR MISSILES WITH MODEL UNCERTAINTIES , 2001 .
[39] George G. Lendaris,et al. Using DHP adaptive critic methods to tune a fuzzy automobile steering controller , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).
[40] Simon Haykin,et al. Intelligent Signal Processing , 2001 .
[41] Radhakant Padhi,et al. Adaptive-critic based optimal neuro control synthesis for distributed parameter systems , 2001, Autom..
[42] George G. Lendaris,et al. Dual heuristic programming for fuzzy control , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).
[43] George G. Lendaris,et al. A comparison of DHP based antecedent parameter tuning strategies for fuzzy control , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).
[44] T. T. Shannon,et al. Adaptive critic based adaptation of a fuzzy policy manager for a logistic system , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).
[45] George G. Lendaris,et al. Adaptive critic based design of a fuzzy motor speed controller , 2001, Proceeding of the 2001 IEEE International Symposium on Intelligent Control (ISIC '01) (Cat. No.01CH37206).
[46] Andrew G. Barto,et al. Lyapunov Design for Safe Reinforcement Learning , 2003, J. Mach. Learn. Res..
[47] George G. Lendaris,et al. Proposed framework for applying adaptive critics in real-time realm , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).
[48] Nikita Barabanov,et al. Stability analysis of discrete-time recurrent neural networks , 2002, IEEE Trans. Neural Networks.
[49] N. E. Barabanov,et al. Two alternative stability criteria for discrete-time RMLP , 2002, Proceedings of the 41st IEEE Conference on Decision and Control, 2002..
[50] George G. Lendaris,et al. Adaptive dynamic programming , 2002, IEEE Trans. Syst. Man Cybern. Part C.
[51] George G. Lendaris,et al. Controller design via adaptive critic and model reference methods , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..
[52] Roberto A. Santiago,et al. Accelerating critic learning in approximate dynamic programming via value templates and perceptual learning , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..
[53] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.