Handbook of Learning and Approximate Dynamic Programming

Foreword. 1. ADP: goals, opportunities and principles. Part I: Overview. 2. Reinforcement learning and its relationship to supervised learning. 3. Model-based adaptive critic designs. 4. Guidance in the use of adaptive critics for control. 5. Direct neural dynamic programming. 6. The linear programming approach to approximate dynamic programming. 7. Reinforcement learning in large, high-dimensional state spaces. 8. Hierarchical decision making. Part II: Technical advances. 9. Improved temporal difference methods with linear function approximation. 10. Approximate dynamic programming for high-dimensional resource allocation problems. 11. Hierarchical approaches to concurrency, multiagency, and partial observability. 12. Learning and optimization - from a system theoretic perspective. 13. Robust reinforcement learning using integral-quadratic constraints. 14. Supervised actor-critic reinforcement learning. 15. BPTT and DAC - a common framework for comparison. Part III: Applications. 16. Near-optimal control via reinforcement learning. 17. Multiobjective control problems by reinforcement learning. 18. Adaptive critic based neural network for control-constrained agile missile. 19. Applications of approximate dynamic programming in power systems control. 20. Robust reinforcement learning for heating, ventilation, and air conditioning control of buildings. 21. Helicopter flight control using direct neural dynamic programming. 22. Toward dynamic stochastic optimal power flow. 23. Control, optimization, security, and self-healing of benchmark power systems.

[1]  Bernard Widrow,et al.  Punish/Reward: Learning with a Critic in Adaptive Threshold Systems , 1973, IEEE Trans. Syst. Man Cybern..

[2]  Derrick H. Nguyen,et al.  Truck backer-upper: an example of self-learning in neural networks , 1990, Defense, Security, and Sensing.

[3]  Richard S. Sutton,et al.  Neural networks for control , 1990 .

[4]  R.J. Williams,et al.  Reinforcement learning is direct adaptive optimal control , 1991, IEEE Control Systems.

[5]  Donald A. Sofge,et al.  Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , 1992 .

[6]  Eduardo D. Sontag,et al.  Neural Networks for Control , 1993 .

[7]  Snehasis Mukhopadhyay,et al.  Adaptive control of nonlinear multivariable systems using neural networks , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.

[8]  Richard S. Sutton,et al.  A Menu of Designs for Reinforcement Learning Over Time , 1995 .

[9]  Roberto A. Santiago,et al.  Adaptive critic designs: A case study for neurocontrol , 1995, Neural Networks.

[10]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[11]  Li-Xin Wang,et al.  A Course In Fuzzy Systems and Control , 1996 .

[12]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[13]  George G. Lendaris,et al.  More on training strategies for critic and action neural networks in dual heuristic programming method , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[14]  A. Rantzer,et al.  System Analysis via Integral Quadratic Constraints. Part II , 1997 .

[15]  Nikita A. Visnevski Control of a nonlinear multivariable system with adaptive critic designs , 1997 .

[16]  Donald C. Wunsch,et al.  Adaptive critic designs and their applications , 1997 .

[17]  George G. Lendaris,et al.  Training strategies for critic and action neural networks in dual heuristic programming method , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[18]  James C. Neidhoefer,et al.  Immunized adaptive critics , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[19]  J. Yen,et al.  Fuzzy Logic: Intelligence, Control, and Information , 1998 .

[20]  T. T. Shannon,et al.  Application considerations for the DHP methodology , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[21]  Christopher Macleod,et al.  INTELLIGENT SIGNAL PROCESSING , 1999 .

[22]  T. Shannon,et al.  Qualitative models for adaptive critic neurocontrol , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[23]  Thaddeus T. Shannon Partial, noisy and qualitative models for adaptive critic based neurocontrol , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[24]  George G. Lendaris,et al.  A comparison of training algorithms for DHP adaptive critic neurocontrol , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[25]  R. Saeks,et al.  Adaptive critic control of the power train in a hybrid electric vehicle , 1999, SMCia/99 Proceedings of the 1999 IEEE Midnight - Sun Workshop on Soft Computing Methods in Industrial Applications (Cat. No.99EX269).

[26]  J. Neidhoefer,et al.  Immunized Adaptive Critic for an Autonomous Aircraft Control Application , 1999 .

[27]  James Christian Neidhoefer Intelligent control for autonomous aircraft missions , 1999 .

[28]  Rein Luus,et al.  Iterative dynamic programming , 2019, Iterative Dynamic Programming.

[29]  George G. Lendaris,et al.  Adaptive critic based approximate dynamic programming for tuning fuzzy controllers , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[30]  Charles W. Anderson,et al.  Approximating a Policy Can be Easier Than Approximating a Value Function , 2000 .

[31]  George G. Lendaris,et al.  A New Hybrid Critic-Training Method for Approximate Dynamic Programming , 2000 .

[32]  George G. Lendaris,et al.  Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[33]  George G. Lendaris,et al.  Controller design (from scratch) using approximate dynamic programming , 2000, Proceedings of the 2000 IEEE International Symposium on Intelligent Control. Held jointly with the 8th IEEE Mediterranean Conference on Control and Automation (Cat. No.00CH37147).

[34]  Donald C. Wunsch,et al.  Neurocontroller alternatives for "fuzzy" ball-and-beam systems with nonuniform nonlinear friction , 2000, IEEE Trans. Neural Networks Learn. Syst..

[35]  Andrew G. Barto,et al.  Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density , 2001, ICML.

[36]  Douglas C. Hittle,et al.  Robust reinforcement learning control with static and dynamic stability , 2001 .

[37]  Radhakant Padhi,et al.  A systematic synthesis of optimal process control with neural networks , 2001, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).

[38]  Zhongwu Huang,et al.  ROBUST ADAPTIVE CRITIC BASED NEUROCONTROLLERS FOR MISSILES WITH MODEL UNCERTAINTIES , 2001 .

[39]  George G. Lendaris,et al.  Using DHP adaptive critic methods to tune a fuzzy automobile steering controller , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[40]  Simon Haykin,et al.  Intelligent Signal Processing , 2001 .

[41]  Radhakant Padhi,et al.  Adaptive-critic based optimal neuro control synthesis for distributed parameter systems , 2001, Autom..

[42]  George G. Lendaris,et al.  Dual heuristic programming for fuzzy control , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[43]  George G. Lendaris,et al.  A comparison of DHP based antecedent parameter tuning strategies for fuzzy control , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[44]  T. T. Shannon,et al.  Adaptive critic based adaptation of a fuzzy policy manager for a logistic system , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[45]  George G. Lendaris,et al.  Adaptive critic based design of a fuzzy motor speed controller , 2001, Proceeding of the 2001 IEEE International Symposium on Intelligent Control (ISIC '01) (Cat. No.01CH37206).

[46]  Andrew G. Barto,et al.  Lyapunov Design for Safe Reinforcement Learning , 2003, J. Mach. Learn. Res..

[47]  George G. Lendaris,et al.  Proposed framework for applying adaptive critics in real-time realm , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[48]  Nikita Barabanov,et al.  Stability analysis of discrete-time recurrent neural networks , 2002, IEEE Trans. Neural Networks.

[49]  N. E. Barabanov,et al.  Two alternative stability criteria for discrete-time RMLP , 2002, Proceedings of the 41st IEEE Conference on Decision and Control, 2002..

[50]  George G. Lendaris,et al.  Adaptive dynamic programming , 2002, IEEE Trans. Syst. Man Cybern. Part C.

[51]  George G. Lendaris,et al.  Controller design via adaptive critic and model reference methods , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[52]  Roberto A. Santiago,et al.  Accelerating critic learning in approximate dynamic programming via value templates and perceptual learning , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[53]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.