Optimization methods for brain-like intelligent control

This paper defines a more restricted class of designs, to be called "brain-like intelligent control". The paper explains the definition and concepts behind it, describes benefits in control engineering, emphasizing stability, mentions 4 groups who have implemented such designs, for the first time, since late 1993, and discusses the brain as a member of this class, one which suggests features to be sought in future research. These designs involve approximate dynamic programming-dynamic programming approximated in generic ways to make it affordable on large-scale nonlinear control problems. These designs are based on learning. They permit a neural net implementation-like the brain but do not require it. They include some but not all "reinforcement learning" or "adaptive critic" designs.

[1]  Paul J. Werbos,et al.  Neural networks for control and system identification , 1989, Proceedings of the 28th IEEE Conference on Decision and Control,.

[2]  Bernard Widrow,et al.  Punish/Reward: Learning with a Critic in Adaptive Threshold Systems , 1973, IEEE Trans. Syst. Man Cybern..

[3]  Paul J. Werbos,et al.  Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[4]  David Q. Mayne,et al.  Differential dynamic programming , 1972, The Mathematical Gazette.

[5]  Paul J. Werbos,et al.  Maximizing long-term gas industry profits in two minutes in Lotus using neural network methods , 1989, IEEE Trans. Syst. Man Cybern..

[6]  Roberto A. Santiago,et al.  Adaptive critic designs: A case study for neurocontrol , 1995, Neural Networks.

[7]  Paul J. Werbos,et al.  The Roots of Backpropagation: From Ordered Derivatives to Neural Networks and Political Forecasting , 1994 .

[8]  Paul J. Werbos,et al.  The roots of backpropagation , 1994 .

[9]  G Sachs,et al.  Robust control concept for a hypersonic test vehicle , 1995 .

[10]  M. Nicolelis,et al.  Induction of immediate spatiotemporal changes in thalamic networks by peripheral block of ascending cutaneous information , 1993, Nature.

[11]  S. N. Balakrishnan,et al.  Adaptive critic based neural networks for aircraft control , 1995 .

[12]  S. N. Balakrishnan,et al.  Adaptive-critic based neural networks for aircraft optimal control , 1996 .

[13]  Paul J. Werbos,et al.  Elastic Fuzzy Logic: A Better Fit to Neurocontrol and True Intelligence , 1993, J. Intell. Fuzzy Syst..

[14]  P. J. Werbos Optimal neurocontrol: practical benefits, new results and biological evidence , 1995, Proceedings of WESCON'95.

[15]  Joel L. Davis,et al.  Adaptive Critics and the Basal Ganglia , 1995 .

[16]  Paul J. Werbos,et al.  Neurocontrol and elastic fuzzy logic: capabilities, concepts, and applications , 1993, IEEE Trans. Ind. Electron..