A reinforcement learning fuzzy controller for set-point regulator problems

Intelligent control refers to controllers that can analyze their performance and make necessary changes to their behavior in order to satisfy certain predefined control goals. This paper describes a self-learning controller model that can efficiently learn the control law for complex systems through reinforcement learning techniques and dynamic programming-like algorithms. The controller is applied to a class of problems called general set-point regulator problems in which the objective is to drive the system to the set-point while optimizing some performance objective function, making no a priori assumptions about the dynamics of the plant or its optimal trajectory. The relevant tasks for a self-learning controller are discussed. Learning is accomplished via incremental, online dynamic programming-like algorithms. Both temporal differences and Q-learning are used in the learning algorithm. Experimental results with both are reported on the inverted pendulum balancing problem, the power system stabilization problem, and the tethered satellite system retrieval problem.

[1]  Om P. Malik,et al.  Self-tuning stabiliser for a multimachine power system , 1986 .

[2]  Augustine O. Esogbue,et al.  Optimal clustering of fuzzy data via fuzzy dynamic programming , 1986 .

[3]  Takashi Hiyama,et al.  Fuzzy logic control scheme for on-line stabilization of multi-machine power system , 1991 .

[4]  Hamid R. Berenji,et al.  A reinforcement learning--based architecture for fuzzy logic control , 1992, Int. J. Approx. Reason..

[5]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[6]  James Arthur Murrell A statistical fuzzy associative learning approach to intelligent control , 1993 .

[7]  Augustine O. Esogbue,et al.  A fuzzy adaptive controller using reinforcement learning neural networks , 1993, [Proceedings 1993] Second IEEE International Conference on Fuzzy Systems.

[8]  Hamid R. Berenji,et al.  Learning and tuning fuzzy logic controllers through reinforcements , 1992, IEEE Trans. Neural Networks.

[9]  Adel A. Ghandakly,et al.  A parametrically optimized self-tuning regulator for power system stabilizers , 1992 .

[10]  Hamid R. Berenji,et al.  Fuzzy Reinforcement Learning and Dynamic Programming , 1993, Fuzzy Logic in Artificial Intelligence.

[11]  Chin-Teng Lin,et al.  Reinforcement structure/parameter learning for neural-network-based fuzzy logic control systems , 1993, [Proceedings 1993] Second IEEE International Conference on Fuzzy Systems.

[12]  W. E. Hearnes,et al.  Constructive experiments with a new fuzzy adaptive controller , 1994, NAFIPS/IFIS/NASA '94. Proceedings of the First International Joint Conference of The North American Fuzzy Information Processing Society Biannual Conference. The Industrial Fuzzy Control and Intellige.

[13]  Akhtar Kalam,et al.  A fuzzy logic controller applied to power system stabilizer for a synchronous machine power system , 1992, TENCON'92 - Technology Enabling Tomorrow.

[14]  H. R. Berenji,et al.  Fuzzy Q-learning: a new approach for fuzzy dynamic programming , 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.

[15]  O. Malik,et al.  A fuzzy logic based stabilizer for a synchronous machine , 1991 .

[16]  Yuan-Yih Hsu,et al.  A fuzzy controller for generator excitation control , 1993, IEEE Trans. Syst. Man Cybern..

[17]  Augustine O. Esogbue,et al.  Advances in fuzzy adaptive control , 1994 .

[18]  Andrew G. Barto,et al.  Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[19]  Yuan-Yih Hsu,et al.  Design of a Proportional-Integral Power System Stabilizer , 1986, IEEE Transactions on Power Systems.

[20]  C. S. George Lee,et al.  Reinforcement structure/parameter learning for neural-network-based fuzzy logic control systems , 1994, IEEE Trans. Fuzzy Syst..