Fuzzy PID controller design using Q-learning algorithm with a manipulated reward function

In this paper we propose a manipulated reward function for the Q-learning algorithm which is a reinforcement learning technique and utilize the proposed algorithm to tune the parameters of the input-output membership functions of fuzzy logic controllers. The use of a reward signal to formalize the idea of a goal is one of the most distinctive features of reinforcement learning. To improve both the performance and convergence criteria of the mentioned algorithm we propose a fuzzy structure for the reward function. In order to demonstrate the effectiveness of the algorithm we apply it to two second order linear systems with and without time delay and finally a nonlinear system will be examined.

[1]  M. Mizumoto Realization of PID controls by fuzzy control methods , 1995 .

[2]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[3]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[5]  Masayoshi Tomizuka,et al.  Fuzzy gain scheduling of PID controllers , 1993, IEEE Trans. Syst. Man Cybern..

[6]  Dr. Rainer Palm,et al.  Model Based Fuzzy Control , 1997, Springer Berlin Heidelberg.

[7]  徐玉滨,et al.  ANFIS (Adaptive Neural Fuzzy Inference System) indoor positioning method based on improved GA(Genetic Algorithm) optimization in WLAN (Wireless Local Area Network) environment , 2011 .

[8]  Pierre-Yves Glorennec,et al.  Tuning fuzzy PD and PI controllers using reinforcement learning. , 2010, ISA transactions.

[9]  Richard S. Sutton,et al.  Reinforcement Learning is Direct Adaptive Optimal Control , 1992, 1991 American Control Conference.

[10]  Engin Yesil,et al.  Self-tuning of PID-type fuzzy logic controller coefficients via relative rate observer , 2003 .

[11]  Karl Johan Åström,et al.  PID Controllers: Theory, Design, and Tuning , 1995 .

[12]  Chang Chieh Hang,et al.  Parallel structure and tuning of a fuzzy PID controller , 2000, Autom..

[13]  Bart De Schutter,et al.  Reinforcement Learning and Dynamic Programming Using Function Approximators , 2010 .

[14]  Charles W. Anderson,et al.  Strategy Learning with Multilayer Connectionist Representations , 1987 .

[15]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[16]  S. He,et al.  Fuzzy self-tuning of PID controllers , 1993 .

[17]  Ronald R. Yager,et al.  Essentials of fuzzy modeling and control , 1994 .

[18]  Chang Chieh Hang,et al.  Tuning and analysis of a fuzzy PI controller based on gain and phase margins , 1998, IEEE Trans. Syst. Man Cybern. Part A.

[19]  Ebrahim Mamdani,et al.  Applications of fuzzy algorithms for control of a simple dynamic plant , 1974 .

[20]  Andrea Bonarini,et al.  Evolutionary Learning of Fuzzy rules: competition and cooperation , 1996 .

[21]  Hung-Yuan Chung,et al.  A self-learning fuzzy logic controller using genetic algorithms with reinforcements , 1997, IEEE Trans. Fuzzy Syst..

[22]  Takanori Tagami,et al.  A genetic algorithm based minimax optimal design of robust I-PD controller , 1996 .

[23]  Si-Zhao Joe Qin,et al.  A multiregion fuzzy logic controller for nonlinear process control , 1994, IEEE Trans. Fuzzy Syst..

[24]  Jan Jantzen,et al.  Foundations of Fuzzy Control: A Practical Approach , 2013 .