An approach to tune fuzzy controllers based on reinforcement learning

This paper proposes a new approach to tuning the parameters of fuzzy controllers based on reinforcement learning. The architecture of the proposed approach comprises a Q estimator network (QEN) and a Takagi-Sugeno-type fuzzy inference system (FIS). Unlike most existing fuzzy Q-learning approaches, which select an optimal action from a finite set of discrete actions, the proposed controller obtains the control output directly. With the proposed architecture, learning algorithms for all the parameters of the QEN and the FIS are developed based on temporal-difference methods together with the gradient-descent algorithm. The performance of the proposed design technique is illustrated by simulation studies of a vehicle longitudinal control system.
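As a rough illustration of the learning scheme the abstract describes, the Python sketch below pairs a zero-order Takagi-Sugeno FIS (producing a continuous control output) with a Gaussian radial-basis Q estimator network, and tunes both parameter sets by gradient descent on the squared temporal-difference error. All class names, network shapes, the finite-difference estimate of dQ/du, and the hyperparameters are assumptions made for this sketch, not the paper's exact formulation.

import numpy as np

class TakagiSugenoFIS:
    # Assumed zero-order Takagi-Sugeno fuzzy system: Gaussian memberships
    # per rule, constant rule consequents, output = normalized weighted sum.
    # The continuous output means no discrete action set is needed.
    def __init__(self, n_rules, state_dim, rng):
        self.centers = rng.normal(size=(n_rules, state_dim))
        self.widths = np.ones((n_rules, state_dim))
        self.consequents = rng.normal(scale=0.1, size=n_rules)

    def firing_strengths(self, x):
        d = (x - self.centers) / self.widths
        w = np.exp(-0.5 * np.sum(d ** 2, axis=1))
        return w / (np.sum(w) + 1e-12)  # normalized firing strengths

    def control(self, x):
        return float(np.dot(self.firing_strengths(x), self.consequents))

class QEstimatorNetwork:
    # Gaussian radial-basis network approximating Q(x, u); linear in its
    # output weights, so the TD gradient w.r.t. those weights is just the
    # feature vector.
    def __init__(self, n_units, input_dim, rng):
        self.centers = rng.normal(size=(n_units, input_dim))
        self.widths = np.ones((n_units, input_dim))
        self.weights = np.zeros(n_units)

    def features(self, x, u):
        z = np.append(x, u)
        d = (z - self.centers) / self.widths
        return np.exp(-0.5 * np.sum(d ** 2, axis=1))

    def q_value(self, x, u):
        return float(np.dot(self.weights, self.features(x, u)))

def td_update(fis, qen, x, u, r, x_next, gamma=0.95, alpha=0.05):
    # One TD(0) step: delta = r + gamma * Q(x', u') - Q(x, u), with u'
    # taken from the current FIS policy; then gradient descent on
    # 0.5 * delta**2 for the QEN weights.
    u_next = fis.control(x_next)
    delta = r + gamma * qen.q_value(x_next, u_next) - qen.q_value(x, u)
    qen.weights += alpha * delta * qen.features(x, u)
    # Tune the FIS consequents to increase Q via the chain rule through u;
    # dQ/du is estimated here by central differences (an assumption made
    # for brevity rather than the paper's derivation).
    eps = 1e-4
    dq_du = (qen.q_value(x, u + eps) - qen.q_value(x, u - eps)) / (2 * eps)
    fis.consequents += alpha * dq_du * fis.firing_strengths(x)
    return delta

# Toy usage: a single update on a two-dimensional state with a penalty reward.
rng = np.random.default_rng(0)
fis = TakagiSugenoFIS(n_rules=9, state_dim=2, rng=rng)
qen = QEstimatorNetwork(n_units=20, input_dim=3, rng=rng)  # input = (x, u)
x, x_next = np.array([0.5, -0.2]), np.array([0.4, -0.1])
u = fis.control(x)
delta = td_update(fis, qen, x, u, r=-abs(u), x_next=x_next)

The design choice worth noting is that the FIS is the actor and the QEN the critic: because the control output is continuous, improvement comes from following the estimated gradient of Q through the control signal rather than from comparing a finite list of candidate actions.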
