Adaptive fuzzy control of satellite attitude by reinforcement learning

The attitude control of a satellite is often characterized by a limit cycle, caused by measurement inaccuracies and noise in the sensor output. In order to reduce the limit cycle, a nonlinear fuzzy controller was applied. The controller was tuned by means of reinforcement learning without using any model of the sensors or the satellite. The reinforcement signal is computed as a fuzzy performance measure using a noncompensatory aggregation of two control subgoals. Convergence of the reinforcement learning scheme is improved by computing the temporal difference error over several time steps and adapting the critic and the controller at a lower sampling rate. The results show that an adaptive fuzzy controller can better cope with the sensor noise and nonlinearities than a standard linear controller.

[1]  J. van Amerongen,et al.  Limitations of adaptive critic control schemes , 1993 .

[2]  Charles W. Anderson,et al.  Strategy Learning with Multilayer Connectionist Representations , 1987 .

[3]  Chin-Teng Lin,et al.  A neural fuzzy control system with structure and parameter learning , 1995 .

[4]  Robert N. Lea,et al.  Space shuttle attitude control by reinforcement learning and fuzzy logic , 1993, [Proceedings 1993] Second IEEE International Conference on Fuzzy Systems.

[5]  L. Karsten,et al.  Optimal attitude control of satellites by artificial neural networks: a pilot study☆ , 1994 .

[6]  Hamid R. Berenji,et al.  Learning and tuning fuzzy logic controllers through reinforcements , 1992, IEEE Trans. Neural Networks.

[7]  Richard Bellman,et al.  Decision-making in fuzzy environment , 2012 .

[8]  Spyros G. Tzafestas,et al.  Neural fuzzy control systems with structure and parameter learning , 1996, J. Intell. Robotic Syst..

[9]  Rudolf Kruse,et al.  A fuzzy neural network learning fuzzy control rules and membership functions by fuzzy error backpropagation , 1993, IEEE International Conference on Neural Networks.

[10]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[12]  Nasa Contractor,et al.  DECISION-MAKING IN A FUZZY ENVIRONMENT , 1970 .