A parallel fuzzy inference model with distributed prediction scheme for reinforcement learning

This paper proposes a three-layered parallel fuzzy inference model called the reinforcement fuzzy neural network with distributed prediction scheme (RFNN-DPS), which performs reinforcement learning with a novel distributed prediction scheme. In RFNN-DPS, no additional predictor for the external reinforcement signal is needed; instead, the internal reinforcement information is distributed among the fuzzy rules (rule nodes). A single RFNN-DPS network therefore suffices to build a fuzzy logic system capable of both parallel inference and reinforcement learning. The prediction in RFNN-DPS is based on credit values stored in the fuzzy rule nodes, where each node holds a credit vector representing the reliability of the corresponding fuzzy rule. These credit values are used not only to predict the external reinforcement signal but also to supply a more informative internal reinforcement signal to each fuzzy rule itself. RFNN-DPS applies a credit-based exploratory algorithm that adjusts its internal state according to this internal reinforcement signal. During learning, the RFNN-DPS network is constructed by a single-step or multistep reinforcement learning algorithm based on the adaptive resonance theory (ART) concept. Our experimental results show that RFNN-DPS offers a simple network structure, fast learning, and an explicit representation of rule reliability.
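The abstract describes rule nodes that each hold a credit vector, used both to predict the external reinforcement signal and to generate per-rule internal reinforcement. The sketch below illustrates one plausible reading of that distributed prediction idea; all names (`RuleNode`, `predict_reinforcement`, `update_credits`), the discretization of reinforcement into levels, and the update rule are illustrative assumptions, not the paper's actual notation or equations.

```python
import numpy as np

class RuleNode:
    """One fuzzy rule node holding a credit vector over discretized
    reinforcement levels (an assumed representation, not the paper's)."""
    def __init__(self, n_levels):
        # Start with a uniform credit distribution: the rule's reliability
        # with respect to each reinforcement level is initially unknown.
        self.credit = np.full(n_levels, 1.0 / n_levels)

def predict_reinforcement(rule_nodes, firing, levels):
    """Predict the external reinforcement as a firing-strength-weighted
    expectation over each rule's credit vector (sketch only)."""
    w = firing / firing.sum()
    # Expected reinforcement under each rule's own credit distribution.
    per_rule = np.array([node.credit @ levels for node in rule_nodes])
    return float(w @ per_rule)

def update_credits(rule_nodes, firing, observed_level, lr=0.1):
    """Shift each rule's credit mass toward the observed reinforcement
    level, in proportion to how strongly that rule fired. The gap between
    a rule's credit and the observed outcome can serve as its internal
    reinforcement signal (again, an assumed formulation)."""
    for node, f in zip(rule_nodes, firing):
        target = np.zeros_like(node.credit)
        target[observed_level] = 1.0
        node.credit += lr * f * (target - node.credit)
        node.credit /= node.credit.sum()  # keep it a valid distribution
```

Because each rule keeps its own credit vector, the prediction is distributed across the rule base rather than concentrated in a separate critic network, which matches the abstract's claim that no additional predictor is required.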
