R-POPTVR: A Novel Reinforcement-Based POPTVR Fuzzy Neural Network for Pattern Classification

In general, a fuzzy neural network (FNN) is characterized by its learning algorithm and its linguistic knowledge representation. However, it does not necessarily interact with its environment when the training data is assumed to be an accurate description of the environment under consideration. In interactive problems, it is more appropriate for an agent to learn from its own experience through interactions with the environment, i.e., by reinforcement learning. In this paper, three clustering algorithms are developed based on the reinforcement learning paradigm: the REINFORCE clustering technique I (RCT-I), the REINFORCE clustering technique II (RCT-II), and the episodic REINFORCE clustering technique (ERCT). Because the clustering process is influenced by the reinforcement signal, the resulting clusters are described more accurately. The pseudo-outer product truth value restriction (POPTVR) network is a fuzzy neural network that integrates the truth value restriction (TVR) inference scheme into a five-layer feedforward neural network. Integrating the RCT-I, the RCT-II, and the ERCT within the POPTVR forms the RPOPTVR-I, the RPOPTVR-II, and the ERPOPTVR, respectively. The Iris, Phoneme, and Spiral data sets are used for benchmarking. For both the Iris and Phoneme data, the RPOPTVR yields higher classification results than the original POPTVR and the modified POPTVR over the three test trials. For the Spiral data set, the RPOPTVR-II outperforms the others by a margin of at least 5.8% over multiple test trials. The three reinforcement-based clustering techniques applied to the POPTVR network exhibit the trial-and-error search characteristic that yields higher qualitative performance.
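The abstract does not spell out the RCT-I, RCT-II, or ERCT update rules, so the following is only a minimal sketch of the general REINFORCE idea they are named after, not the paper's method. It treats a single cluster centre as the mean of a Gaussian stochastic unit and adjusts it with a REINFORCE-style rule, where the eligibility of the mean is (y - mu) / sigma^2. The reward function (negative squared distance to a sampled data point), the running-average baseline, and all names and parameter values are assumptions chosen for illustration.

```python
import random


def reinforce_gaussian_step(mu, sigma, reward_fn, baseline, lr, rng):
    """One REINFORCE-style update of a Gaussian unit's mean (sigma held fixed)."""
    y = rng.gauss(mu, sigma)      # stochastic output: a candidate centre
    r = reward_fn(y)              # scalar reinforcement signal
    # Update: lr * (r - baseline) * d ln g / d mu, where for a Gaussian
    # density g the eligibility d ln g / d mu equals (y - mu) / sigma**2.
    mu_new = mu + lr * (r - baseline) * (y - mu) / sigma ** 2
    return mu_new, r


def fit_centre(data, steps=5000, sigma=0.5, lr=0.01, seed=0):
    """Move one centre towards the data by trial and error (illustrative only)."""
    rng = random.Random(seed)
    mu = 0.0          # assumed initial centre
    baseline = 0.0    # running average of past rewards (assumed baseline)
    for _ in range(steps):
        x = rng.choice(data)
        # Assumed reward: candidates closer to the sampled point score higher.
        reward_fn = lambda y, x=x: -((y - x) ** 2)
        mu, r = reinforce_gaussian_step(mu, sigma, reward_fn, baseline, lr, rng)
        baseline += 0.1 * (r - baseline)
    return mu


if __name__ == "__main__":
    print(fit_centre([2.8, 3.0, 3.2]))
```

The centre is never told where the data lies; it only samples candidates and receives a scalar reward, which is the trial-and-error search characteristic the abstract refers to. In expectation the update follows the gradient of the expected reward, so the centre drifts towards the data mean while still fluctuating from step to step.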
