ART2 neural network interacting with environment

It is common to train a neural network by using samples so that it can realize the required input-output characteristics. However, to obtain such samples is difficult or even impossible in some cases. This paper proposes the use of on-line reinforcement learning (RL) algorithms to train adaptive-resonance-theory-based (ART2) neural networks through interaction with environments, namely RL-ART2 neural network. By utilizing its adaptation ability to a dynamic environment, RL is able to evaluate and select ART2 classification patterns without training samples. The connection weights can be automatically modified according to the running effect evaluation of classification pattern of neural networks. The proposed novel RL-ART2 neural network is applied to implement the collaboration movement of mobile robots. Simulation results are presented to demonstrate the feasibility and performance of the proposed algorithm.

[1]  Paulo J. G. Lisboa,et al.  An approach based on the Adaptive Resonance Theory for analysing the viability of recommender systems in a citizen Web portal , 2007, Expert Syst. Appl..

[2]  Dai Ji,et al.  Intrusion Detection Based on An Improved ART2 Neural Network , 2005 .

[3]  Xu Yin-lin The Research on a Fractionizing and Fitting ART2 Neural Network with Supervise , 2004 .

[4]  Ralf Salomon,et al.  Implementation of Path Planning using Genetic Algorithms on Mobile Robots , 2006, 2006 IEEE International Conference on Evolutionary Computation.

[5]  Yu Wang,et al.  A method of data clustering based on improved algorithm of ART2 , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[6]  Zhang Minglu,et al.  The Improvement on ART-2 Neural Network Algorithm for Pattern Recognition , 2000 .

[7]  S. Grossberg,et al.  Adaptive pattern classification and universal recoding: I. Parallel development and coding of neural feature detectors , 1976, Biological Cybernetics.

[8]  Stephen Grossberg,et al.  Adaptive pattern classification and universal recoding: II. Feedback, expectation, olfaction, illusions , 1976, Biological Cybernetics.

[9]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[10]  Yan Chao ART2 Neural Networks with More Vigorous Vigilance Test Criterion , 2001 .

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  Howard C. Card,et al.  Vector quantization of images using modified adaptive resonance algorithm for hierarchical clustering , 2001, IEEE Trans. Neural Networks.

[13]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[14]  Ji Dai,et al.  Intrusion Detection Based on An Improved ART2 Neural Network , 2005, Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05).

[15]  Kyung Youn Kim,et al.  Model‐based fault detection and isolation method using ART2 neural network , 2003, Int. J. Intell. Syst..

[16]  Stephen Grossberg,et al.  Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system , 1991, Neural Networks.

[17]  Luo Jian-hong ART2 neural network with bidirectional matching mechanism , 2004 .