论文信息 - An Analysis of Generalization in the XCS Classifier System

An Analysis of Generalization in the XCS Classifier System

The XCS classifier system represents a major advance in learning classifier systems research because (1) it has a sound and accurate generalization mechanism, and (2) its learning mechanism is based on Q-learning, a recognized learning technique. In taking XCS beyond its very first environments and parameter settings, we show that, in certain difficult sequential (animat) environments, performance is poor. We suggest that this occurs because in the chosen environments, some conditions for proper functioning of the generalization mechanism do not hold, resulting in overly general classifiers that cause reduced performance. We hypothesize that one such condition is a lack of sufficiently wide exploration of the environment during learning. We show that if XCS is forced to explore its environment more completely, performance improves dramatically. We propose a technique, based on Sutton's Dyna concept, through which wider exploration would occur naturally. Separately, we demonstrate that the compactness of the representation evolved by XCS is limited by the number of instances of each generalization actually present in the environment. The paper shows that XCS's generalization mechanism is effective, but that the conditions under which it works must be clearly understood.

Pier Luca Lanzi | P. Lanzi

[1] Stewart W. Wilson. Generalization in the XCS Classifier System , 1998 .

[2] Dave Cliff,et al. Adding Temporary Memory to ZCS , 1994, Adapt. Behav..

[3] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[4] T. Kovacs. XCS Classifier System Reliably Evolves Accurate, Complete, and Minimal Representations for Boolean Functions , 1998 .

[5] V. Rich. Personal communication , 1989, Nature.

[6] Pier Luca Lanzi. A Model of the Environment to Avoid Local Learning , 1997 .

[7] Stewart W. Wilson. ZCS: A Zeroth Level Classifier System , 1994, Evolutionary Computation.

[8] Wolfgang Stolzmann,et al. Anticipatory Classifier Systems: An introduction , 2001 .

[9] Pier Luca Lanzi,et al. A Study of the Generalization Capabilities of XCS , 1997, ICGA.

[10] Pattie Maes,et al. Explore/Exploit Strategies in Autonomy , 1996 .

[11] Stewart W. Wilson. Classifier Fitness Based on Accuracy , 1995, Evolutionary Computation.

[12] Rick L. Riolo,et al. Lookahead planning and latent learning in a classifier system , 1991 .

[13] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .