Guiding a relational learning agent with a learning classifier system

This paper investigates a collaborative strategy between an XCS learning classifier system (LCS) and a relational learning (RL) agent. The problem is to learn a relational policy for a stochastic Markovian decision process. In the proposed method, the XCS agent improves the performance of the RL agent by filtering the samples used at the induction step. The research shows that, under these conditions, one of the main benefits of the XCS algorithm comes from selecting the examples for relational learning using an estimate of the accuracy of the predicted value at each state-action pair. This kind of transfer learning is valuable because the two agents have complementary strengths: the RL agent incrementally induces a high-level description of a policy, while the LCS agent adapts to changes in the environment.
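The filtering step described above can be sketched as follows. This is an illustrative sketch, not the paper's implementation: the `Sample` fields and the `min_accuracy` threshold are assumptions introduced here to show how state-action samples might be kept or discarded based on the XCS accuracy estimate before being handed to the relational learner.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Sample:
    """A state-action pair produced by the LCS agent (fields are assumed)."""
    state: str          # relational state description
    action: str         # action taken in that state
    value: float        # value predicted by the matching classifier
    accuracy: float     # XCS accuracy estimate for the prediction, in [0, 1]

def filter_samples(samples: List[Sample], min_accuracy: float = 0.9) -> List[Sample]:
    """Keep only samples whose predicted value the XCS considers accurate.

    The filtered list is what would be passed to the relational
    induction step, per the strategy described in the abstract.
    """
    return [s for s in samples if s.accuracy >= min_accuracy]

# Toy usage: one accurate sample survives, one inaccurate sample is dropped.
samples = [
    Sample(state="on(a,b)", action="move(a,table)", value=0.8, accuracy=0.95),
    Sample(state="on(b,c)", action="move(b,a)", value=0.4, accuracy=0.50),
]
kept = filter_samples(samples)
```

The threshold value is a free parameter of this sketch; the paper's actual selection criterion is its accuracy estimation, of which this comparison is only a minimal stand-in.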