Combining Rule Induction and Reinforcement Learning

Reinforcement learning suffers from inefficiency when the number of potential solutions to be searched is large. This paper describes a method of improving reinforcement learning by applying rule induction in multi-agent systems. Knowledge captured by learned rules is used to reduce search space in reinforcement learning, allowing it to shorten learning time. The method is particularly suitable for agents operating in dynamically changing environments, in which fast response to changes is required. The method has been tested in trans- portation logistics domain in which agents represent vehicles being routed in a simple road network. Experimental results indicate that in this domain the method performs better than traditional Q-learning, as indicated by statistical comparison.

[1]  Jerzy W. Grzymala-Busse,et al.  A New Version of the Rule Induction System LERS , 1997, Fundam. Informaticae.

[2]  Sandip Sen,et al.  Evolving Beharioral Strategies in Predators and Prey , 1995, Adaption and Learning in Multi-Agent Systems.

[3]  Toshiharu Sugawara,et al.  On-Line Learning of Coordination Plans , 1993 .

[4]  Annie S. Wu,et al.  Evolving control for distributed micro air vehicles , 1999, Proceedings 1999 IEEE International Symposium on Computational Intelligence in Robotics and Automation. CIRA'99 (Cat. No.99EX375).

[5]  C. Lee Giles,et al.  Learning Communication for Multi-agent Systems , 2002, WRAC.

[6]  Sean Luke,et al.  Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.

[7]  Kenneth A. Kaufman,et al.  The AQ21 Natural Induction Program for Pattern Discovery: Initial Version and its Novel Features , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[8]  Allen Newell,et al.  Human Problem Solving. , 1973 .

[9]  Sandip Sen,et al.  Learning in multiagent systems , 1999 .

[10]  JOHANNES FÜRNKRANZ,et al.  Separate-and-Conquer Rule Learning , 1999, Artificial Intelligence Review.

[11]  R. Michalski Attributional Calculus: A Logic and Representation Language for Natural Induction , 2004 .

[12]  Bartlomiej Sniezynski,et al.  Agent Strategy Generation by Rule Induction in Predator-Prey Problem , 2009, ICCS.

[13]  Jan D. Gehrke,et al.  Traffic Prediction for Agent Route Planning , 2008, ICCS.

[14]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[15]  Lynne E. Parker,et al.  Multi-Robot Learning in a Cooperative Observation Task , 2000, DARS.

[16]  Ming Tan,et al.  Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.

[17]  Jan D. Gehrke,et al.  Designing a Simulation Middleware for FIPA Multiagent Systems , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[18]  Wojciech Kotlowski,et al.  Maximum likelihood rule ensembles , 2008, ICML '08.

[19]  Jacek Malec,et al.  Learning to evaluate conditional partial plans , 2007, ICMLA 2007.