Enhancing Inference in Relational Reinforcement Learning Via Truth Maintenance Systems

Computational complexity is still a challenging problem for intelligent systems operating in compound environments. To tackle it, an agent has to deal with perceptual information intelligently. In this paper, we propose an efficient and adaptive reasoning system based on Adaptive Logic Interpreter reasoning system, a mechanism for guiding inference through relational reinforcement learning, and a variation of Truth Maintenance Systems to speed up the inference. Relational reinforcement learning guides the inference toward the most rewarding parts of the knowledge base and truth maintenance system maintains beliefs, avoids repetitive inferences and reduces the state space. Empirical results demonstrate higher performance than the basic approach in terms of number of inferred instances, average reward, and average reward accuracy.

[1]  Luc De Raedt,et al.  Relational Reinforcement Learning , 2001, Machine Learning.

[2]  Yoram Singer,et al.  A simple, fast, and effective rule learner , 1999, AAAI 1999.

[3]  Raymond J. Mooney,et al.  Combining FOIL and EBG to Speed-up Logic Programs , 1993, IJCAI.

[4]  Michael R. Genesereth,et al.  Logic programming , 1985, CACM.

[5]  Sanja Vranes,et al.  Using truth maintenance systems. A tutorial , 1994, IEEE Expert.

[6]  Jon Doyle,et al.  A Truth Maintenance System , 1979, Artif. Intell..

[7]  Michael J. Witbrock,et al.  Guiding Inference with Policy Search Reinforcement Learning , 2007, FLAIRS.

[8]  Sachiyo Arai,et al.  Guiding Inference Through Relational Reinforcement Learning , 2005, ILP.

[9]  Stuart J. Russell,et al.  Principles of Metareasoning , 1989, Artif. Intell..

[10]  Pat Langley,et al.  An architecture for persistent reactive behavior , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[11]  Johan de Kleer,et al.  An Assumption-Based TMS , 1987, Artif. Intell..

[12]  John L. Cuadrado,et al.  A Truth Maintenance System Based on Stable Models , 1989, NACLP.

[13]  Steven Minton,et al.  Quantitative Results Concerning the Utility of Explanation-based Learning , 1988, Artif. Intell..

[14]  Charles L. Forgy,et al.  Rete: a fast algorithm for the many pattern/many object pattern match problem , 1991 .