Reinforcement Learning and Function Approximation

Relational reinforcement learning combines traditional reinforcement learning with a strong emphasis on a relational (rather than attribute-value) representation. Earlier work used relational reinforcement learning on a learning version of the classic Blocks World planning problem (a version where the learner does not know what the result of taking an action will be). “Structural” learning results have been obtained, such as learning in a mixed 3–5 block environment and being able to perform in a 3 or 10 block environment. Here we instead take a function approximation approach to reinforcement learning for this same problem. We obtain similar learning accuracies, with much better running times, allowing us to consider much larger problem sizes. For instance, we can train on a mix of 3–7 blocks and then perform well on worlds with 100–800 blocks—using less running time than the relational method required for 3–10 blocks.

[1]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[2]  Manuela M. Veloso,et al.  Team-partitioned, opaque-transition reinforcement learning , 1999, AGENTS '99.

[3]  Geoffrey E. Hinton,et al.  Reinforcement Learning with Factored States and Actions , 2004, J. Mach. Learn. Res..

[4]  Pat Langley,et al.  Elements of Machine Learning , 1995 .

[5]  Pentti Kanerva,et al.  Sparse distributed memory and related models , 1993 .

[6]  Stuart J. Russell,et al.  Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.

[7]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[8]  John N. Tsitsiklis,et al.  Feature-based methods for large scale dynamic programming , 2004, Machine Learning.

[9]  Thomas Gärtner,et al.  Graph kernels and Gaussian processes for relational reinforcement learning , 2006, Machine Learning.

[10]  Kurt Driessens,et al.  Relational Reinforcement Learning , 1998, Machine-mediated learning.

[11]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[12]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[13]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[14]  Manuela M. Veloso,et al.  Team-Partitioned, Opaque-Transition Reinforced Learning , 1998, RoboCup.

[15]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[16]  Robert Givan,et al.  Relational Reinforcement Learning: An Overview , 2004, ICML 2004.