Economic Hierarchical Q-Learning
暂无分享,去创建一个
[1] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[2] David Andre,et al. State abstraction for programmable reinforcement learning agents , 2002, AAAI/IAAI.
[3] John H. Holland,et al. Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems , 1995 .
[4] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.
[5] Igor Durdanovic,et al. Evolution of Cooperative Problem Solving in an Artificial Economy , 2000, Neural Computation.
[6] Thomas Dean,et al. Decomposition Techniques for Planning in Stochastic Domains , 1995, IJCAI.
[7] Thomas G. Dietterich. State Abstraction in MAXQ Hierarchical Reinforcement Learning , 1999, NIPS.
[8] Stuart J. Russell,et al. A compact, hierarchically optimal Q-function decomposition , 2006, UAI 2006.