Accelerating Action Dependent Hierarchical Reinforcement Learning Through Autonomous Subgoal Discovery
暂无分享,去创建一个
[1] Pattie Maes,et al. Emergent Hierarchical Control Structures: Learning Reactive/Hierarchical Relationships in Reinforcement Environments , 1996 .
[2] Chris Drummond. Using a Case Base of Surfaces to Speed-Up Reinforcement Learning , 1997, ICCBR.
[3] R. Sutton. Between MDPs and Semi-MDPs : Learning , Planning , and Representing Knowledge at Multiple Temporal Scales , 1998 .
[4] Ronald E. Parr,et al. Hierarchical control and learning for markov decision processes , 1998 .
[5] IT Kee-EungKim. Solving Factored MDPs Using Non-homogeneous Partitions , 1998 .
[6] Craig Boutilier,et al. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage , 1999, J. Artif. Intell. Res..
[7] Thomas G. Dietterich. An Overview of MAXQ Hierarchical Reinforcement Learning , 2000, SARA.
[8] Andrew G. Barto,et al. Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density , 2001, ICML.
[9] Manfred Huber,et al. Subgoal Discovery for Hierarchical Reinforcement Learning Using Learned Policies , 2003 .
[10] Manfred Huber,et al. State Space Reduction For Hierarchical Reinforcement Learning , 2004, FLAIRS.