Multi-Value-Functions: Efficient Automatic Action Hierarchies for Multiple Goal MDPs
[1] Editors, 1986, Brain Research Bulletin.
[2] John N. Tsitsiklis, et al. Parallel and distributed computation, 1989.
[3] Richard P. Lippmann, et al. Proceedings of the 1997 conference on Advances in neural information processing systems 10, 1990.
[4] Satinder P. Singh, et al. Transfer of Learning Across Compositions of Sequential Tasks, 1991, ML.
[5] Geoffrey E. Hinton, et al. Feudal Reinforcement Learning, 1992, NIPS.
[6] C. Atkeson, et al. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time, 1993, Machine Learning.
[7] Andrew W. Moore, et al. The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces, 2004, Machine Learning.
[8] Leslie Pack Kaelbling, et al. Hierarchical Learning in Stochastic Domains: Preliminary Results, 1993, ICML.
[9] Stuart J. Russell, et al. Reinforcement Learning with Hierarchies of Machines, 1997, NIPS.
[10] Doina Precup, et al. Multi-time Models for Temporally Abstract Planning, 1997, NIPS.
[11] Thomas G. Dietterich. The MAXQ Method for Hierarchical Reinforcement Learning, 1998, ICML.
[12] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition, 1999, J. Artif. Intell. Res.