Reinforcement Learning with Hierarchies of Machines
暂无分享,去创建一个
[1] Rodney A. Brooks,et al. A Robust Layered Control Syste For A Mobile Robot , 2022 .
[2] John N. Tsitsiklis,et al. Parallel and distributed computation , 1989 .
[3] Austin Tate,et al. O-Plan: The open Planning Architecture , 1991, Artif. Intell..
[4] Jane Yung-jen Hsu,et al. Synthesizing Efficient Agents from Partial Programs , 1991, ISMIS.
[5] Geoffrey E. Hinton,et al. Feudal Reinforcement Learning , 1992, NIPS.
[6] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .
[7] Satinder P. Singh,et al. Scaling Reinforcement Learning Algorithms by Learning Variable Temporal Resolution Models , 1992, ML.
[8] Nils J. Nilsson,et al. Reacting, Planning, and Learning in an Autonomous Agent , 1996, Machine Intelligence 14.
[9] Michael I. Jordan,et al. MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES , 1996 .
[10] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[11] Michael O. Duff,et al. Reinforcement Learning Methods for Continuous-Time Markov Decision Problems , 1994, NIPS.
[12] Michael I. Jordan,et al. Reinforcement Learning with Soft State Aggregation , 1994, NIPS.
[13] Thomas Dean,et al. Decomposition Techniques for Planning in Stochastic Domains , 1995, IJCAI.
[14] Richard S. Sutton,et al. Roles of Macro-Actions in Accelerating Reinforcement Learning , 1998 .
[15] Doina Precup,et al. Multi-time Models for Temporally Abstract Planning , 1997, NIPS.
[16] Shieu-Hong Lin,et al. Exploiting structure for planning and control , 1997 .
[17] Robert Givan,et al. Model Reduction Techniques for Computing Approximately Optimal Solutions for Markov Decision Processes , 1997, UAI.