1 Supervised Actor-Critic Reinforcement Learning
暂无分享,去创建一个
[1] Jude W. Shavlik,et al. Creating Advice-Taking Reinforcement Learners , 1998, Machine Learning.
[2] Richard S. Sutton,et al. Reinforcement Learning with Replacing Eligibility Traces , 2005, Machine Learning.
[3] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[4] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[5] Long Ji Lin,et al. Self-improving reactive agents based on reinforcement learning, planning and teaching , 1992, Machine Learning.
[6] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[7] Andrew G. Barto,et al. Lyapunov Design for Safe Reinforcement Learning , 2003, J. Mach. Learn. Res..
[8] Leslie Pack Kaelbling,et al. Effective reinforcement learning for mobile robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).
[9] Maja J. Matarić,et al. Sensory-motor primitives as a basis for imitation: linking perception to action and biology to robotics , 2002 .
[10] Douglas C. Hittle,et al. Robust reinforcement learning control with static and dynamic stability , 2001 .
[11] Andrew G. Barto,et al. Lyapunov-Constrained Action Sets for Reinforcement Learning , 2001, ICML.
[12] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[13] Craig Boutilier,et al. Implicit Imitation in Multiagent Reinforcement Learning , 1999, ICML.
[14] Stefan Schaal,et al. Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.
[15] Andrew G. Barto,et al. Reinforcement learning in motor control , 1998 .
[16] Roderic A. Grupen,et al. A feedback control structure for on-line learning tasks , 1997, Robotics Auton. Syst..
[17] Ashwin Ram,et al. Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces , 1997, Adapt. Behav..
[18] Judy A. Franklin,et al. Biped dynamic walking using reinforcement learning , 1997, Robotics Auton. Syst..
[19] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[20] Rüdiger Dillmann,et al. Building elementary robot skills from human demonstration , 1996, Proceedings of IEEE International Conference on Robotics and Automation.
[21] Paul E. Utgoff,et al. On integrating apprentice learning and reinforcement learning , 1996 .
[22] Marco Colombetti,et al. Robot Shaping: Developing Autonomous Agents Through Learning , 1994, Artif. Intell..
[23] Maja J. Mataric,et al. Reward Functions for Accelerated Learning , 1994, ICML.
[24] Paul E. Utgoff,et al. A Teaching Method for Reinforcement Learning , 1992, ML.
[25] Michael I. Jordan,et al. Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..
[26] Vijaykumar Gullapalli,et al. A stochastic reinforcement learning algorithm for learning real-valued functions , 1990, Neural Networks.
[27] Francis L. Merat,et al. Introduction to robotics: Mechanics and control , 1987, IEEE J. Robotics Autom..
[28] J. Meditch,et al. Applied optimal control , 1972, IEEE Transactions on Automatic Control.
[29] N. A. Bernshteĭn. The co-ordination and regulation of movements , 1967 .