Learning non-myopically from human-generated reward