暂无分享,去创建一个
Guan Wang | Matthew E. Taylor | Michael L. Littman | Bei Peng | James MacGlashan | Mark K. Ho | David L. Roberts | Robert Tyler Loftin | M. Littman | J. MacGlashan | R. Loftin | Bei Peng | Guan Wang | David L. Roberts
[1] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[2] Paul E. Utgoff,et al. A Teaching Method for Reinforcement Learning , 1992, ML.
[3] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[4] Leemon C. Baird,et al. Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.
[5] R. Miltenberger. Behavior Modification: Principles and Procedures , 1996 .
[6] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[7] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[8] Peter Stone,et al. A social reinforcement learning agent , 2001, AGENTS '01.
[9] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[10] Jude W. Shavlik,et al. Giving Advice about Preferred Actions to Reinforcement Learners Via Knowledge-Based Kernel Regression , 2005, AAAI.
[11] Andrea Lockerd Thomaz,et al. Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance , 2006, AAAI.
[12] C. Breazeal,et al. Robot learning via socially guided exploration , 2007, 2007 IEEE 6th International Conference on Development and Learning.
[13] Andrea Lockerd Thomaz,et al. Teachable robots: Understanding human teaching behavior to build more effective robot learners , 2008, Artif. Intell..
[14] Peter Stone,et al. Interactively shaping agents via human reinforcement: the TAMER framework , 2009, K-CAP '09.
[15] Shalabh Bhatnagar,et al. Natural actor-critic algorithms , 2009, Autom..
[16] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[17] Eduardo F. Morales,et al. Dynamic Reward Shaping: Training a Robot by Voice , 2010, IBERAMIA.
[18] Peter Stone,et al. Combining manual feedback with subsequent MDP reward signals for reinforcement learning , 2010, AAMAS.
[19] Farbod Fahimi,et al. Online human training of a myoelectric prosthesis controller via actor-critic reinforcement learning , 2011, 2011 IEEE International Conference on Rehabilitation Robotics.
[20] Bradley C. Love,et al. How Humans Teach Agents , 2012, Int. J. Soc. Robotics.
[21] W. Bradley Knox,et al. Learning from human-generated reward , 2012 .
[22] Cynthia Breazeal,et al. Training a Robot via Human Feedback: A Case Study , 2013, ICSR.
[23] Peter Stone,et al. Learning non-myopically from human-generated reward , 2013, IUI '13.
[24] Andrea Lockerd Thomaz,et al. Policy Shaping: Integrating Human Feedback with Reinforcement Learning , 2013, NIPS.
[25] David L. Roberts,et al. Learning behaviors via human-delivered discrete feedback: modeling implicit feedback strategies to speed up learning , 2015, Autonomous Agents and Multi-Agent Systems.
[26] Fiery Cushman,et al. Teaching with Rewards and Punishments: Reinforcement or Communication? , 2015, CogSci.