论文信息 - Real-Time Interactive Reinforcement Learning for Robots

Real-Time Interactive Reinforcement Learning for Robots

It is our goal to understand the role real-time human interaction can play in machine learning algorithms for robots. In this paper we present Interactive Reinforcement Learning (IRL) as a plausible approach for training human-centric assistive robots by natural interaction. We describe an experimental platform to study IRL, pose questions arising from IRL, and discuss initial observations obtained during the development of our system.

[1] M. Argyle,et al. The Different Functions of Gaze , 1973 .

[2] J. Glidewell. The Social context of learning and development , 1977 .

[3] James U. Korein,et al. Robotics , 2018, IBM Syst. J..

[4] David A. Cohn,et al. Active Learning with Statistical Models , 1996, NIPS.

[5] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .

[6] Sebastian Thrun,et al. Lifelong robot learning , 1993, Robotics Auton. Syst..

[7] R. Krauss,et al. Nonverbal Behavior and Nonverbal Communication: What do Conversational Hand Gestures Tell Us? , 1996 .

[8] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[9] Maja J. Mataric,et al. Reinforcement Learning in the Multi-Robot Domain , 1997, Auton. Robots.

[10] Doina Precup,et al. Between MOPs and Semi-MOP: Learning, Planning & Representing Knowledge at Multiple Temporal Scales , 1998 .

[11] Luc Steels,et al. Aibo''s first words. the social learning of language and meaning. Evolution of Communication , 2002 .

[12] Peter Stone,et al. Cobot: A Social Reinforcement Learning Agent , 2001, NIPS.

[13] Guido Bugmann,et al. Mobile robot programming using natural language , 2002, Robotics Auton. Syst..

[14] Monica N. Nicolescu,et al. Natural methods for robot task learning: instructive demonstrations, generalization and practice , 2003, AAMAS '03.

[15] Nuttapong Chentanez,et al. Intrinsically Motivated Reinforcement Learning , 2004, NIPS.

[16] Andrea Lockerd Thomaz,et al. Tutelage and Collaboration for Humanoid Robots , 2004, Int. J. Humanoid Robotics.

[17] Andrea Lockerd Thomaz,et al. Tutelage and socially guided robot learning , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).