Human-robot collaboration in manufacturing: Quantitative evaluation of predictable, convergent joint action

New industrial robotic systems that operate in the same physical space as people highlight the emerging need for robots that can integrate seamlessly into human group dynamics. In this paper we build on our prior investigation, which evaluates the convergence of a robot computational teaming model and a human teammate's mental model, by computing the entropy rate of the Markov chain. We present and analyze the six out of thirty-six human trials where the human participant switched execution strategies while working with the robot. We conduct a post-hoc analysis of this dataset and show that the entropy rate appears to be sensitive to changes in the human strategy and reflects the resulting increase in uncertainty about the human next actions. We propose that these results provide first support that entropy rate may be used as a component of dynamic risk assessment, to generate risk-aware robot motions and action selections.

[1]  Pierre-Yves Oudeyer,et al.  Robotic clicker training , 2002, Robotics Auton. Syst..

[2]  VelosoManuela,et al.  A survey of robot learning from demonstration , 2009 .

[3]  Monica N. Nicolescu,et al.  Natural methods for robot task learning: instructive demonstrations, generalization and practice , 2003, AAMAS '03.

[4]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[5]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[6]  Andrea Lockerd Thomaz,et al.  Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance , 2006, AAAI.

[7]  C. Burke,et al.  The impact of cross-training on team effectiveness. , 2002, The Journal of applied psychology.

[8]  Maya Cakmak,et al.  Trajectories and keyframes for kinesthetic teaching: A human-robot interaction perspective , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[9]  Manuela M. Veloso,et al.  Teaching multi-robot coordination using demonstration of communication and state sharing , 2008, AAMAS.

[10]  Stefanos Nikolaidis,et al.  Human-robot cross-training: Computational formulation, modeling and evaluation of a human team training strategy , 2013, 2013 8th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[11]  Bruce Blumberg,et al.  Integrated learning for interactive synthetic characters , 2002, SIGGRAPH.

[12]  Stefan Schaal,et al.  Robot Learning From Demonstration , 1997, ICML.

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  Thomas M. Cover,et al.  The entropy of Markov trajectories , 1993, IEEE Trans. Inf. Theory.

[15]  N. Nilsson STUART RUSSELL AND PETER NORVIG, ARTIFICIAL INTELLIGENCE: A MODERN APPROACH , 1996 .

[16]  Nicholas Roy,et al.  Efficient model learning for dialog management , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[17]  Pieter Abbeel,et al.  Apprenticeship learning via inverse reinforcement learning , 2004, ICML.