Predicting trust in human control of swarms via inverse reinforcement learning

In this paper, we study the model of human trust where an operator controls a robotic swarm remotely for a search mission. Existing trust models in human-in-the-loop systems are based on task performance of robots. However, we find that humans tend to make their decisions based on physical characteristics of the swarm rather than its performance since task performance of swarms is not clearly perceivable by humans. We formulate trust as a Markov decision process whose state space includes physical parameters of the swarm. We employ an inverse reinforcement learning algorithm to learn behaviors of the operator from a single demonstration. The learned behaviors are used to predict the trust level of the operator based on the features of the swarm.

[1]  Stephan Sand,et al.  Return-to-base navigation of robotic swarms in Mars exploration using DoA estimation , 2013, Proceedings ELMAR-2013.

[2]  Spring Berman,et al.  Design of control policies for spatially inhomogeneous robot swarms with application to commercial pollination , 2011, 2011 IEEE International Conference on Robotics and Automation.

[3]  Andrew Y. Ng,et al.  Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.

[4]  Katia P. Sycara,et al.  Human Interaction With Robot Swarms: A Survey , 2016, IEEE Transactions on Human-Machine Systems.

[5]  Craig W. Reynolds Flocks, herds, and schools: a distributed behavioral model , 1998 .

[6]  Anind K. Dey,et al.  Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.

[7]  Yue Wang,et al.  Trust-based human-robot interaction for multi-robot symbolic motion planning , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[8]  Ali Marjovi,et al.  Optimal spatial formation of swarm robotic gas sensors in odor plume finding , 2013, Autonomous Robots.

[9]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[10]  Gregory Dudek,et al.  OPTIMo: Online Probabilistic Trust Inference Model for Asymmetric Human-Robot Collaborations , 2015, 2015 10th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[11]  Jessie Y. C. Chen,et al.  A Meta-Analysis of Factors Affecting Trust in Human-Robot Interaction , 2011, Hum. Factors.

[12]  Lyuba Alboul,et al.  A Robot Swarm Assisting a Human Fire-Fighter , 2009, Adv. Robotics.

[13]  David W. Aha,et al.  Adapting Autonomous Behavior Using an Inverse Trust Estimation , 2014, ICCSA.

[14]  Holly A. Yanco,et al.  Potential measures for detecting trust changes , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).