Cooperative Queuing Policies for Effective Human-Multi-Robot Interaction

We consider multi-robot applications in which a team of robots can ask a human operator to intervene in difficult situations. As the number of requests grows, team members must wait for the operator's attention, and the operator becomes a bottleneck for the system. Our aim in this context is to make the robots learn cooperative strategies that decrease the time spent waiting for the operator. In particular, we consider a queuing model in which robots decide whether or not to join the queue, and we use multi-robot learning to estimate the best cooperative policy. In more detail, we formalize the problem as a Decentralized Markov Decision Process and provide a suitable state representation, so as to apply an independent learners approach. We evaluate the proposed method in a robotic water monitoring simulation and empirically show that our approach can significantly improve team performance while remaining computationally tractable.
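To make the join-or-balk decision and the independent learners idea concrete, the following is a minimal sketch, not the method evaluated in the paper. It assumes tabular Q-learning, a state that consists only of the (capped) queue length, a shared team reward equal to the negative total cost of a time step, and an operator who serves one request per step. All names and parameters (`IndependentQLearner`, `simulate`, `request_prob`, `balk_cost`, `max_queue`) are hypothetical and chosen for illustration.

```python
import random
from collections import defaultdict

BALK, JOIN = 0, 1  # action indices: handle the problem alone, or queue for the operator


class IndependentQLearner:
    """Tabular Q-learner that treats the other robots as part of the environment."""

    def __init__(self, alpha=0.1, gamma=0.9, epsilon=0.1, rng=None):
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.rng = rng or random.Random()
        # q[state] = [value of BALK, value of JOIN]; state is the observed queue length
        self.q = defaultdict(lambda: [0.0, 0.0])

    def act(self, state):
        # Epsilon-greedy choice; ties are resolved in favour of BALK.
        if self.rng.random() < self.epsilon:
            return self.rng.choice((BALK, JOIN))
        values = self.q[state]
        return JOIN if values[JOIN] > values[BALK] else BALK

    def update(self, state, action, reward, next_state):
        # Standard one-step Q-learning update.
        target = reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


def simulate(n_robots=3, steps=2000, request_prob=0.3,
             balk_cost=3.0, max_queue=5, seed=0):
    """Toy queue (assumed dynamics): the operator serves one request per step."""
    rng = random.Random(seed)
    learners = [IndependentQLearner(rng=rng) for _ in range(n_robots)]
    queue = 0
    for _ in range(steps):
        state = min(queue, max_queue)  # every robot observes the queue at step start
        choices, team_cost = {}, 0.0
        for i, learner in enumerate(learners):
            if rng.random() < request_prob:  # robot i needs help this step
                action = learner.act(state)
                choices[i] = action
                if action == JOIN:
                    queue += 1
                    team_cost += queue      # assumed waiting cost: position in queue
                else:
                    team_cost += balk_cost  # assumed fixed cost of not asking
        queue = max(0, queue - 1)           # operator resolves one request
        next_state = min(queue, max_queue)
        for i, action in choices.items():
            # Shared team reward: every acting robot receives the same signal.
            learners[i].update(state, action, -team_cost, next_state)
    return learners


if __name__ == "__main__":
    for idx, learner in enumerate(simulate()):
        policy = {s: ("JOIN" if v[JOIN] > v[BALK] else "BALK")
                  for s, v in sorted(learner.q.items())}
        print(f"robot {idx}: {policy}")
```

Each learner keeps its own table and never models its teammates explicitly, which is what keeps the approach tractable as the team grows. The cost model here is a placeholder; a realistic state representation and reward would need to come from the application.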
