Coactive design of explainable agent-based task planning and deep reinforcement learning for human-UAVs teamwork

Abstract Unmanned Aerial Vehicles (UAVs) are useful in dangerous and dynamic tasks such as search-and-rescue, forest surveillance, and anti-terrorist operations. These tasks can be solved better through the collaboration of multiple UAVs under human supervision. However, it is still difficult for human to monitor, understand, predict and control the behaviors of the UAVs due to the task complexity as well as the black-box machine learning and planning algorithms being used. In this paper, the coactive design method is adopted to analyze the cognitive capabilities required for the tasks and design the interdependencies among the heterogeneous teammates of UAVs or human for coherent collaboration. Then, an agent-based task planner is proposed to automatically decompose a complex task into a sequence of explainable subtasks under constrains of resources, execution time, social rules and costs. Besides, a deep reinforcement learning approach is designed for the UAVs to learn optimal policies of a flocking behavior and a path planner that are easy for the human operator to understand and control. Finally, a mixed-initiative action selection mechanism is used to evaluate the learned policies as well as the human’s decisions. Experimental results demonstrate the effectiveness of the proposed methods.

[1]  Jie Huang,et al.  Adaptive Leader-Following Consensus for Multiple Euler–Lagrange Systems With an Uncertain Leader System , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Jeffrey M. Bradshaw,et al.  Dimensions of Adjustable Autonomy and Mixed-Initiative Interaction , 2003, Agents and Computational Autonomy.

[3]  Timothy Patten,et al.  Dec-MCTS: Decentralized planning for multi-robot active perception , 2019, Int. J. Robotics Res..

[4]  Bo He,et al.  Human-Centered Reinforcement Learning: A Survey , 2019, IEEE Transactions on Human-Machine Systems.

[5]  Zhiwen Zeng,et al.  Multi-agent distributed coordination control: Developments and directions via graph viewpoint , 2015, Neurocomputing.

[6]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[7]  Vijay Kumar,et al.  A Survey on Aerial Swarm Robotics , 2018, IEEE Transactions on Robotics.

[8]  Elad H. Kivelevitch,et al.  Genetic Fuzzy Trees and their Application Towards Autonomous Training and Control of a Squadron of Unmanned Combat Aerial Vehicles , 2015, Unmanned Syst..

[9]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[10]  Michael A. Goodrich,et al.  Ecological Interfaces for Improving Mobile Robot Teleoperation , 2007, IEEE Transactions on Robotics.

[11]  Lincheng Shen,et al.  Systemic design of distributed multi-UAV cooperative decision-making for multi-target tracking , 2019, Autonomous Agents and Multi-Agent Systems.

[12]  Charles E. Thorpe,et al.  Collaborative control: a robot-centric model for vehicle teleoperation , 2001 .

[13]  Kemal E. Tepe,et al.  Survey of Multi-agent Communication Strategies for Information Exchange and Mission Control of Drone Deployments , 2019, J. Intell. Robotic Syst..

[14]  Ronald C. Arkin,et al.  Governing Lethal Behavior in Autonomous Robots , 2009 .

[15]  Chao Yan,et al.  Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments , 2020, J. Intell. Robotic Syst..

[16]  Salil S. Kanhere,et al.  Multi-Agent Systems: A Survey , 2018, IEEE Access.

[17]  M. J. Johnson Coactive Design: Designing Support for Interdependence in Human-Robot Teamwork , 2014 .

[18]  Chang Wang,et al.  Adaptive human-in-the-loop multi-target recognition improved by learning , 2018 .

[19]  Twan Koolen,et al.  Team IHMC's Lessons Learned from the DARPA Robotics Challenge Trials , 2015, J. Field Robotics.

[20]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[21]  Maarten Sierhuis,et al.  Autonomy and interdependence in human-agent-robot teams , 2012, IEEE Intelligent Systems.

[22]  Saeid Nahavandi,et al.  System Design Perspective for Human-Level Agents Using Deep Reinforcement Learning: A Survey , 2017, IEEE Access.

[23]  Tjerk de Greef ePartners for Dynamic Task Allocation and Coordination , 2012 .