Integral Reinforcement Learning-Based Multi-Robot Minimum Time-Energy Path Planning Subject to Collision Avoidance and Unknown Environmental Disturbances

In this letter, we study the online multi-robot minimum time-energy path planning problem subject to collision avoidance and input constraints in an unknown environment. We develop an online adaptive solution for the problem using integral reinforcement learning (IRL). This is achieved through transforming the finite-horizon minimum time-energy problem with input constraints to an approximate infinite-horizon optimal control problem. To achieve collision avoidance, we incorporate artificial potential fields into the approximate cost function. We develop an IRL-based optimal control strategy and prove its convergence. The theoretical results are verified through simulation studies.

[1]  Yan Wan,et al.  Spatiotemporal scenario data-driven decision for the path planning of multiple UASs , 2019, SCOPE@CPSIoTWeek.

[2]  Frank L. Lewis,et al.  Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..

[3]  S. Lyashevskiy Constrained optimization and control of nonlinear systems: new results in optimal control , 1996, Proceedings of 35th IEEE Conference on Decision and Control.

[4]  A. Matveev,et al.  Algorithms for collision-free navigation of mobile robots in complex cluttered environments: a survey , 2014, Robotica.

[5]  Frank L. Lewis,et al.  Nonlinear H2/H-Infinity Constrained Feedback Control: A Practical Design Approach Using Neural Networks (Advances in Industrial Control) , 2006 .

[6]  Bérénice Mettler,et al.  Survey of Motion Planning Literature in the Presence of Uncertainty: Considerations for UAV Guidance , 2012, J. Intell. Robotic Syst..

[7]  Oskar Wigström,et al.  Energy efficient multi-robot coordination , 2016 .

[8]  J. Rubio,et al.  Adaptive Path Planning for Autonomous UAV Oceanic Search Missions , 2004 .

[9]  Lynne E. Parker,et al.  Path Planning and Motion Coordination in Multiple Mobile Robot Teams , 2009 .

[10]  F. Lewis,et al.  Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers , 2012, IEEE Control Systems.

[11]  Dianwei Qian,et al.  Multi-Robot Path Planning Method Using Reinforcement Learning , 2019, Applied Sciences.

[12]  Amr Mohamed,et al.  Optimal path planning for unmanned ground vehicles using potential field method and optimal control method , 2018 .

[13]  S. Lyshevski Optimal control of nonlinear continuous-time systems: design of bounded controllers via generalized nonquadratic functionals , 1998, Proceedings of the 1998 American Control Conference. ACC (IEEE Cat. No.98CH36207).

[14]  Yan Wan,et al.  Design and implementation of aerial communication using directional antennas: learning control in unknown communication environments , 2019, IET Control Theory & Applications.

[15]  J. Karl Hedrick,et al.  Optimal path planning in a constant wind with a bounded turning rate , 2005 .

[16]  Wei Luo,et al.  Deep-Sarsa Based Multi-UAV Path Planning and Obstacle Avoidance in a Dynamic Environment , 2018, ICSI.