AUV Path Following Control using Deep Reinforcement Learning Under the Influence of Ocean Currents

Path following control of Autonomous underwater Vehicles (AUVs) is considered an important and challenging task, especially when AUVs are designed as underactuated systems with saturated inputs, and the ocean currents and disturbances in the environments are considered. Based on this premise, this work proposes a novel path following control approach based on deep reinforcement learning (DRL). Different from the existing DRL-based control methods, where only one single controller is used to control the motion of AUV, our proposed control approach consists of two subsystems, for controlling the velocity and steering of AUV, independently. Specifically, the deep deterministic policy gradient (DDPG) algorithm is adopted to achieve satisfactory control in the surge velocity control subsystem, and the soft actor-critic (SAC) algorithm is employed in the steering control subsystem to achieve high-accuracy path following control. Besides, we derive an enhanced Line-of-Sight (LOS) guidance method to make the AUV generate a drift angle to compensate for the sideslip induced by sway velocity of AUV and ocean currents. The numerical simulation results validate the advantages of the proposed DDPG-SAC-based control approach in great generalization ability, improved anti-interference capability and high robustness.

[1]  A. Joseph The History of Measuring Ocean Currents , 2014 .

[2]  Hongde Qin,et al.  A novel adaptive second order sliding mode path following control for a portable AUV , 2018 .

[3]  Timothy Prestero,et al.  Verification of a six-degree of freedom simulation model for the REMUS autonomous underwater vehicle , 2001 .

[4]  Rogelio Lozano,et al.  Output based backstepping control for trajectory tracking of an Autonomous Underwater Vehicle , 2016, 2016 American Control Conference (ACC).

[5]  Saeed Balochian,et al.  Modeling and control of autonomous underwater vehicle (AUV) in heading and depth attitude via self-adaptive fuzzy PID controller , 2015 .

[6]  Cong Wang,et al.  Three-Dimensional Path Following of an Underactuated AUV Based on Neuro-Adaptive Command Filtered Backstepping Control , 2018, IEEE Access.

[7]  Antony Joseph,et al.  Measuring Ocean Currents: Tools, Technologies, and Data , 2013 .

[8]  Bo He,et al.  Fuzzy PID Motion Control Based on Extended State Observer for AUV , 2019, 2019 IEEE Underwater Technology (UT).

[9]  Reza Malekian,et al.  Path Following Optimization for an Underactuated USV Using Smoothly-Convergent Deep Reinforcement Learning , 2020, IEEE Transactions on Intelligent Transportation Systems.

[10]  Qiang Ma,et al.  Three-Dimensional Path Following of an Underactuated AUV Based on Fuzzy Backstepping Sliding Mode Control , 2018, Int. J. Fuzzy Syst..

[11]  Cheng Wu,et al.  Multi Pseudo Q-Learning-Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Sergey Levine,et al.  Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.

[13]  Qixin Sha,et al.  Deep Interactive Reinforcement Learning for Path Following of Autonomous Underwater Vehicle , 2020, IEEE Access.

[14]  Shiji Song,et al.  End-to-end sensorimotor control problems of AUVs with deep reinforcement learning , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[15]  Yeung Sam Hung,et al.  Global Path-Planning for Constrained and Optimal Visual Servoing , 2007, IEEE Transactions on Robotics.

[16]  Hao Xu,et al.  AUV path following controlled by modified Deep Deterministic Policy Gradient , 2020 .

[17]  Anastasios M. Lekkas,et al.  Curved Path Following with Deep Reinforcement Learning: Results from Three Vessel Models , 2018, OCEANS 2018 MTS/IEEE Charleston.