Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study

Docking control of an autonomous underwater vehicle (AUV) is a task that is integral to achieving persistent long term autonomy. This work explores the application of state-of-the-art model-free deep reinforcement learning (DRL) approaches to the task of AUV docking in the continuous domain. We provide a detailed formulation of the reward function, utilized to successfully dock the AUV onto a fixed docking platform. A major contribution that distinguishes our work from the previous approaches is the usage of a physics simulator to define and simulate the underwater environment as well as the DeepLeng AUV. We propose a new reward function formulation for the docking task, incorporating several components, that outperforms previous reward formulations. We evaluate proximal policy optimization (PPO), twin delayed deep deterministic policy gradients (TD3) and soft actor-critic (SAC) in combination with our reward function. Our evaluation yielded results that conclusively show the TD3 agent to be most efficient and consistent in terms of docking the AUV, over multiple evaluation runs it achieved a 100% success rate and episode return of 10667.1±688.8. We also show how our reward function formulation improves over the state of the art.

[1]  Jun-Ho Oh,et al.  Experiments on vision guided docking of an autonomous underwater vehicle using one camera , 2009 .

[2]  C.-C. Huang,et al.  Design of a sliding mode fuzzy controller for the guidance and control of an autonomous underwater vehicle , 2003 .

[3]  Zhenyu Shi,et al.  Deep reinforcement learning based optimal trajectory tracking control of autonomous underwater vehicle , 2017, 2017 36th Chinese Control Conference (CCC).

[4]  S. M. Smith,et al.  Fuzzy logic control of an autonomous underwater vehicle , 1994 .

[5]  Bilal Wehbe,et al.  From Epi- to Bathypelagic: Transformation of a Compact AUV System for Long-Term Deployments , 2020, 2020 IEEE/OES Autonomous Underwater Vehicles Symposium (AUV)(50043).

[6]  B.W. Hobson,et al.  Docking Control System for a 54-cm-Diameter (21-in) AUV , 2008, IEEE Journal of Oceanic Engineering.

[7]  Giles Thomas,et al.  Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning , 2019, Applied Sciences.

[8]  Sen Wang,et al.  Adaptive low-level control of autonomous underwater vehicles using deep reinforcement learning , 2018, Robotics Auton. Syst..

[9]  Jun-Ho Oh,et al.  Homing And Docking Control of AUV Using Model Predictive Control , 2002 .

[10]  Gerardo G. Acosta,et al.  AUV Position Tracking Control Using End-to-End Deep Reinforcement Learning , 2018, OCEANS 2018 MTS/IEEE Charleston.

[11]  Frederic Maire,et al.  A Vision Based Target Detection System for Docking of an A utonomous Underwater Vehicle , 2009, ICRA 2009.

[12]  E. An,et al.  A Robust Fuzzy Autonomous Underwater Vehicle (AUV) Docking Approach for Unknown Current Disturbances , 2012, IEEE Journal of Oceanic Engineering.

[13]  Cheng Wu,et al.  Depth Control of Model-Free AUVs via Reinforcement Learning , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[14]  Philip Bachman,et al.  Deep Reinforcement Learning that Matters , 2017, AAAI.

[15]  Sea-Moon Kim,et al.  Visual servoing for underwater docking of an autonomous underwater vehicle with one camera , 2003, Oceans 2003. Celebrating the Past ... Teaming Toward the Future (IEEE Cat. No.03CH37492).