Reinforcement Learning applied to the control of an Autonomous Underwater Vehicle

At the Australian National University we are developing an autonomous underwater vehicle for exploration and inspection. Our aim is to develop on-board intelligent control. We intend that the vehicle will learn to control its thrusters in response to command and sensor inputs. Algorithms based on reinforcement learning with continuous state and actions are being developed for this purpose.

[1]  I. G. BONNER CLAPPISON Editor , 1960, The Electric Power Engineering Handbook - Five Volume Set.

[2]  Jean-Jacques E. Slotine,et al.  Robust trajectory control of underwater vehicles , 1985 .

[3]  Peter Lancaster,et al.  Curve and surface fitting - an introduction , 1986 .

[4]  R. Lathe Phd by thesis , 1988, Nature.

[5]  R.M. Sanner,et al.  Neuromorphic pitch attitude regulation of an underwater telerobot , 1989, IEEE Control Systems Magazine.

[6]  Jean-Jacques E. Slotine,et al.  The influence of thruster dynamics on underwater vehicle behavior and their incorporation into control system design , 1990 .

[7]  A. J. Healey,et al.  Adaptive sliding mode control of autonomous underwater vehicles in the dive plane , 1990 .

[8]  Abhijit S. Pandya,et al.  On-line learning control of autonomous underwater vehicles using feedforward neural networks , 1992 .

[9]  Donald A. Sofge,et al.  Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , 1992 .

[10]  A. J. Healey,et al.  Multivariable sliding mode control for autonomous diving and steering of unmanned underwater vehicles , 1993 .

[11]  Leemon C Baird,et al.  Reinforcement Learning With High-Dimensional, Continuous Actions , 1993 .

[12]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[13]  Peter Nordin,et al.  Genetic Programming Controlling a Miniature Robot , 1995 .

[14]  L. Rodrigues,et al.  Sliding mode control of an AUV in the diving and steering planes , 1996, OCEANS 96 MTS/IEEE Conference Proceedings. The Coastal Ocean - Prospects for the 21st Century.

[15]  Junku Yuh,et al.  A survey and experimental study of neural network AUV control , 1996, Proceedings of Symposium on Autonomous Underwater Vehicle Technology.

[16]  Ralf Bachmayer,et al.  A four quadrant finite dimensional thruster model , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[17]  Giorgio Bartolini,et al.  Tracking control of underwater vehicles including thruster dynamics by second order sliding modes , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[18]  Alexander Zelinsky,et al.  Development of a visually-guided autonomous underwater vehicle , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[19]  H. Wesseling An introduction , 2000, European Review.

[20]  O. Yaniv IEEE Control Systems Magazine , 2001 .