Reinforcement Learning applied to the control of an AutonolDOlls ·U nclerwater .Vehicle

At· the· Australlan·.National University we are developing an autonomous.· underwater vehicle for exploration> and inspection. Our aim is to develop ·on-board intelligent control.· We .in­ tend that····the> vehicle. will learn to control its thrusters in response to command and sen­ sor inputs..Algorithms continuous state and actions are being<.developedfor this purpose.

[1]  A. J. Healey,et al.  Adaptive sliding mode control of autonomous underwater vehicles in the dive plane , 1990 .

[2]  R.M. Sanner,et al.  Neuromorphic pitch attitude regulation of an underwater telerobot , 1989, IEEE Control Systems Magazine.

[3]  Donald A. Sofge,et al.  Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , 1992 .

[4]  Alexander Zelinsky,et al.  Development of a visually-guided autonomous underwater vehicle , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[5]  A. J. Healey,et al.  Multivariable sliding mode control for autonomous diving and steering of unmanned underwater vehicles , 1993 .

[6]  Jean-Jacques E. Slotine,et al.  The influence of thruster dynamics on underwater vehicle behavior and their incorporation into control system design , 1990 .

[7]  Peter Nordin,et al.  Genetic Programming Controlling a Miniature Robot , 1995 .

[8]  L. Rodrigues,et al.  Sliding mode control of an AUV in the diving and steering planes , 1996, OCEANS 96 MTS/IEEE Conference Proceedings. The Coastal Ocean - Prospects for the 21st Century.

[9]  Peter Lancaster,et al.  Curve and surface fitting - an introduction , 1986 .

[10]  Junku Yuh,et al.  A survey and experimental study of neural network AUV control , 1996, Proceedings of Symposium on Autonomous Underwater Vehicle Technology.

[11]  Jean-Jacques E. Slotine,et al.  Robust trajectory control of underwater vehicles , 1985 .

[12]  Leemon C Baird,et al.  Reinforcement Learning With High-Dimensional, Continuous Actions , 1993 .

[13]  Ralf Bachmayer,et al.  A four quadrant finite dimensional thruster model , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[14]  Abhijit S. Pandya,et al.  On-line learning control of autonomous underwater vehicles using feedforward neural networks , 1992 .

[15]  Giorgio Bartolini,et al.  Tracking control of underwater vehicles including thruster dynamics by second order sliding modes , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[16]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..