论文信息 - Reinforcement Learning applied to the control of an AutonolDOlls ·U nclerwater .Vehicle

Reinforcement Learning applied to the control of an AutonolDOlls ·U nclerwater .Vehicle

At· the· Australlan·.National University we are developing an autonomous.· underwater vehicle for exploration> and inspection. Our aim is to develop ·on-board intelligent control.· We .in tend that····the> vehicle. will learn to control its thrusters in response to command and sen sor inputs..Algorithms continuous state and actions are being<.developedfor this purpose.

Wettergreen Alexander Zelinsky

[1] A. J. Healey,et al. Adaptive sliding mode control of autonomous underwater vehicles in the dive plane , 1990 .

[2] R.M. Sanner,et al. Neuromorphic pitch attitude regulation of an underwater telerobot , 1989, IEEE Control Systems Magazine.

[3] Donald A. Sofge,et al. Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , 1992 .

[4] Alexander Zelinsky,et al. Development of a visually-guided autonomous underwater vehicle , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[5] A. J. Healey,et al. Multivariable sliding mode control for autonomous diving and steering of unmanned underwater vehicles , 1993 .

[6] Jean-Jacques E. Slotine,et al. The influence of thruster dynamics on underwater vehicle behavior and their incorporation into control system design , 1990 .

[7] Peter Nordin,et al. Genetic Programming Controlling a Miniature Robot , 1995 .

[8] L. Rodrigues,et al. Sliding mode control of an AUV in the diving and steering planes , 1996, OCEANS 96 MTS/IEEE Conference Proceedings. The Coastal Ocean - Prospects for the 21st Century.

[9] Peter Lancaster,et al. Curve and surface fitting - an introduction , 1986 .

[10] Junku Yuh,et al. A survey and experimental study of neural network AUV control , 1996, Proceedings of Symposium on Autonomous Underwater Vehicle Technology.

[11] Jean-Jacques E. Slotine,et al. Robust trajectory control of underwater vehicles , 1985 .

[12] Leemon C Baird,et al. Reinforcement Learning With High-Dimensional, Continuous Actions , 1993 .

[13] Ralf Bachmayer,et al. A four quadrant finite dimensional thruster model , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[14] Abhijit S. Pandya,et al. On-line learning control of autonomous underwater vehicles using feedforward neural networks , 1992 .

[15] Giorgio Bartolini,et al. Tracking control of underwater vehicles including thruster dynamics by second order sliding modes , 1998, IEEE Oceanic Engineering Society. OCEANS'98. Conference Proceedings (Cat. No.98CH36259).

[16] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..