Optimization of fish-like locomotion using hierarchical reinforcement learning
暂无分享,去创建一个
[1] Doina Precup,et al. Intra-Option Learning about Temporally Abstract Actions , 1998, ICML.
[2] Joel W. Burdick,et al. Experiments in carangiform robotic fish locomotion , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).
[3] M. Lighthill. Note on the swimming of slender fish , 1960, Journal of Fluid Mechanics.
[4] C. Breder. The locomotion of fishes , 1926 .
[5] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[6] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.
[7] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[8] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..