Learning to Act in Partially Structured Dynamic Environment
[1] Lawrence V. Snyder, et al. A Deep Q-Network for the Beer Game with Partial Information, 2017, ArXiv.
[2] Gaurav S. Sukhatme, et al. Persistent ocean monitoring with underwater gliders: Adapting sampling resolution, 2011, J. Field Robotics.
[3] B. Bett, et al. Autonomous Underwater Vehicles (AUVs): Their past, present and future contributions to the advancement of marine geoscience, 2014.
[4] Emanuel Todorov, et al. Iterative Linear Quadratic Regulator Design for Nonlinear Biological Movement Systems, 2004, ICINCO.
[5] Sergey Levine, et al. Guided Policy Search, 2013, ICML.
[6] Alexander F. Shchepetkin, et al. The regional oceanic modeling system (ROMS): a split-explicit, free-surface, topography-following-coordinate oceanic model, 2005.
[7] Hyun-chul Lee, et al. Loop Current, Rings and Related Circulation in the Gulf of Mexico: A Review of Numerical Models and Future Challenges, 2013.
[8] Craig A. Woolsey, et al. Underwater glider motion control, 2008, 2008 47th IEEE Conference on Decision and Control.
[9] Demis Hassabis, et al. Mastering the game of Go without human knowledge, 2017, Nature.