Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
[1] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.
[2] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell.
[3] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res.
[4] Jun Morimoto,et al. Minimax Differential Dynamic Programming: An Application to Robust Biped Walking , 2002, NIPS.
[5] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst.
[6] J. Chestnutt,et al. Planning Biped Navigation Strategies in Complex Environments , 2003.
[8] Peter Stone,et al. Machine Learning for Fast Quadrupedal Locomotion , 2004, AAAI.
[9] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[10] Ben Taskar,et al. Learning structured prediction models: a large margin approach , 2005, ICML.
[11] Hyoukryeol Choi,et al. Gait Planning of Quadruped Walking and Climbing Robot for Locomotion in 3D Environment , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.
[12] Peter Stone,et al. State Abstraction Discovery from Irrelevant State Variables , 2005, IJCAI.
[13] Thomas Hofmann,et al. Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res.
[14] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[15] Chih-Han Yu,et al. Quadruped robot obstacle negotiation via reinforcement learning , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006.
[16] David M. Bradley,et al. Boosting Structured Prediction for Imitation Learning , 2006, NIPS.
[17] Csaba Szepesvári,et al. Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods , 2007, UAI.