暂无分享,去创建一个
[1] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[2] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[3] Jeff A. Bilmes,et al. Interactive Submodular Set Cover , 2010, ICML.
[4] Robert E. Schapire,et al. A Game-Theoretic Approach to Apprenticeship Learning , 2007, NIPS.
[5] Craig Boutilier,et al. Eliciting Additive Reward Functions for Markov Decision Processes , 2011, IJCAI.
[6] Craig Boutilier,et al. Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies , 2010, AAAI.
[7] Pieter Abbeel,et al. An Application of Reinforcement Learning to Aerobatic Helicopter Flight , 2006, NIPS.
[8] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[9] Manuel Lopes,et al. Active Learning for Reward Estimation in Inverse Reinforcement Learning , 2009, ECML/PKDD.
[10] Pieter Abbeel,et al. Learning for control from multiple demonstrations , 2008, ICML '08.
[11] Leslie Pack Kaelbling,et al. Effective reinforcement learning for mobile robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).
[12] Craig Boutilier,et al. Regret-based Reward Elicitation for Markov Decision Processes , 2009, UAI.
[13] Christos Dimitrakakis,et al. Preference elicitation and inverse reinforcement learning , 2011, ECML/PKDD.
[14] Andreas Krause,et al. Adaptive Submodularity: A New Approach to Active Learning and Stochastic Optimization , 2010, COLT 2010.
[15] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[16] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[17] Pieter Abbeel,et al. Apprenticeship learning for helicopter control , 2009, CACM.
[18] Daphne Koller,et al. Making Rational Decisions Using Adaptive Utility Elicitation , 2000, AAAI/IAAI.
[19] László Lovász,et al. Hit-and-run mixes fast , 1999, Math. Program..