Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics
暂无分享,去创建一个
Wolfram Burgard | Tobias Gindele | Felix Schmitt | Jörg Wagner | Michael Herman | W. Burgard | Jörg Wagner | Michael Herman | T. Gindele | Felix Schmitt
[1] Welch Bl. THE GENERALIZATION OF ‘STUDENT'S’ PROBLEM WHEN SEVERAL DIFFERENT POPULATION VARLANCES ARE INVOLVED , 1947 .
[2] E. Zeidler. The Implicit Function Theorem , 1995 .
[3] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[4] Harold R. Parks,et al. The Implicit Function Theorem , 2002 .
[5] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[6] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[7] Csaba Szepesvári,et al. Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods , 2007, UAI.
[8] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[9] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[10] Siddhartha S. Srinivasa,et al. Planning-based prediction for pedestrians , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[11] J. Andrew Bagnell,et al. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , 2010 .
[12] Christian Vollmer,et al. Learning to navigate through crowded environments , 2010, 2010 IEEE International Conference on Robotics and Automation.
[13] Anind K. Dey,et al. Modeling Interaction via the Principle of Maximum Causal Entropy , 2010, ICML.
[14] Christos Dimitrakakis,et al. Preference elicitation and inverse reinforcement learning , 2011, ECML/PKDD.
[15] Jan Peters,et al. Relative Entropy Inverse Reinforcement Learning , 2011, AISTATS.
[16] Matthieu Geist,et al. Inverse Reinforcement Learning through Structured Classification , 2012, NIPS.
[17] Matthieu Geist,et al. A Cascaded Supervised Learning Approach to Inverse Reinforcement Learning , 2013, ECML/PKDD.
[18] Christos Dimitrakakis,et al. Probabilistic inverse reinforcement learning in unknown environments , 2013, UAI.
[19] Byron M. Yu,et al. Learning an Internal Dynamics Model from Control Demonstration , 2013, ICML.
[20] Wolfram Burgard,et al. Teaching mobile robots to cooperatively navigate in populated environments , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[21] N. Bambos,et al. Infinite time horizon maximum causal entropy inverse reinforcement learning , 2014, 53rd IEEE Conference on Decision and Control.
[22] Michael Bloem,et al. Infinite Time Horizon Maximum Causal Entropy Inverse Reinforcement Learning , 2014, IEEE Transactions on Automatic Control.