论文信息 - Generalized Maximum Causal Entropy for Inverse Reinforcement Learning

Generalized Maximum Causal Entropy for Inverse Reinforcement Learning

We consider the problem of learning from demonstrated trajectories with inverse reinforcement learning (IRL). Motivated by a limitation of the classical maximum entropy model in capturing the structure of the network of states, we propose an IRL model based on a generalized version of the causal entropy maximization problem, which allows us to generate a class of maximum entropy IRL models. Our generalized model has an advantage of being able to recover, in addition to a reward function, another expert's function that would (partially) capture the impact of the connecting structure of the states on experts' decisions. Empirical evaluation on a real-world dataset and a grid-world dataset shows that our generalized model outperforms the classical ones, in terms of recovering reward functions and demonstrated trajectories.

Patrick Jaillet | Tien Mai | Kennard Chan

[1] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[2] Generalized entropy models , 2016 .

[3] Markus Wulfmeier,et al. Maximum Entropy Deep Inverse Reinforcement Learning , 2015, 1507.04888.

[4] A. Dawid,et al. Game theory, maximum entropy, minimum discrepancy and robust Bayesian decision theory , 2004, math/0410076.

[5] Stuart J. Russell. Learning agents for uncertain environments (extended abstract) , 1998, COLT' 98.

[6] David Silver,et al. Learning to search: Functional gradient techniques for imitation learning , 2009, Auton. Robots.

[7] Sergey Levine,et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization , 2016, ICML.

[8] David M. Bradley,et al. Boosting Structured Prediction for Imitation Learning , 2006, NIPS.

[9] André de Palma,et al. E M ] 2 6 Se p 20 17 Discrete Choice and Rational Inattention : a General Equivalence Result ∗ , 2018 .

[10] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.

[11] Sergey Levine,et al. A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models , 2016, ArXiv.