Apprenticeship Learning About Multiple Intentions
暂无分享,去创建一个
Michael L. Littman | Kaushik Subramanian | Vukosi N. Marivate | Monica Babes-Vroman | M. Littman | K. Subramanian | Monica Babes-Vroman | Vukosi Marivate
[1] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[2] George H. John. When the Best Move Isn't Optimal: Q-learning with Exploration , 1994, AAAI.
[3] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[4] Jeff A. Bilmes,et al. A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .
[5] Matthew Richardson,et al. Learning with Knowledge from Multiple Experts , 2003, ICML.
[6] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[7] Csaba Szepesvári,et al. Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods , 2007, UAI.
[8] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[9] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[10] Michael H. Bowling,et al. Apprenticeship learning using linear programming , 2008, ICML '08.
[11] Adam B. Moore,et al. A computational model of moral judgment , 2009 .
[12] Brett Browning,et al. Automatic weight learning for multiple data sources when learning from demonstration , 2009, 2009 IEEE International Conference on Robotics and Automation.
[13] Manuel Lopes,et al. Active Learning for Reward Estimation in Inverse Reinforcement Learning , 2009, ECML/PKDD.
[14] Luke S. Zettlemoyer,et al. Reinforcement Learning for Mapping Instructions to Actions , 2009, ACL.
[15] Csaba Szepesvári,et al. Training parsers by inverse reinforcement learning , 2009, Machine Learning.