暂无分享,去创建一个
[1] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[2] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[3] Jeff G. Schneider,et al. Covariant Policy Search , 2003, IJCAI.
[4] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[5] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[6] Emanuel Todorov,et al. Linearly-solvable Markov decision problems , 2006, NIPS.
[7] David M. Bradley,et al. Boosting Structured Prediction for Imitation Learning , 2006, NIPS.
[8] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[9] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[10] David Silver,et al. Learning to search: Functional gradient techniques for imitation learning , 2009, Auton. Robots.
[11] J. Andrew Bagnell,et al. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , 2010 .
[12] Yasemin Altun,et al. Relative Entropy Policy Search , 2010 .
[13] Sergey Levine,et al. Nonlinear Inverse Reinforcement Learning with Gaussian Processes , 2011, NIPS.
[14] Timothy Bretl,et al. Maximum entropy inverse reinforcement learning in continuous state spaces with path integrals , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[15] Jan Peters,et al. Relative Entropy Inverse Reinforcement Learning , 2011, AISTATS.
[16] Siddhartha S. Srinivasa,et al. Formalizing Assistive Teleoperation , 2012, Robotics: Science and Systems.
[17] Marc Toussaint,et al. On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference , 2012, Robotics: Science and Systems.
[18] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[19] Sergey Levine,et al. Continuous Inverse Optimal Control with Locally Optimal Examples , 2012, ICML.
[20] Stefan Schaal,et al. Learning objective functions for manipulation , 2013, 2013 IEEE International Conference on Robotics and Automation.
[21] Bernhard Schölkopf,et al. Learning strategies in table tennis using inverse reinforcement learning , 2014, Biological Cybernetics.
[22] Sergey Levine,et al. Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics , 2014, NIPS.
[23] Kris M. Kitani,et al. Action-Reaction: Forecasting the Dynamics of Human Interaction , 2014, ECCV.
[24] Joshua B. Tenenbaum,et al. Softstar: Heuristic-Guided Probabilistic Inference , 2015, NIPS.
[25] Alessandro Lazaric,et al. Maximum Entropy Semi-Supervised Inverse Reinforcement Learning , 2015, IJCAI.
[26] Trevor Darrell,et al. Simultaneous Deep Transfer Across Domains and Tasks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[27] Markus Wulfmeier,et al. Maximum Entropy Deep Inverse Reinforcement Learning , 2015, 1507.04888.
[28] Marc Toussaint,et al. Direct Loss Minimization Inverse Optimal Control , 2015, Robotics: Science and Systems.
[29] Byron Boots,et al. Graph-Based Inverse Optimal Control for Robot Manipulation , 2015, IJCAI.
[30] Nolan Wagener,et al. Learning contact-rich manipulation skills with guided policy search , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).
[31] Sergey Levine,et al. Deep spatial autoencoders for visuomotor learning , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).