[1] Jimmy Ba, et al. Adam: A Method for Stochastic Optimization, 2014, ICLR.
[2] Prashant Doshi, et al. A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress, 2018, Artif. Intell.
[3] Markus Wulfmeier, et al. Maximum Entropy Deep Inverse Reinforcement Learning, 2015, arXiv:1507.04888.
[4] Herke van Hoof, et al. Addressing Function Approximation Error in Actor-Critic Methods, 2018, ICML.
[5] Stefano Ermon, et al. Generative Adversarial Imitation Learning, 2016, NIPS.
[6] Sebastian Tschiatschek, et al. Learner-aware Teaching: Inverse Reinforcement Learning with Preferences and Constraints, 2019, NeurIPS.
[7] Gabriel Kalweit, et al. Interpretable Multi Time-scale Constraints in Model-free Deep Reinforcement Learning for Autonomous Driving, 2020, ArXiv.
[8] Stuart J. Russell, et al. Research Priorities for Robust and Beneficial Artificial Intelligence, 2015, AI Mag.
[9] Gabriel Kalweit, et al. Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization, 2019.
[10] Daniel Krajzewicz, et al. Recent Development and Applications of SUMO - Simulation of Urban MObility, 2012.
[11] Csaba Szepesvári, et al. Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods, 2007, UAI.
[12] Michael A. Osborne, et al. The future of employment: How susceptible are jobs to computerisation?, 2017.
[13] Sergey Levine, et al. Nonlinear Inverse Reinforcement Learning with Gaussian Processes, 2011, NIPS.
[14] Peter Dayan, et al. Q-learning, 1992, Machine Learning.
[15] Andrew Y. Ng, et al. Algorithms for Inverse Reinforcement Learning, 2000, ICML.
[16] Sergey Levine, et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, 2016, ICML.
[17] Pieter Abbeel, et al. Apprenticeship learning via inverse reinforcement learning, 2004, ICML.
[18] Peter Englert, et al. Inverse KKT - Learning Cost Functions of Manipulation Tasks from Demonstrations, 2017, ISRR.
[19] Anind K. Dey, et al. Maximum Entropy Inverse Reinforcement Learning, 2008, AAAI.
[20] Peter Dayan, et al. Improving Generalization for Temporal Difference Learning: The Successor Representation, 1993, Neural Computation.
[21] Gabriel Kalweit, et al. Dynamic Input for Deep Reinforcement Learning in Autonomous Driving, 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[22] J. Andrew Bagnell, et al. Maximum margin planning, 2006, ICML.
[23] Gabriel Kalweit, et al. Off-policy Multi-step Q-learning, 2019, ArXiv.
[24] Samuel Gershman, et al. Deep Successor Reinforcement Learning, 2016, ArXiv.
[25] Gabriel Kalweit, et al. Deep Constrained Q-learning, 2020.
[26] Jan Peters, et al. Relative Entropy Inverse Reinforcement Learning, 2011, AISTATS.
[27] Matthias Scheutz, et al. Value Alignment or Misalignment - What Will Keep Systems Accountable?, 2017, AAAI Workshops.
[28] David Silver, et al. Deep Reinforcement Learning with Double Q-Learning, 2015, AAAI.
[29] Michael L. Littman, et al. Reinforcement Learning as a Framework for Ethical Decision Making, 2016, AAAI Workshop: AI, Ethics, and Society.
[30] Eyal Amir, et al. Bayesian Inverse Reinforcement Learning, 2007, IJCAI.