Integrating State Representation Learning Into Deep Reinforcement Learning
暂无分享,去创建一个
Karl Tuyls | Jens Kober | Robert Babuška | Tim de Bruin | Robert Babuška | K. Tuyls | J. Kober | Tim de Bruin | Jens Kober
[1] Chris Watkins,et al. Learning from delayed rewards , 1989 .
[2] Jitendra Malik,et al. Learning to Poke by Poking: Experiential Learning of Intuitive Physics , 2016, NIPS.
[3] Martin A. Riedmiller,et al. Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.
[4] Terrence J. Sejnowski,et al. Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.
[5] Samuel Gershman,et al. Deep Successor Reinforcement Learning , 2016, ArXiv.
[6] Sergey Levine,et al. Deep spatial autoencoders for visuomotor learning , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).
[7] Sergey Levine,et al. Learning Visual Servoing with Deep Features and Fitted Q-Iteration , 2017, ICLR.
[8] Razvan Pascanu,et al. Learning to Navigate in Complex Environments , 2016, ICLR.
[9] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[10] M. Botvinick,et al. The hippocampus as a predictive map , 2016 .
[11] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.
[12] Oliver Brock,et al. Learning state representations with robotic priors , 2015, Auton. Robots.
[13] Martin A. Riedmiller,et al. PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations , 2017, ArXiv.
[14] Marc G. Bellemare,et al. Increasing the Action Gap: New Operators for Reinforcement Learning , 2015, AAAI.
[15] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[16] Danica Kragic,et al. Deep predictive policy training using reinforcement learning , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[17] Longxin Lin. Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.
[18] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[19] Peter Dayan,et al. Improving Generalization for Temporal Difference Learning: The Successor Representation , 1993, Neural Computation.
[20] David Silver,et al. Learning values across many orders of magnitude , 2016, NIPS.
[21] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[22] Manuela M. Veloso,et al. Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation , 2017, CoRL.
[23] Juhan Nam,et al. Multimodal Deep Learning , 2011, ICML.
[24] Robert Babuska,et al. Learning state representation for deep actor-critic control , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).
[25] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.
[26] Tom Schaul,et al. Successor Features for Transfer in Reinforcement Learning , 2016, NIPS.
[27] Jan Peters,et al. Stable reinforcement learning with autoencoders for tactile and visual data , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[28] Trevor Darrell,et al. Loss is its own Reward: Self-Supervision for Reinforcement Learning , 2016, ICLR.
[29] Christos Dimitrakakis,et al. TORCS, The Open Racing Car Simulator , 2005 .
[30] Martin A. Riedmiller,et al. Autonomous reinforcement learning on raw visual input data in a real world application , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).