Combined Reinforcement Learning via Abstract Representations
Joelle Pineau | Yoshua Bengio | Doina Precup | Vincent François-Lavet
[1] Satinder Singh, et al. Value Prediction Network, 2017, NIPS.
[2] Aaron C. Courville, et al. MINE: Mutual Information Neural Estimation, 2018, ArXiv.
[3] Marc G. Bellemare, et al. A Distributional Perspective on Reinforcement Learning, 2017, ICML.
[4] Geoffrey E. Hinton, et al. Visualizing Data using t-SNE, 2008.
[5] Shimon Whiteson, et al. TreeQN and ATreeC: Differentiable Tree Planning for Deep Reinforcement Learning, 2017, ICLR 2018.
[6] Razvan Pascanu, et al. Imagination-Augmented Agents for Deep Reinforcement Learning, 2017, NIPS.
[7] Joelle Pineau, et al. Decoupling Dynamics and Reward for Transfer Learning, 2018, ICLR.
[8] Martín Abadi, et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, 2016, ArXiv.
[9] Jürgen Schmidhuber, et al. World Models, 2018, ArXiv.
[10] Shimon Whiteson, et al. TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning, 2017, ICLR.
[11] Tom Schaul, et al. The Predictron: End-To-End Learning and Planning, 2016, ICML.
[12] Tom Schaul, et al. Reinforcement Learning with Unsupervised Auxiliary Tasks, 2016, ICLR.
[13] R. Bellman. A Markovian Decision Process, 1957.
[14] Yoshua Bengio, et al. Generative Adversarial Nets, 2014, NIPS.
[15] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[16] David Silver, et al. Deep Reinforcement Learning with Double Q-Learning, 2015, AAAI.
[17] Tom Schaul, et al. Rainbow: Combining Improvements in Deep Reinforcement Learning, 2017, AAAI.
[18] Christopher Burgess, et al. DARLA: Improving Zero-Shot Transfer in Reinforcement Learning, 2017, ICML.
[19] Dileep George, et al. Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics, 2017, ICML.
[20] Yoshua Bengio, et al. The Consciousness Prior, 2017, ArXiv.
[21] Doina Precup, et al. The Option-Critic Architecture, 2016, AAAI.
[22] Pieter Abbeel, et al. Value Iteration Networks, 2016, NIPS.
[23] Alex Graves, et al. Asynchronous Methods for Deep Reinforcement Learning, 2016, ICML.
[24] Martha White, et al. Unifying Task Specification in Reinforcement Learning, 2016, ICML.