Wei Xu | Volker Tresp | Pieter Abbeel | Yang Gao | Rui Zhao
[1] David Barber,et al. The IM algorithm: a variational approach to Information Maximization , 2003, NIPS.
[2] Aaron C. Courville,et al. MINE: Mutual Information Neural Estimation , 2018, ArXiv.
[3] Yoshua Bengio,et al. Deep Learning of Representations for Unsupervised and Transfer Learning , 2011, ICML Unsupervised and Transfer Learning.
[4] Jordi Torres,et al. Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills , 2020, ICML.
[5] Olivier Marre,et al. Relevant sparse codes with variational information bottleneck , 2016, NIPS.
[6] Murray Shanahan,et al. Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning , 2017, IEEE Transactions on Neural Networks and Learning Systems.
[7] Sergey Levine,et al. Diversity is All You Need: Learning Skills without a Reward Function , 2018, ICLR.
[8] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[9] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[10] Peter Stone,et al. Empowerment for continuous agent-environment systems , 2011, Adapt. Behav.
[11] David H. Wolpert,et al. Nonlinear Information Bottleneck , 2017, Entropy.
[12] Shakir Mohamed,et al. Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning , 2015, NIPS.
[13] Marcin Andrychowicz,et al. Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research , 2018, ArXiv.
[14] Jürgen Schmidhuber,et al. A possibility for implementing curiosity and boredom in model-building neural controllers , 1991.
[15] Thomas Lukasiewicz,et al. Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards , 2019, AAAI.
[16] David Warde-Farley,et al. Unsupervised Control Through Non-Parametric Discriminative Rewards , 2018, ICLR.
[17] Rui Zhao,et al. Maximum Entropy-Regularized Multi-Goal Reinforcement Learning , 2019, ICML.
[18] Alexander A. Alemi,et al. Deep Variational Information Bottleneck , 2017, ICLR.
[19] Marcin Andrychowicz,et al. Hindsight Experience Replay , 2017, NIPS.
[20] Daan Wierstra,et al. Variational Intrinsic Control , 2016, ICLR.
[21] Yoshua Bengio,et al. Learning deep representations by mutual information estimation and maximization , 2018, ICLR.
[22] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[23] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[24] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[25] Arindam Banerjee,et al. On Bayesian bounds , 2006, ICML.
[26] Jakub W. Pachocki,et al. Dota 2 with Large Scale Deep Reinforcement Learning , 2019, ArXiv.
[27] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[28] Filip De Turck,et al. VIME: Variational Information Maximizing Exploration , 2016, NIPS.
[29] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[30] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.
[31] Alexei A. Efros,et al. Curiosity-Driven Exploration by Self-Supervised Prediction , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[32] Christoph Salge,et al. Empowerment - an Introduction , 2013, ArXiv.
[33] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[34] Alexei A. Efros,et al. Large-Scale Study of Curiosity-Driven Learning , 2018, ICLR.