暂无分享,去创建一个
David Warde-Farley | Volodymyr Mnih | Tejas D. Kulkarni | Tejas Kulkarni | Catalin Ionescu | Steven Hansen | Tom Van de Wiele | Volodymyr Mnih | T. Wiele | David Warde-Farley | S. Hansen | Catalin Ionescu
[1] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .
[2] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[3] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[4] Blockin Blockin,et al. Quick Training of Probabilistic Neural Nets by Importance Sampling , 2003 .
[5] David Barber,et al. Information Maximization in Noisy Channels : A Variational Approach , 2003, NIPS.
[6] Geoffrey E. Hinton,et al. Inferring Motor Programs from Images of Handwritten Digits , 2005, NIPS.
[7] Hossein Mobahi,et al. Deep learning from temporal coherence in video , 2009, ICML '09.
[8] Patrick M. Pilarski,et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.
[9] Ben Taskar,et al. Determinantal Point Processes for Machine Learning , 2012, Found. Trends Mach. Learn..
[10] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[11] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[12] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.
[13] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[14] Honglak Lee,et al. Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.
[15] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[16] Marlos C. Machado,et al. Learning Purposeful Behaviour in the Absence of Rewards , 2016, ArXiv.
[17] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[18] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[19] Joshua B. Tenenbaum,et al. Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.
[20] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.
[21] Sergey Levine,et al. Continuous Deep Q-Learning with Model-based Acceleration , 2016, ICML.
[22] Tom Schaul,et al. FeUdal Networks for Hierarchical Reinforcement Learning , 2017, ICML.
[23] Pieter Abbeel,et al. Stochastic Neural Networks for Hierarchical Reinforcement Learning , 2016, ICLR.
[24] Marcin Andrychowicz,et al. Hindsight Experience Replay , 2017, NIPS.
[25] Marlos C. Machado,et al. A Laplacian Framework for Option Discovery in Reinforcement Learning , 2017, ICML.
[26] Daan Wierstra,et al. Variational Intrinsic Control , 2016, ICLR.
[27] Daan Wierstra,et al. Recurrent Environment Simulators , 2017, ICLR.
[28] David Budden,et al. Distributed Prioritized Experience Replay , 2018, ICLR.
[29] Sergey Levine,et al. Time-Contrastive Networks: Self-Supervised Learning from Video , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[30] Pierre-Yves Oudeyer,et al. Unsupervised Learning of Goal Spaces for Intrinsically Motivated Goal Exploration , 2018, ICLR.
[31] Satinder Singh,et al. Many-Goals Reinforcement Learning , 2018, ArXiv.
[32] Sergey Levine,et al. Data-Efficient Hierarchical Reinforcement Learning , 2018, NeurIPS.
[33] Sergey Levine,et al. Visual Reinforcement Learning with Imagined Goals , 2018, NeurIPS.
[34] Pieter Abbeel,et al. Automatic Goal Generation for Reinforcement Learning Agents , 2017, ICML.
[35] Kate Saenko,et al. Hierarchical Reinforcement Learning with Hindsight , 2018, ArXiv.
[36] Oriol Vinyals,et al. Synthesizing Programs for Images using Reinforced Adversarial Learning , 2018, ICML.
[37] Pushmeet Kohli,et al. Learning to Follow Language Instructions with Adversarial Reward Induction , 2018, ArXiv.
[38] Pierre-Yves Oudeyer,et al. Curiosity Driven Exploration of Learned Disentangled Goal Spaces , 2018, CoRL.
[39] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[40] Jitendra Malik,et al. Zero-Shot Visual Imitation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[41] Yuval Tassa,et al. DeepMind Control Suite , 2018, ArXiv.
[42] Allan Jabri,et al. Universal Planning Networks , 2018, ICML.
[43] Martin A. Riedmiller,et al. Learning by Playing - Solving Sparse Reward Tasks from Scratch , 2018, ICML.
[44] Pushmeet Kohli,et al. Learning to Understand Goal Specifications by Modelling Reward , 2018, ICLR.
[45] Sergey Levine,et al. Diversity is All You Need: Learning Skills without a Reward Function , 2018, ICLR.