暂无分享,去创建一个
Ruben Villegas | Honglak Lee | James Davidson | Ian S. Fischer | Danijar Hafner | Timothy P. Lillicrap | David Ha | Ian Fischer | T. Lillicrap | Honglak Lee | Danijar Hafner | James Davidson | Ruben Villegas | David R Ha
[1] Martial Hebert,et al. Improving Multi-Step Prediction of Learned Time Series Models , 2015, AAAI.
[2] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[3] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[4] Arthur G. Richards,et al. Robust constrained model predictive control , 2005 .
[5] Sergey Levine,et al. SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning , 2018, ICML.
[6] Yoshua Bengio,et al. Professor Forcing: A New Algorithm for Training Recurrent Networks , 2016, NIPS.
[7] Martin A. Riedmiller,et al. Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.
[8] Reuven Y. Rubinstein,et al. Optimization of computer simulation models with rare events , 1997 .
[9] Erik Talvitie,et al. Model Regularization for Stable Sample Rollouts , 2014, UAI.
[10] Daan Wierstra,et al. Recurrent Environment Simulators , 2017, ICLR.
[11] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.
[12] Ali Ghodsi,et al. Robust Locally-Linear Controllable Embedding , 2017, AISTATS.
[13] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[14] Ruben Villegas,et al. Learning to Generate Long-term Future via Hierarchical Prediction , 2017, ICML.
[15] Gregory Dudek,et al. Synthesizing Neural Network Controllers with Probabilistic Model-Based Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[16] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.
[17] Duy Nguyen-Tuong,et al. Probabilistic Recurrent State-Space Models , 2018, ICML.
[18] Fabio Viola,et al. Learning and Querying Fast Generative Models for Reinforcement Learning , 2018, ArXiv.
[19] Shimon Whiteson,et al. Deep Variational Reinforcement Learning for POMDPs , 2018, ICML.
[20] Antonio Torralba,et al. Generating Videos with Scene Dynamics , 2016, NIPS.
[21] Maximilian Karl,et al. Deep Variational Bayes Filters: Unsupervised Learning of State Space Models from Raw Data , 2016, ICLR.
[22] Sergey Levine,et al. Self-Supervised Visual Planning with Temporal Skip Connections , 2017, CoRL.
[23] Kevin Waugh,et al. DeepStack: Expert-level artificial intelligence in heads-up no-limit poker , 2017, Science.
[24] Prafulla Dhariwal,et al. Glow: Generative Flow with Invertible 1x1 Convolutions , 2018, NeurIPS.
[25] Yoshua Bengio,et al. A Recurrent Latent Variable Model for Sequential Data , 2015, NIPS.
[26] Alex Graves,et al. Video Pixel Networks , 2016, ICML.
[27] David Amos,et al. Generative Temporal Models with Memory , 2017, ArXiv.
[28] Sergey Levine,et al. Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[29] Pieter Abbeel,et al. Model-Ensemble Trust-Region Policy Optimization , 2018, ICLR.
[30] Sepp Hochreiter,et al. Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.
[31] Razvan Pascanu,et al. Imagination-Augmented Agents for Deep Reinforcement Learning , 2017, NIPS.
[32] Carl E. Rasmussen,et al. PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.
[33] Oriol Vinyals,et al. Neural Discrete Representation Learning , 2017, NIPS.
[34] Honglak Lee,et al. Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion , 2018, NeurIPS.
[35] Rob Fergus,et al. Stochastic Video Generation with a Learned Prior , 2018, ICML.
[36] Jitendra Malik,et al. Learning to Poke by Poking: Experiential Learning of Intuitive Physics , 2016, NIPS.
[37] Yuval Tassa,et al. DeepMind Control Suite , 2018, ArXiv.
[38] Allan Jabri,et al. Universal Planning Networks , 2018, ICML.
[39] Sergey Levine,et al. Deep visual foresight for planning robot motion , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[40] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[41] Gabriel Kalweit,et al. Uncertainty-driven Imagination for Continuous Deep Reinforcement Learning , 2017, CoRL.
[42] Misha Denil,et al. Learning Awareness Models , 2018, ICLR.
[43] Matthew W. Hoffman,et al. Distributed Distributional Deterministic Policy Gradients , 2018, ICLR.
[44] Christopher Burgess,et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.
[45] Joel Z. Leibo,et al. Unsupervised Predictive Memory in a Goal-Directed Agent , 2018, ArXiv.
[46] Uri Shalit,et al. Structured Inference Networks for Nonlinear State Space Models , 2016, AAAI.
[47] Sergey Levine,et al. Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control , 2018, ArXiv.
[48] Karol Gregor,et al. Temporal Difference Variational Auto-Encoder , 2018, ICLR.
[49] Sergey Levine,et al. Stochastic Variational Video Prediction , 2017, ICLR.
[50] Uri Shalit,et al. Deep Kalman Filters , 2015, ArXiv.
[51] Catholijn M. Jonker,et al. Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning , 2017, ArXiv.
[52] Honglak Lee,et al. Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.
[53] Yuval Tassa,et al. Synthesis and stabilization of complex behaviors through online trajectory optimization , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[54] Sergey Levine,et al. Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models , 2018, NeurIPS.
[55] Yann LeCun,et al. Model-Based Planning with Discrete and Continuous Actions , 2017 .
[56] Samy Bengio,et al. Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.
[57] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[58] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[59] Yann LeCun,et al. Deep multi-scale video prediction beyond mean square error , 2015, ICLR.
[60] C. Rasmussen,et al. Improving PILCO with Bayesian Neural Network Dynamics Models , 2016 .