Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning
暂无分享,去创建一个
[1] Joan Bruna,et al. Spectral Networks and Locally Connected Networks on Graphs , 2013, ICLR.
[2] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[3] Joshua B. Tenenbaum,et al. Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.
[4] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.
[5] Sergey Levine,et al. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables , 2019, ICML.
[6] Tamim Asfour,et al. ProMP: Proximal Meta-Policy Search , 2018, ICLR.
[7] Demis Hassabis,et al. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play , 2018, Science.
[8] Sergey Levine,et al. Meta-Reinforcement Learning of Structured Exploration Strategies , 2018, NeurIPS.
[9] Samy Bengio,et al. Order Matters: Sequence to sequence for sets , 2015, ICLR.
[10] Noam Brown,et al. Superhuman AI for heads-up no-limit poker: Libratus beats top professionals , 2018, Science.
[11] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.
[12] Jure Leskovec,et al. Inductive Representation Learning on Large Graphs , 2017, NIPS.
[13] Razvan Pascanu,et al. Interaction Networks for Learning about Objects, Relations and Physics , 2016, NIPS.
[14] Abhinav Gupta,et al. Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[15] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.
[16] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[17] Razvan Pascanu,et al. A simple neural network module for relational reasoning , 2017, NIPS.
[18] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[19] Pietro Liò,et al. Graph Attention Networks , 2017, ICLR.
[20] Razvan Pascanu,et al. Relational inductive biases, deep learning, and graph networks , 2018, ArXiv.
[21] Xuming He,et al. LatentGNN: Learning Efficient Non-local Relations for Visual Recognition , 2019, ICML.
[22] Nasser M. Nasrabadi,et al. Pattern Recognition and Machine Learning , 2006, Technometrics.
[23] Qiang Liu,et al. Learning to Explore via Meta-Policy Gradient , 2018, ICML.
[24] Pieter Abbeel,et al. A Simple Neural Attentive Meta-Learner , 2017, ICLR.
[25] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[26] Malcolm J. A. Strens,et al. A Bayesian Framework for Reinforcement Learning , 2000, ICML.
[27] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[28] Alexander J. Smola,et al. Deep Sets , 2017, 1703.06114.
[29] Richard S. Zemel,et al. Gated Graph Sequence Neural Networks , 2015, ICLR.
[30] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[31] Benjamin Van Roy,et al. (More) Efficient Reinforcement Learning via Posterior Sampling , 2013, NIPS.
[32] Pieter Abbeel,et al. Meta-Learning with Temporal Convolutions , 2017, ArXiv.
[33] Yedid Hoshen,et al. VAIN: Attentional Multi-agent Predictive Modeling , 2017, NIPS.
[34] Qiang Liu,et al. Learning to Explore with Meta-Policy Gradient , 2018, ICML 2018.
[35] Pieter Abbeel,et al. Some Considerations on Learning to Explore via Meta-Reinforcement Learning , 2018, ICLR 2018.
[36] Gerald Tesauro,et al. Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..
[37] Max Welling,et al. Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.
[38] Sergey Levine,et al. Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review , 2018, ArXiv.
[39] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[40] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[41] Yee Whye Teh,et al. Meta reinforcement learning as task inference , 2019, ArXiv.
[42] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.
[43] Samuel S. Schoenholz,et al. Neural Message Passing for Quantum Chemistry , 2017, ICML.
[44] Benjamin Van Roy,et al. Deep Exploration via Bootstrapped DQN , 2016, NIPS.