暂无分享,去创建一个
Gerald Tesauro | Jonathan P. How | Chuangchuang Sun | Miao Liu | Golnaz Habibi | Sebastian Lopez-Cot | Matthew Riemer | Dong-Ki Kim | Marwa Abdulhai | G. Tesauro | J. How | M. Riemer | Dong-Ki Kim | Miao Liu | Golnaz Habibi | Chuangchuang Sun | Marwa Abdulhai | Sebastian Lopez-Cot
[1] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[2] Jun Wang,et al. Multi-Agent Reinforcement Learning , 2020, Deep Reinforcement Learning.
[3] Karol Hausman,et al. Gradient Surgery for Multi-Task Learning , 2020, NeurIPS.
[4] Drew Wicke,et al. Multiagent Soft Q-Learning , 2018, AAAI Spring Symposia.
[5] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.
[6] Ming Zhou,et al. Mean Field Multi-Agent Reinforcement Learning , 2018, ICML.
[7] Gerald Tesauro,et al. Learning to Learn without Forgetting By Maximizing Transfer and Minimizing Interference , 2018, ICLR.
[8] Maruan Al-Shedivat,et al. Learning Policy Representations in Multiagent Systems , 2018, ICML.
[9] Pieter Abbeel,et al. Meta-Learning with Temporal Convolutions , 2017, ArXiv.
[10] Martha White,et al. Meta-Learning Representations for Continual Learning , 2019, NeurIPS.
[11] Shimon Whiteson,et al. Stable Opponent Shaping in Differentiable Games , 2018, ICLR.
[12] Shimon Whiteson,et al. Learning with Opponent-Learning Awareness , 2017, AAMAS.
[13] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[14] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[15] Ricardo Vilalta,et al. A Perspective View and Survey of Meta-Learning , 2002, Artificial Intelligence Review.
[16] Pieter Abbeel,et al. Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments , 2017, ICLR.
[17] Pieter Abbeel,et al. A Simple Neural Attentive Meta-Learner , 2017, ICLR.
[18] Nando de Freitas,et al. Sample Efficient Actor-Critic with Experience Replay , 2016, ICLR.
[19] Jordan L. Boyd-Graber,et al. Opponent Modeling in Deep Reinforcement Learning , 2016, ICML.
[20] Rob Fergus,et al. Modeling Others using Oneself in Multi-Agent Reinforcement Learning , 2018, ICML.
[21] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011 .
[22] Min Lin,et al. Online Fast Adaptation and Knowledge Accumulation: a New Approach to Continual Learning , 2020, ArXiv.
[23] Shimon Whiteson,et al. Counterfactual Multi-Agent Policy Gradients , 2017, AAAI.
[24] Yi Wu,et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.
[25] Shimon Whiteson,et al. FACMAC: Factored Multi-Agent Centralised Policy Gradients , 2020, NeurIPS.
[26] Amos Storkey,et al. Meta-Learning in Neural Networks: A Survey , 2020, IEEE transactions on pattern analysis and machine intelligence.
[27] M. Stanković. Multi-agent reinforcement learning , 2016 .
[28] Shimon Whiteson,et al. DiCE: The Infinitely Differentiable Monte-Carlo Estimator , 2018, ICML.
[29] Joel Lehman,et al. Learning to Continually Learn , 2020, ECAI.
[30] J. Schulman,et al. Reptile: a Scalable Metalearning Algorithm , 2018 .
[31] Gunshi Gupta,et al. La-MAML: Look-ahead Meta Learning for Continual Learning , 2020, NeurIPS.
[32] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[33] Pablo Hernandez-Leal,et al. A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity , 2017, ArXiv.
[34] Giacomo Spigler. Meta-learnt priors slow down catastrophic forgetting in neural networks , 2019, ArXiv.
[35] Victor R. Lesser,et al. Multi-Agent Learning with Policy Prediction , 2010, AAAI.
[36] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[37] Siddhartha S. Srinivasa,et al. The Assistive Multi-Armed Bandit , 2019, 2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI).
[38] Filippos Christianos,et al. Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning , 2019, ArXiv.
[39] Yoshua Bengio,et al. On the Optimization of a Synaptic Learning Rule , 2007 .
[40] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.
[41] L. Shapley,et al. Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.
[42] Bart De Schutter,et al. Multi-agent Reinforcement Learning: An Overview , 2010 .
[43] David Vázquez,et al. Online Fast Adaptation and Knowledge Accumulation (OSAKA): a New Approach to Continual Learning , 2020, NeurIPS.