[1] Sergey Levine, et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor, 2018, ICML.
[2] Murray Shanahan, et al. Policy Consolidation for Continual Reinforcement Learning, 2019, ICML.
[3] Stefan Wermter, et al. Continual Lifelong Learning with Neural Networks: A Review, 2019, Neural Networks.
[4] Yi Wu, et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments, 2017, NIPS.
[5] Gerald Tesauro, et al. Deep RL With Information Constrained Policies: Generalization in Continuous Control, 2020, ArXiv.
[6] Craig Boutilier, et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, 1998, AAAI/IAAI.
[7] Jordi Grau-Moya, et al. Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning, 2019, CoRL.