Learning Distinct Strategies for Heterogeneous Cooperative Multi-agent Reinforcement Learning
暂无分享,去创建一个
[1] Xiaoyan Zhu,et al. Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning , 2018, WWW.
[2] Nikos A. Vlassis,et al. Optimal and Approximate Q-value Functions for Decentralized POMDPs , 2008, J. Artif. Intell. Res..
[3] Prateek Jain,et al. Non-convex Optimization for Machine Learning , 2017, Found. Trends Mach. Learn..
[4] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[5] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.