Scalable Bayesian Reinforcement Learning for Multiagent POMDPs
暂无分享,去创建一个
[1] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[2] Kee-Eung Kim,et al. Learning to Cooperate via Policy Search , 2000, UAI.
[3] Carlos Guestrin,et al. Multiagent Planning with Factored MDPs , 2001, NIPS.
[4] Milind Tambe,et al. The Communicative Multiagent Team Decision Problem: Analyzing Teamwork Theories and Models , 2011, J. Artif. Intell. Res..
[5] Leslie Pack Kaelbling,et al. All learning is Local: Multi-agent Learning in Global Reward Games , 2003, NIPS.
[6] Craig Boutilier,et al. Coordination in multiagent reinforcement learning: a Bayesian approach , 2003, AAMAS '03.
[7] Makoto Yokoo,et al. Networked Distributed POMDPs: A Synergy of Distributed Constraint Optimization and POMDPs , 2005, IJCAI.
[8] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[9] Nikos A. Vlassis,et al. Collaborative Multiagent Reinforcement Learning by Payoff Propagation , 2006, J. Mach. Learn. Res..
[10] Csaba Szepesvári,et al. Bandit Based Monte-Carlo Planning , 2006, ECML.
[11] Nicholas R. Jennings,et al. Decentralised coordination of low-power embedded devices using the max-sum algorithm , 2008, AAMAS.
[12] Shimon Whiteson,et al. Exploiting locality of interaction in factored Dec-POMDPs , 2008, AAMAS.
[13] Joel Veness,et al. Monte-Carlo Planning in Large POMDPs , 2010, NIPS.
[14] Joelle Pineau,et al. A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes , 2011, J. Mach. Learn. Res..
[15] Nicholas R. Jennings,et al. Decentralized Bayesian reinforcement learning for online agent collaboration , 2012, AAMAS.
[16] Frans A. Oliehoek,et al. Decentralized POMDPs , 2012, Reinforcement Learning.
[17] Peter Dayan,et al. Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search , 2012, NIPS.
[18] Decentralized control of partially observable Markov decision processes , 2015, 52nd IEEE Conference on Decision and Control.