Decentralized Bayesian reinforcement learning for online agent collaboration
Nicholas R. Jennings | Sally I. McClean | Alex Rogers | Gerard P. Parr | Alessandro Farinelli | Georgios Chalkiadakis | W. T. Luke Teacy
[1] Carlos Guestrin, et al. Max-norm Projections for Factored MDPs, 2001, IJCAI.
[2] Craig Boutilier, et al. Sequentially optimal repeated coalition formation under uncertainty, 2012, Autonomous Agents and Multi-Agent Systems.
[3] Stuart J. Russell, et al. Bayesian Q-Learning, 1998, AAAI/IAAI.
[4] Joel Veness, et al. Monte-Carlo Planning in Large POMDPs, 2010, NIPS.
[5] Neil Immerman, et al. The Complexity of Decentralized Control of Markov Decision Processes, 2000, UAI.
[6] Makoto Yokoo, et al. Networked Distributed POMDPs: A Synergy of Distributed Constraint Optimization and POMDPs, 2005, IJCAI.
[7] Robert J. McEliece, et al. The generalized distributive law, 2000, IEEE Trans. Inf. Theory.
[8] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[9] Hea-Jung Kim, et al. Moments of truncated Student-t distribution, 2008.
[10] William T. Freeman, et al. On the optimality of solutions of the max-product belief-propagation algorithm in arbitrary graphs, 2001, IEEE Trans. Inf. Theory.
[11] Michail G. Lagoudakis, et al. Coordinated Reinforcement Learning, 2002, ICML.
[12] J. J. Martin. Bayesian Decision Problems and Markov Chains, 1967.
[13] X. Jin. Factor graphs and the Sum-Product Algorithm, 2002.
[14] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.
[15] David Andre, et al. Model based Bayesian Exploration, 1999, UAI.
[16] Nicholas R. Jennings, et al. Decentralised coordination of low-power embedded devices using the max-sum algorithm, 2008, AAMAS.
[17] Nikos A. Vlassis, et al. Collaborative Multiagent Reinforcement Learning by Payoff Propagation, 2006, J. Mach. Learn. Res.
[18] Craig Boutilier, et al. Sequential Optimality and Coordination in Multiagent Systems, 1999, IJCAI.
[19] Craig Boutilier, et al. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage, 1999, J. Artif. Intell. Res.
[20] Mark J. Schervish, et al. Student's solutions manual to accompany probability and statistics, 2002.