论文信息 - Anytime algorithms for multiagent decision making using coordination graphs

Anytime algorithms for multiagent decision making using coordination graphs

Coordination graphs provide a tractable framework for cooperative multiagent decision making by decomposing the global pay off function into a sum of local terms. In this paper we review some distributed algorithms for action selection in a coordination graph and discuss their pros and cons. For real-time decision making we emphasize the need for anytime algorithms for action selection: these are algorithms that improve the quality of the solution over time. We describe variable elimination, coordinate ascent, and the max-plus algorithm, the latter being an instance of the belief propagation algorithm in Bayesian networks. We discuss some interesting open problems related to the use of the max-plus algorithm in real-time multiagent decision making

[1] Nikos A. Vlassis,et al. Sparse cooperative Q-learning , 2004, ICML.

[2] Nikos A. Vlassis,et al. Multi-robot decision making using coordination graphs , 2003 .

[3] Brendan J. Frey,et al. Factor graphs and the sum-product algorithm , 2001, IEEE Trans. Inf. Theory.

[4] Carlos Guestrin,et al. Multiagent Planning with Factored MDPs , 2001, NIPS.

[5] 伊藤孝行. Gerhard Weiss(編), Multiagent Systems : A Modern Approarch to Distributed Artificial Intelligence, The MIT Press, 1999年, US$60.00, ISBN4-621-05291-8 , 1999 .

[6] Judea Pearl,et al. Probabilistic reasoning in intelligent systems , 1988 .

[7] Martin J. Wainwright,et al. MAP estimation via agreement on (hyper)trees: Message-passing and linear programming , 2005, ArXiv.

[8] Martin J. Wainwright,et al. Tree consistency and bounds on the performance of the max-product algorithm and its generalizations , 2004, Stat. Comput..

[9] Guillermo Ricardo Simari,et al. Multiagent systems: a modern approach to distributed artificial intelligence , 2000 .

[10] Nikos Vlassis,et al. A Concise Introduction to Multiagent Systems and Distributed AI , 2003 .

[11] C. Tomlin,et al. Decentralized optimization, with application to multiple aircraft coordination , 2002, Proceedings of the 41st IEEE Conference on Decision and Control, 2002..

[12] Heinz Mühlenbein,et al. FDA -A Scalable Evolutionary Algorithm for the Optimization of Additively Decomposed Functions , 1999, Evolutionary Computation.

[13] Julie A. Adams,et al. Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence , 2001, AI Mag..

[14] Martin J. Wainwright,et al. MAP estimation via agreement on trees: message-passing and linear programming , 2005, IEEE Transactions on Information Theory.

[15] X. Jin. Factor graphs and the Sum-Product Algorithm , 2002 .