Anytime algorithms for multiagent decision making using coordination graphs

Coordination graphs provide a tractable framework for cooperative multiagent decision making by decomposing the global pay off function into a sum of local terms. In this paper we review some distributed algorithms for action selection in a coordination graph and discuss their pros and cons. For real-time decision making we emphasize the need for anytime algorithms for action selection: these are algorithms that improve the quality of the solution over time. We describe variable elimination, coordinate ascent, and the max-plus algorithm, the latter being an instance of the belief propagation algorithm in Bayesian networks. We discuss some interesting open problems related to the use of the max-plus algorithm in real-time multiagent decision making

[1]  Nikos A. Vlassis,et al.  Sparse cooperative Q-learning , 2004, ICML.

[2]  Nikos A. Vlassis,et al.  Multi-robot decision making using coordination graphs , 2003 .

[3]  Brendan J. Frey,et al.  Factor graphs and the sum-product algorithm , 2001, IEEE Trans. Inf. Theory.

[4]  Carlos Guestrin,et al.  Multiagent Planning with Factored MDPs , 2001, NIPS.

[5]  伊藤 孝行 Gerhard Weiss(編), Multiagent Systems : A Modern Approarch to Distributed Artificial Intelligence, The MIT Press, 1999年, US$60.00, ISBN4-621-05291-8 , 1999 .

[6]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[7]  Martin J. Wainwright,et al.  MAP estimation via agreement on (hyper)trees: Message-passing and linear programming , 2005, ArXiv.

[8]  Martin J. Wainwright,et al.  Tree consistency and bounds on the performance of the max-product algorithm and its generalizations , 2004, Stat. Comput..

[9]  Guillermo Ricardo Simari,et al.  Multiagent systems: a modern approach to distributed artificial intelligence , 2000 .

[10]  Nikos Vlassis,et al.  A Concise Introduction to Multiagent Systems and Distributed AI , 2003 .

[11]  C. Tomlin,et al.  Decentralized optimization, with application to multiple aircraft coordination , 2002, Proceedings of the 41st IEEE Conference on Decision and Control, 2002..

[12]  Heinz Mühlenbein,et al.  FDA -A Scalable Evolutionary Algorithm for the Optimization of Additively Decomposed Functions , 1999, Evolutionary Computation.

[13]  Julie A. Adams,et al.  Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence , 2001, AI Mag..

[14]  Martin J. Wainwright,et al.  MAP estimation via agreement on trees: message-passing and linear programming , 2005, IEEE Transactions on Information Theory.

[15]  X. Jin Factor graphs and the Sum-Product Algorithm , 2002 .