Speeding up Planning in Multiagent Settings Using CPU-GPU Architectures

Planning under uncertainty in multiagent settings is highly intractable because of history and plan space complexities. Probabilistic graphical models exploit the structure of the problem domain to mitigate the computational burden. In this article, we introduce the first parallelization of planning in multiagent settings on a CPU-GPU heterogeneous system. In particular, we focus on the algorithm for exactly solving interactive dynamic influence diagrams, which is a recognized graphical models for multiagent planning. Beyond parallelizing the standard Bayesian inference and the computation of decisions’ expected utilities, we also solve the other agents behavioral models in a parallel manner. The GPU-based approach provides significant speedup on two benchmark problems.

[1]  Yingke Chen,et al.  Team behavior in interactive dynamic influence diagrams with applications to ad hoc teams , 2014, AAMAS.

[2]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[3]  Daphne Koller,et al.  Multi-Agent Influence Diagrams for Representing and Solving Games , 2001, IJCAI.

[4]  Shlomo Zilberstein,et al.  Bounded Policy Iteration for Decentralized POMDPs , 2005, IJCAI.

[5]  Weiru Liu,et al.  Incorporating PGMs into a BDI Architecture , 2013, PRIMA.

[6]  Bo Li,et al.  Path planning for automated guided vehicles system via interactive dynamic influence diagrams with communication , 2011, 2011 9th IEEE International Conference on Control and Automation (ICCA).

[7]  Jaswinder Pal Singh,et al.  A parallel Lauritzen-Spiegelhalter algorithm for probabilistic inference , 1994, Proceedings of Supercomputing '94.

[8]  Neil Immerman,et al.  The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.

[9]  Edward J. Sondik,et al.  The Optimal Control of Partially Observable Markov Processes over a Finite Horizon , 1973, Oper. Res..

[10]  P. J. Gmytrasiewicz,et al.  A Framework for Sequential Planning in Multi-Agent Settings , 2005, AI&M.

[11]  Ronald A. Howard,et al.  Influence Diagrams , 2005, Decis. Anal..

[12]  Ole J. Mengshoel,et al.  Belief Propagation by Message Passing in Junction Trees: Computing Each Message Faster Using GPU Parallelization , 2011, UAI.

[13]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[14]  Yifeng Zeng,et al.  Exploiting Model Equivalences for Solving Interactive Dynamic Influence Diagrams , 2014, J. Artif. Intell. Res..

[15]  Naoki Fukuta,et al.  A Performance Optimization Support Framework for GPU-Based Traffic Simulations with Negotiating Agents , 2014, ANAC@AAMAS.

[16]  Yifeng Zeng,et al.  Graphical models for interactive POMDPs: representations and solutions , 2009, Autonomous Agents and Multi-Agent Systems.

[17]  Yifeng Zeng,et al.  Opponent modeling in a PGM framework , 2013, AAMAS.

[18]  Ya'akov Gal,et al.  Networks of Influence Diagrams: A Formalism for Representing Agents' Beliefs and Decision-Making Processes , 2008, J. Artif. Intell. Res..

[19]  Viktor K. Prasanna,et al.  Parallel exact inference on the cell broadband engine processor , 2008, HiPC 2008.

[20]  Viktor K. Prasanna,et al.  Parallel Exact Inference on a CPU-GPGPU Heterogenous System , 2010, 2010 39th International Conference on Parallel Processing.