Paper Information - Continuous Time Planning for Multiagent Teams with Temporal Constraints

Continuous state DEC-MDPs are critical for agent teams in domains involving resources such as time, but scaling them up is a significant challenge. To meet this challenge, we first introduce a novel continuous-time DEC-MDP model that exploits transition independence in domains with temporal constraints. More importantly, we present a new locally optimal algorithm called SPAC. Compared to the best previous algorithm, SPAC finds solutions of comparable quality substantially faster; SPAC also scales to larger teams of agents.

Milind Tambe | Zhengyu Yin
