Unleashing Dec-MDPs in Security Games: Enabling Effective Defender Teamwork

Multiagent teamwork and defender-attacker security games are two areas currently receiving significant attention within multiagent systems research. Unfortunately, despite the need for effective teamwork among multiple defenders, little has been done to harness teamwork research in security games. This paper is the first to remedy this situation by integrating the powerful teamwork mechanisms offered by Dec-MDPs into security games. We offer the following novel contributions: (i) New models of security games in which a defender team's pure strategy is defined as a Dec-MDP policy, addressing coordination under uncertainty; (ii) New algorithms based on column generation that enable efficient generation of mixed strategies under this new model; (iii) Techniques for handling global events during defender execution, enabling effective teamwork; (iv) An exploration of the robustness of randomized pure strategies. The paper opens the door to a potentially new area combining computational game theory and multiagent teamwork.
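
To make contribution (ii) concrete, here is a minimal sketch of the generic column-generation loop that this style of algorithm builds on: a master LP computes the defender's maximin mixed strategy over the pure strategies (columns) generated so far, and its dual variables, read as an attacker distribution, drive a best-response oracle that proposes the next column. This is an illustration under strong assumptions, not the paper's algorithm: the game is simplified to zero-sum, each column is reduced to a coverage-probability vector, and the payoff constants and the fixed candidate pool that stands in for the paper's Dec-MDP best-response oracle are hypothetical. It requires numpy and scipy.

```python
# Hedged sketch of column generation for a defender mixed strategy.
# Assumptions (not from the paper): zero-sum payoffs, columns represented
# as coverage vectors, and a fixed candidate pool replacing the Dec-MDP
# best-response oracle.
import numpy as np
from scipy.optimize import linprog

N_TARGETS = 3
COVERED, UNCOVERED = 1.0, -2.0  # assumed defender payoffs at each target


def defender_utility(coverage):
    # Expected defender utility at each target given coverage probabilities.
    return coverage * COVERED + (1.0 - coverage) * UNCOVERED


def best_response_policy(attacker_dist, candidate_pool):
    # Stand-in for the Dec-MDP best-response oracle: the paper would solve
    # a Dec-MDP here; we simply scan a fixed pool of coverage vectors.
    scores = [attacker_dist @ defender_utility(c) for c in candidate_pool]
    return candidate_pool[int(np.argmax(scores))]


def solve_master(columns):
    # Maximin master LP over current columns:
    #   max v  s.t.  sum_j x_j * U_d(t, col_j) >= v  (each target t),
    #                sum_j x_j = 1,  x >= 0.
    k = len(columns)
    U = np.array([defender_utility(c) for c in columns]).T  # targets x cols
    c = np.zeros(k + 1)
    c[-1] = -1.0  # variables are (x_1..x_k, v); minimize -v
    A_ub = np.hstack([-U, np.ones((N_TARGETS, 1))])  # v - sum_j x_j U <= 0
    b_ub = np.zeros(N_TARGETS)
    A_eq = np.zeros((1, k + 1))
    A_eq[0, :k] = 1.0
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=[1.0],
                  bounds=[(0, None)] * k + [(None, None)], method="highs")
    x, v = res.x[:k], res.x[-1]
    attacker = -res.ineqlin.marginals  # LP duals = attacker mixed strategy
    return x, v, attacker / attacker.sum()


# Each "column" is the coverage vector induced by one joint defender policy.
pool = [np.array(c) for c in
        ([0.9, 0.1, 0.1], [0.1, 0.9, 0.1], [0.1, 0.1, 0.9], [0.4, 0.4, 0.4])]
columns = [pool[0]]
for _ in range(10):
    x, v, attacker = solve_master(columns)
    new_col = best_response_policy(attacker, pool)
    if any(np.allclose(new_col, c) for c in columns):
        break  # no improving column: mixed strategy is optimal over the pool
    columns.append(new_col)
print("game value:", round(v, 3), "mixed strategy:", np.round(x, 3))
```

The oracle step is presumably where the Dec-MDP machinery enters in the full model: generating a new column means computing a joint defender policy, and column generation is attractive precisely because only a few such expensive best-response computations are needed rather than enumerating all joint policies up front.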
