Stochastic Stackelberg games

In this paper, we consider a discrete-time stochastic Stackelberg game where there is a defender (also called leader) who has to defend a target and an attacker (also called follower). Both attacker and defender have conditionally independent private types, conditioned on action and previous state, that evolve as controlled Markov processes. The objective is to compute the stochastic Stackelberg equilibrium of the game where defender commits to a strategy. The attacker's strategy is the best response to the defender strategy and defender's strategy is optimum given the attacker plays the best response. In general, computing such equilibrium involves solving a fixed-point equation for the whole game. In this paper, we present an algorithm that computes such strategies by solving smaller fixed-point equations for each time $t$. This reduces the computational complexity of the problem from double exponential in time to linear in time. Based on this algorithm, we compute stochastic Stackelberg equilibrium of a security example.

[1]  Maria-Florina Balcan,et al.  Commitment Without Regrets: Online Learning in Stackelberg Security Games , 2015, EC.

[2]  Milind Tambe,et al.  "A Game of Thrones": When Human Behavior Models Compete in Repeated Stackelberg Security Games , 2015, AAMAS.

[3]  Rong Yang,et al.  Adaptive resource allocation for wildlife protection against illegal poachers , 2014, AAMAS.

[4]  Achilleas Anastasopoulos,et al.  A systematic process for evaluating structured perfect Bayesian equilibria in dynamic games with asymmetric information , 2015, 2016 American Control Conference (ACC).

[5]  E. Maskin,et al.  The Existence of Equilibrium in Discontinuous Economic Games, II: Applications , 1986 .

[6]  Milind Tambe,et al.  Security and Game Theory - Algorithms, Deployed Systems, Lessons Learned , 2011 .

[7]  Ashutosh Nayyar,et al.  Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach , 2012, IEEE Transactions on Automatic Control.

[8]  Nicola Basilico,et al.  Leader-follower strategies for robotic patrolling in environments with arbitrary topologies , 2009, AAMAS.

[9]  Sarit Kraus,et al.  Deployed ARMOR protection: the application of a game theoretic model for security at the Los Angeles International Airport , 2008, AAMAS.

[10]  Manish Jain,et al.  Computing optimal randomized resource allocations for massive security games , 2009, AAMAS.

[11]  Gerald Tesauro,et al.  Playing repeated Stackelberg games with unknown opponents , 2012, AAMAS.

[12]  Akira Okada Perfect Bayesian Equilibrium and Sequential Equilibrium , 2011 .

[13]  Achilleas Anastasopoulos,et al.  Signaling equilibria for dynamic LQG games with asymmetric information , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[14]  Achilleas Anastasopoulos,et al.  Decentralized Bayesian learning in dynamic games , 2016, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[15]  Vincent Conitzer,et al.  Computing the optimal strategy to commit to , 2006, EC '06.

[16]  E. Maskin,et al.  The Existence of Equilibrium in Discontinuous Economic Games, I: Theory , 1986 .