论文信息 - Probably Almost Stable Strategy Profiles in Simulation-Based Games

Probably Almost Stable Strategy Profiles in Simulation-Based Games

Empirical studies of strategic settings commonly model player interactions under supposed game-theoretic equilibrium behavior, to predict what rational agents might do. But in sufficiently complex settings, analysts cannot solve for exact equilibria, and may resort to solving a restricted game where agents are limited to a tractable subset of strategies. This provides a solution, but one with unclear strategic stability in the original game. We propose a search and evaluation method that can guarantee a well-defined strategic stability property in the profile that it yields, even if only a small subset of possible strategies in a game have been analyzed. The method achieves this result by combining statistical confidence interval estimation, a multiple test correction, and empirical game-theoretic analysis. We also present an extension of the method that more often finds genuine approximate equilibria, by using simulated annealing instead of simple random search for strategy exploration. We demonstrate efficacy in two example settings: the first-price sealed-bid auction, and a cybersecurity game.

Michael P. Wellman | Mason Wright | Mason Wright

[1] David Silver,et al. A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning , 2017, NIPS.

[2] Michael P. Wellman,et al. Multi-Stage Attack Graph Security Games: Heuristic Strategies, with Empirical Game-Theoretic Analysis , 2017, MTD@CCS.

[3] O. Mangasarian. Equilibrium Points of Bimatrix Games , 1964 .

[4] Michael P. Wellman,et al. Strategy exploration in empirical games , 2010, AAMAS.

[5] Michael P. Wellman,et al. Strategic Payment Routing in Financial Credit Networks , 2016, EC.

[6] Cynthia A. Phillips,et al. A graph-based system for network-vulnerability analysis , 1998, NSPW '98.

[7] Michael P. Wellman,et al. Stronger CDA strategies through empirical game-theoretic analysis and reinforcement learning , 2009, AAMAS.

[8] Demosthenis Teneketzis,et al. Optimal Defense Policies for Partially Observable Spreading Processes on Bayesian Attack Graphs , 2015, MTD@CCS.

[9] E. S. Pearson,et al. THE USE OF CONFIDENCE OR FIDUCIAL LIMITS ILLUSTRATED IN THE CASE OF THE BINOMIAL , 1934 .

[10] Michael P. Wellman,et al. Generating trading agent strategies: Analytic and empirical methods for infinite and large games , 2005 .

[11] Avrim Blum,et al. Planning in the Presence of Cost Functions Controlled by an Adversary , 2003, ICML.