Equilibrium approximation in simulation-based extensive-form games

The class of simulation--based games, in which the payoffs are generated as an output of a simulation process, recently received a lot of attention in literature. In this paper, we extend such class to games in extensive form with continuous actions and perfect information. We design two convergent algorithms to find an approximate subgame perfect equilibrium (SPE) and an approximate Nash equilibrium (NE) respectively. Our algorithms can exploit different optimization techniques. In particular, we use: simulated annealing, cross entropy method, and Lipschitz optimization. We produce an extensive experimental evaluation of the performance of our algorithms in terms of approximation degree of the optimal solution and number of evaluated samples. Finding approximate NE and SPE requires exponential time in the game tree depth: an SPE can be computed in game trees with a small depth, while the computation of an NE is easier.

[1]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[2]  M. Dufwenberg Game theory. , 2011, Wiley interdisciplinary reviews. Cognitive science.

[3]  Nicola Gatti,et al.  Bargaining over multiple issues in finite horizon alternating-offers protocol , 2006, Annals of Mathematics and Artificial Intelligence.

[4]  B. Shubert A Sequential Method Seeking the Global Maximum of a Function , 1972 .

[5]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[6]  Robert L. Smith,et al.  Adaptive search with stochastic acceptance probabilities for global optimization , 2008, Oper. Res. Lett..

[7]  Michael P. Wellman,et al.  Stochastic Search Methods for Nash Equilibrium Approximation in Simulation-based Games , 2022 .

[8]  Dirk P. Kroese,et al.  The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation and Machine Learning , 2004 .

[9]  Ivan Marsá-Maestre,et al.  Avoiding the prisoner's dilemma in auction-based negotiations for highly rugged utility spaces , 2010, AAMAS.

[10]  M. Locatelli Simulated Annealing Algorithms for Continuous Global Optimization: Convergence Conditions , 2000 .

[11]  M. E. Johnson,et al.  Generalized simulated annealing for function optimization , 1986 .

[12]  Dirk P. Kroese,et al.  The Cross Entropy Method: A Unified Approach To Combinatorial Optimization, Monte-carlo Simulation (Information Science and Statistics) , 2004 .

[13]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[14]  Yoav Shoham,et al.  Multiagent Systems - Algorithmic, Game-Theoretic, and Logical Foundations , 2009 .

[15]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[16]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[17]  L. Margolin,et al.  On the Convergence of the Cross-Entropy Method , 2005, Ann. Oper. Res..

[18]  C. Harris,et al.  Existence and Characterization of Perfect Equilibrium in Games of Perfect Information , 1985 .

[19]  Steven G. Louie,et al.  A Monte carlo simulated annealing approach to optimization over continuous variables , 1984 .

[20]  P. Pardalos,et al.  Handbook of global optimization , 1995 .

[21]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[22]  Marie-Françoise Roy,et al.  Real algebraic geometry , 1992 .

[23]  Michael P. Wellman,et al.  Constrained automated mechanism design for infinite games of incomplete information , 2007, Autonomous Agents and Multi-Agent Systems.

[24]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.