On Randomized Fictitious Play for Approximating Saddle Points Over Convex Sets

Given two bounded convex sets $$X\subseteq \mathbb R^m$$X⊆Rm and $$Y\subseteq \mathbb R^n,$$Y⊆Rn, specified by membership oracles, and a continuous convex–concave function $$F:X\times Y\rightarrow \mathbb R$$F:X×Y→R, we consider the problem of computing an $$\varepsilon $$ε-approximate saddle point, that is, a pair $$(x^*,y^*)\in X\times Y$$(x∗,y∗)∈X×Y such that $$\sup _{y\in Y} F(x^*,y)\le \inf _{x\in X}F(x,y^*)+\varepsilon .$$supy∈YF(x∗,y)≤infx∈XF(x,y∗)+ε. Grigoriadis and Khachiyan (Oper Res Lett 18(2):53–58, 1995) gave a simple randomized variant of fictitious play for computing an $$\varepsilon $$ε-approximate saddle point for matrix games, that is, when $$F$$F is bilinear and the sets $$X$$X and $$Y$$Y are simplices. In this paper, we extend their method to the general case. In particular, we show that, for functions of constant “width”, an $$\varepsilon $$ε-approximate saddle point can be computed using $$O^* \big (\frac{(n+m)}{\varepsilon ^2}\ln R \big )$$O∗((n+m)ε2lnR) random samples from log-concave distributions over the convex sets $$X$$X and $$Y$$Y. It is assumed that $$X$$X and $$Y$$Y have inscribed balls of radius $$1/R$$1/R and circumscribing balls of radius $$R$$R. As a consequence, we obtain a simple randomized polynomial-time algorithm that computes such an approximation faster than known methods for problems with bounded width and when $$\varepsilon \in (0,1)$$ε∈(0,1) is a fixed, but arbitrarily small constant. Our main tool for achieving this result is the combination of the randomized fictitious play with the recently developed results on sampling from convex sets.

[1]  Sanjeev Arora,et al.  Fast algorithms for approximate semidefinite programming using the multiplicative weights update method , 2005, 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS'05).

[2]  H. Groemer,et al.  On the min-max theorem for finite two-person zero-sum games , 1967 .

[3]  Santosh S. Vempala,et al.  Solving convex programs by random walks , 2002, STOC '02.

[4]  Sanjeev Arora,et al.  A combinatorial, primal-dual approach to semidefinite programs , 2007, STOC '07.

[5]  R. Khandekar Lagrangian relaxation based algorithms for convex programming problems , 2004 .

[6]  H. N. Shapiro Note on a Computation Method in the Theory of Games , 1958 .

[7]  Paul R. Thie,et al.  Two‐Person, Zero‐Sum Games , 2011 .

[8]  Santosh S. Vempala,et al.  The geometry of logconcave functions and sampling algorithms , 2007, Random Struct. Algorithms.

[9]  L. McLinden,et al.  A Minimax Theorem , 1984, Math. Oper. Res..

[10]  Noam Nisan,et al.  A parallel approximation algorithm for positive linear programming , 1993, STOC.

[11]  Oded Regev,et al.  Simulating Quantum Correlations with Finite Communication , 2007, FOCS.

[12]  Naveen Garg,et al.  Fractional Covering with Upper Bounds on the Variables: Solving LPs with Negative Entries , 2004, ESA.

[13]  Leonid Khachiyan,et al.  A sublinear-time randomized approximation algorithm for matrix games , 1995, Oper. Res. Lett..

[14]  J. Robinson AN ITERATIVE METHOD OF SOLVING A GAME , 1951, Classics in Game Theory.

[15]  Klaus Jansen Approximation Algorithm for the Mixed Fractional Packing and Covering Problem , 2006, SIAM J. Optim..

[16]  Neal E. Young,et al.  Sequential and parallel algorithms for mixed packing and covering , 2001, Proceedings 2001 IEEE International Conference on Cluster Computing.

[17]  Alexander Schrijver,et al.  Theory of linear and integer programming , 1986, Wiley-Interscience series in discrete mathematics and optimization.

[18]  Leonid Khachiyan,et al.  Fast Approximation Schemes for Convex Programs with Many Blocks and Coupling Constraints , 1994, SIAM J. Optim..

[19]  Santosh S. Vempala,et al.  Fast Algorithms for Logconcave Functions: Sampling, Rounding, Integration and Optimization , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[20]  J. G. Pierce,et al.  Geometric Algorithms and Combinatorial Optimization , 2016 .

[21]  A. Wald Generalization of a Theorem By v. Neumann Concerning Zero Sum Two Person Games , 1945 .

[22]  Zoltán Sebestyén A general saddle point theorem and its applications , 1990 .

[23]  Manfred K. Warmuth,et al.  The Weighted Majority Algorithm , 1994, Inf. Comput..

[24]  Éva Tardos,et al.  Fast approximation algorithms for fractional packing and covering problems , 1991, [1991] Proceedings 32nd Annual Symposium of Foundations of Computer Science.

[25]  Sanjeev Arora,et al.  Efficient algorithms for online convex optimization and their applications , 2006 .

[26]  O. H. Brownlee,et al.  ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .

[27]  J. Hofbauer,et al.  BEST RESPONSE DYNAMICS FOR CONTINUOUS ZERO{SUM GAMES , 2005 .

[28]  Santosh S. Vempala,et al.  Simulated Annealing for Convex Optimization , 2004 .

[29]  S. Vempala Geometric Random Walks: a Survey , 2007 .

[30]  H Tateishi,et al.  On Some Minimax Theorems , 1989 .

[31]  N. Biggs GEOMETRIC ALGORITHMS AND COMBINATORIAL OPTIMIZATION: (Algorithms and Combinatorics 2) , 1990 .

[32]  Jochen Könemann,et al.  Faster and simpler algorithms for multicommodity flow and other fractional packing problems , 1998, Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No.98CB36280).

[33]  Klaus Jansen,et al.  Faster and simpler approximation algorithms for mixed packing and covering problems , 2007, Theor. Comput. Sci..

[34]  In-Sook Kim,et al.  Saddle point theorems on generalized convex spaces. , 2000 .

[35]  Danny Raz,et al.  Fast, Distributed Approximation Algorithms for Positive Linear Programming with Applications to Flow Control , 2004, SIAM J. Comput..

[36]  Yves Robert,et al.  Linear Scheduling Is Nearly Optimal , 1991, Parallel Process. Lett..

[37]  Y. Freund,et al.  Adaptive game playing using multiplicative weights , 1999 .

[38]  Telikepalli Kavitha,et al.  Popular mixed matchings , 2009, Theor. Comput. Sci..

[39]  Satyen Kale Efficient algorithms using the multiplicative weights update method , 2007 .

[40]  Sanjeev Arora,et al.  The Multiplicative Weights Update Method: a Meta-Algorithm and Applications , 2012, Theory Comput..

[41]  Leonid Khachiyan,et al.  Approximate Max-Min Resource Sharing for Structured Concave Optimization , 2000, SIAM J. Optim..

[42]  Santosh S. Vempala,et al.  Hit-and-run from a corner , 2004, STOC '04.

[43]  Klaus Jansen,et al.  Approximation Algorithms for Mixed Fractional Packing and Covering Problems , 2004, WAOA.

[44]  Nesa L'abbe Wu,et al.  Linear programming and extensions , 1981 .

[45]  Leonid Khachiyan,et al.  Coordination Complexity of Parallel Price-Directive Decomposition , 1996, Math. Oper. Res..

[46]  A. Belenky A 2-person game on a polyhedral set of connected strategies☆ , 1997 .

[47]  N. N. Vorobʹev,et al.  Foundations of Game Theory: Noncooperative Games , 1994 .