论文信息 - On Randomized Fictitious Play for Approximating Saddle Points Over Convex Sets

On Randomized Fictitious Play for Approximating Saddle Points Over Convex Sets

Given two bounded convex sets $$X\subseteq \mathbb R^m$$X⊆Rm and $$Y\subseteq \mathbb R^n,$$Y⊆Rn, specified by membership oracles, and a continuous convex–concave function $$F:X\times Y\rightarrow \mathbb R$$F:X×Y→R, we consider the problem of computing an $$\varepsilon $$ε-approximate saddle point, that is, a pair $$(x^*,y^*)\in X\times Y$$(x∗,y∗)∈X×Y such that $$\sup _{y\in Y} F(x^*,y)\le \inf _{x\in X}F(x,y^*)+\varepsilon .$$supy∈YF(x∗,y)≤infx∈XF(x,y∗)+ε. Grigoriadis and Khachiyan (Oper Res Lett 18(2):53–58, 1995) gave a simple randomized variant of fictitious play for computing an $$\varepsilon $$ε-approximate saddle point for matrix games, that is, when $$F$$F is bilinear and the sets $$X$$X and $$Y$$Y are simplices. In this paper, we extend their method to the general case. In particular, we show that, for functions of constant “width”, an $$\varepsilon $$ε-approximate saddle point can be computed using $$O^* \big (\frac{(n+m)}{\varepsilon ^2}\ln R \big )$$O∗((n+m)ε2lnR) random samples from log-concave distributions over the convex sets $$X$$X and $$Y$$Y. It is assumed that $$X$$X and $$Y$$Y have inscribed balls of radius $$1/R$$1/R and circumscribing balls of radius $$R$$R. As a consequence, we obtain a simple randomized polynomial-time algorithm that computes such an approximation faster than known methods for problems with bounded width and when $$\varepsilon \in (0,1)$$ε∈(0,1) is a fixed, but arbitrarily small constant. Our main tool for achieving this result is the combination of the randomized fictitious play with the recently developed results on sampling from convex sets.

Kurt Mehlhorn | Khaled M. Elbassioni | Kazuhisa Makino | Fahimeh Ramezani

[1] Sanjeev Arora,et al. Fast algorithms for approximate semidefinite programming using the multiplicative weights update method , 2005, 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS'05).

[2] H. Groemer,et al. On the min-max theorem for finite two-person zero-sum games , 1967 .

[3] Santosh S. Vempala,et al. Solving convex programs by random walks , 2002, STOC '02.

[4] Sanjeev Arora,et al. A combinatorial, primal-dual approach to semidefinite programs , 2007, STOC '07.

[5] R. Khandekar. Lagrangian relaxation based algorithms for convex programming problems , 2004 .

[6] H. N. Shapiro. Note on a Computation Method in the Theory of Games , 1958 .

[7] Paul R. Thie,et al. Two‐Person, Zero‐Sum Games , 2011 .

[8] Santosh S. Vempala,et al. The geometry of logconcave functions and sampling algorithms , 2007, Random Struct. Algorithms.

[9] L. McLinden,et al. A Minimax Theorem , 1984, Math. Oper. Res..

[10] Noam Nisan,et al. A parallel approximation algorithm for positive linear programming , 1993, STOC.

[11] Oded Regev,et al. Simulating Quantum Correlations with Finite Communication , 2007, FOCS.

[12] Naveen Garg,et al. Fractional Covering with Upper Bounds on the Variables: Solving LPs with Negative Entries , 2004, ESA.

[13] Leonid Khachiyan,et al. A sublinear-time randomized approximation algorithm for matrix games , 1995, Oper. Res. Lett..

[14] J. Robinson. AN ITERATIVE METHOD OF SOLVING A GAME , 1951, Classics in Game Theory.

[15] Klaus Jansen. Approximation Algorithm for the Mixed Fractional Packing and Covering Problem , 2006, SIAM J. Optim..

[16] Neal E. Young,et al. Sequential and parallel algorithms for mixed packing and covering , 2001, Proceedings 2001 IEEE International Conference on Cluster Computing.

[17] Alexander Schrijver,et al. Theory of linear and integer programming , 1986, Wiley-Interscience series in discrete mathematics and optimization.

[18] Leonid Khachiyan,et al. Fast Approximation Schemes for Convex Programs with Many Blocks and Coupling Constraints , 1994, SIAM J. Optim..

[19] Santosh S. Vempala,et al. Fast Algorithms for Logconcave Functions: Sampling, Rounding, Integration and Optimization , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[20] J. G. Pierce,et al. Geometric Algorithms and Combinatorial Optimization , 2016 .

[21] A. Wald. Generalization of a Theorem By v. Neumann Concerning Zero Sum Two Person Games , 1945 .

[22] Zoltán Sebestyén. A general saddle point theorem and its applications , 1990 .

[23] Manfred K. Warmuth,et al. The Weighted Majority Algorithm , 1994, Inf. Comput..

[24] Éva Tardos,et al. Fast approximation algorithms for fractional packing and covering problems , 1991, [1991] Proceedings 32nd Annual Symposium of Foundations of Computer Science.

[25] Sanjeev Arora,et al. Efficient algorithms for online convex optimization and their applications , 2006 .

[26] O. H. Brownlee,et al. ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .

[27] J. Hofbauer,et al. BEST RESPONSE DYNAMICS FOR CONTINUOUS ZERO{SUM GAMES , 2005 .

[28] Santosh S. Vempala,et al. Simulated Annealing for Convex Optimization , 2004 .

[29] S. Vempala. Geometric Random Walks: a Survey , 2007 .

[30] H Tateishi,et al. On Some Minimax Theorems , 1989 .

[31] N. Biggs. GEOMETRIC ALGORITHMS AND COMBINATORIAL OPTIMIZATION: (Algorithms and Combinatorics 2) , 1990 .

[32] Jochen Könemann,et al. Faster and simpler algorithms for multicommodity flow and other fractional packing problems , 1998, Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No.98CB36280).

[33] Klaus Jansen,et al. Faster and simpler approximation algorithms for mixed packing and covering problems , 2007, Theor. Comput. Sci..

[34] In-Sook Kim,et al. Saddle point theorems on generalized convex spaces. , 2000 .

[35] Danny Raz,et al. Fast, Distributed Approximation Algorithms for Positive Linear Programming with Applications to Flow Control , 2004, SIAM J. Comput..

[36] Yves Robert,et al. Linear Scheduling Is Nearly Optimal , 1991, Parallel Process. Lett..

[37] Y. Freund,et al. Adaptive game playing using multiplicative weights , 1999 .

[38] Telikepalli Kavitha,et al. Popular mixed matchings , 2009, Theor. Comput. Sci..

[39] Satyen Kale. Efficient algorithms using the multiplicative weights update method , 2007 .

[40] Sanjeev Arora,et al. The Multiplicative Weights Update Method: a Meta-Algorithm and Applications , 2012, Theory Comput..

[41] Leonid Khachiyan,et al. Approximate Max-Min Resource Sharing for Structured Concave Optimization , 2000, SIAM J. Optim..

[42] Santosh S. Vempala,et al. Hit-and-run from a corner , 2004, STOC '04.

[43] Klaus Jansen,et al. Approximation Algorithms for Mixed Fractional Packing and Covering Problems , 2004, WAOA.

[44] Nesa L'abbe Wu,et al. Linear programming and extensions , 1981 .

[45] Leonid Khachiyan,et al. Coordination Complexity of Parallel Price-Directive Decomposition , 1996, Math. Oper. Res..

[46] A. Belenky. A 2-person game on a polyhedral set of connected strategies☆ , 1997 .

[47] N. N. Vorobʹev,et al. Foundations of Game Theory: Noncooperative Games , 1994 .