GANGs: Generative Adversarial Network Games

Generative Adversarial Networks (GANs) have become one of the most successful frameworks for unsupervised generative modeling. Because GANs are difficult to train, much research has focused on improving their training. However, very little of this research has directly exploited game-theoretic techniques. We introduce Generative Adversarial Network Games (GANGs), which explicitly model a finite zero-sum game between a generator ($G$) and a classifier ($C$) that use mixed strategies. The size of these games precludes exact solution methods; we therefore define resource-bounded best responses (RBBRs), and a resource-bounded Nash Equilibrium (RB-NE) as a pair of mixed strategies such that neither $G$ nor $C$ can find a better RBBR. The RB-NE solution concept is richer than the notion of 'local Nash equilibria' in that it captures not only failures to escape local optima of gradient descent, but also applies to any approximate best-response computation, including methods with random restarts. To validate our approach, we solve GANGs with the Parallel Nash Memory algorithm, which provably and monotonically converges to an RB-NE. We compare our results to standard GAN setups, and demonstrate that our method deals well with typical GAN problems such as mode collapse, partial mode coverage, and forgetting.
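The RB-NE condition described in the abstract can be illustrated on a toy finite zero-sum game. The following is a minimal sketch, not the authors' implementation: it assumes the resource bound is a sampling budget (each player may evaluate only `budget` randomly chosen pure strategies), and the names `expected_payoff`, `rbbr_generator`, `rbbr_classifier`, and `is_rb_ne` are hypothetical. The matrix `U` holds the payoff to $G$; $C$ receives `-U`.

```python
import random


def expected_payoff(U, p, q):
    """Expected payoff to G when G plays mixed strategy p and C plays q."""
    return sum(p[i] * U[i][j] * q[j]
               for i in range(len(p)) for j in range(len(q)))


def rbbr_generator(U, q, budget, rng):
    """Assumed resource bound: G evaluates only `budget` sampled pure strategies
    and returns the one with the highest expected payoff against q."""
    candidates = rng.sample(range(len(U)), min(budget, len(U)))
    return max(candidates,
               key=lambda i: sum(U[i][j] * q[j] for j in range(len(q))))


def rbbr_classifier(U, p, budget, rng):
    """Same assumed bound for C, who minimizes G's payoff (zero-sum)."""
    n_cols = len(U[0])
    candidates = rng.sample(range(n_cols), min(budget, n_cols))
    return min(candidates,
               key=lambda j: sum(p[i] * U[i][j] for i in range(len(p))))


def is_rb_ne(U, p, q, budget, rng, tol=1e-9):
    """True if neither player's resource-bounded best response improves on (p, q)."""
    value = expected_payoff(U, p, q)
    i = rbbr_generator(U, q, budget, rng)
    j = rbbr_classifier(U, p, budget, rng)
    g_gain = sum(U[i][k] * q[k] for k in range(len(q))) - value
    c_gain = value - sum(p[k] * U[k][j] for k in range(len(p)))
    return g_gain <= tol and c_gain <= tol


# Toy game (matching pennies); payoffs are to G.
U = [[1, -1], [-1, 1]]
uniform = [0.5, 0.5]
pure = [1.0, 0.0]

uniform_is_rb_ne = is_rb_ne(U, uniform, uniform, budget=2, rng=random.Random(0))
pure_is_rb_ne = is_rb_ne(U, pure, pure, budget=2, rng=random.Random(0))
print(uniform_is_rb_ne, pure_is_rb_ne)
```

With a budget that covers every pure strategy, the check reduces to the ordinary Nash condition. With a smaller budget, a strategy pair can pass the check merely because the sampled candidates happened to miss an improving response, which is the sense in which an RB-NE depends on the players' computational resources.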

Edwin D. de Jong | Rahul Savani | Frans A. Oliehoek | Roderich Groß | Jose Gallego-Posada | Elise van der Pol
