论文信息 - Competing Prediction Algorithms - 字舞流文

Competing Prediction Algorithms

Prediction is a well-studied machine learning task, and prediction algorithms are core ingredients in online products and services. Despite their centrality in the competition between online companies who offer prediction-based products, the strategic use of prediction algorithms remains unexplored. The goal of this paper is to examine strategic use of prediction algorithms. We introduce a novel game-theoretic setting that is based on the PAC learning framework, where each player (aka a prediction algorithm at competition) seeks to maximize the sum of points for which it produces an accurate prediction and the others do not. We show that algorithms aiming at generalization may wittingly miss-predict some points to perform better than others on expectation. We analyze the empirical game, i.e. the game induced on a given sample, prove that it always possesses a pure Nash equilibrium, and show that every better-response learning process converges. Moreover, our learning-theoretic analysis suggests that players can, with high probability, learn an approximate pure Nash equilibrium for the whole population using a small number of samples.

Moshe Tennenholtz | Omer Ben-Porat | Moshe Tennenholtz | Omer Ben-Porat

[1] I. Althöfer. On sparse approximations to randomized strategies and convex combinations , 1994 .

[2] Yishay Mansour,et al. Competing Bandits: Learning Under Competition , 2017, ITCS.

[3] Yannai A. Gonczarowski,et al. Efficient empirical revenue maximization in single-parameter auction environments , 2016, STOC.

[4] Tim Roughgarden,et al. On the Pseudo-Dimension of Nearly Optimal Auctions , 2015, NIPS.

[5] Ariel D. Procaccia,et al. Collaborative PAC Learning , 2017, NIPS.

[6] Alvin E. Roth,et al. A choice prediction competition: Choices from experience and from description , 2010 .

[7] R. Rosenthal. A class of games possessing pure-strategy Nash equilibria , 1973 .

[8] H. Simon,et al. Rational choice and the structure of the environment. , 1956, Psychological review.

[9] Noam Nisan,et al. Algorithmic Mechanism Design , 2001, Games Econ. Behav..

[10] Adam Tauman Kalai,et al. Dueling algorithms , 2011, STOC '11.

[11] Leslie G. Valiant,et al. A theory of the learnable , 1984, CACM.

[12] Yakov Babichenko,et al. Empirical Distribution of Equilibrium Play and Its Testing Application , 2013, Math. Oper. Res..

[13] Richard Cole,et al. The sample complexity of revenue maximization , 2014, STOC.

[14] I. Erev,et al. Small feedback‐based decisions and their limited correspondence to description‐based decisions , 2003 .

[15] Moshe Tennenholtz,et al. Best Response Regression , 2017, NIPS.

[16] H. Hotelling. Stability in Competition , 1929 .

[17] Vladimir Vapnik,et al. Chervonenkis: On the uniform convergence of relative frequencies of events to their probabilities , 1971 .

[18] Roger B. Myerson,et al. Optimal Auction Design , 1981, Math. Oper. Res..

[19] David H. Wolpert,et al. No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[20] L. Shapley,et al. Potential Games , 1994 .

[21] Aranyak Mehta,et al. Playing large games using simple strategies , 2003, EC '03.