Paper information - Potential-Based Algorithms in On-Line Prediction and Game Theory

In this paper we show that several known algorithms for sequential prediction problems (including Weighted Majority and the quasi-additive family of Grove, Littlestone, and Schuurmans), for playing iterated games (including Freund and Schapire’s Hedge and MW, as well as the Λ-strategies of Hart and Mas-Colell), and for boosting (including AdaBoost) are special cases of a general decision strategy based on the notion of potential. By analyzing this strategy we derive known performance bounds, as well as new bounds, as simple corollaries of a single general theorem. Besides offering a new and unified view of a large family of algorithms, we establish a connection between potential-based analysis in learning and its counterparts independently developed in game theory. By exploiting this connection, we show that certain learning problems are instances of more general game-theoretic problems. In particular, we describe a notion of generalized regret and show its applications in learning theory.

J. Shawe-Taylor

[1] Ehud Lehrer, et al. A wide range no-regret theorem, 2003, Games Econ. Behav.

[2] Claudio Gentile, et al. A Second-Order Perceptron Algorithm, 2002, SIAM J. Comput.

[3] Claudio Gentile, et al. Adaptive and Self-Confident On-Line Learning Algorithms, 2000, J. Comput. Syst. Sci.

[4] V. Vovk. Competitive On-line Statistics, 2001.

[5] Andreu Mas-Colell, et al. A General Class of Adaptive Strategies, 1999, J. Econ. Theory.

[6] Y. Freund, et al. Adaptive game playing using multiplicative weights, 1999.

[7] Dean P. Foster, et al. Regret in the On-Line Decision Problem, 1999.

[8] D. Fudenberg, et al. Conditional Universal Consistency, 1999.

[9] Claudio Gentile, et al. The Robustness of the p-Norm Algorithms, 1999, COLT '99.

[10] Robert E. Schapire, et al. Drifting Games, 1999, COLT '99.

[11] Manfred K. Warmuth, et al. Averaging Expert Predictions, 1999, EuroCOLT.