论文信息 - Potential-Based Algorithms in On-Line Prediction and Game Theory ∗

Potential-Based Algorithms in On-Line Prediction and Game Theory ∗

In this paper we show that several known algorithms for sequential prediction problems (including Weighted Majority and the quasi-additive family of Grove, Littlestone, and Schuurmans), for playing iterated games (including Freund and Schapire’s Hedge and MW, as well as the -strategies of Hart and Mas-Colell), and for boosting (including AdaBoost) are special cases of a general decision strategy based on the notion of potential. By analyzing this strategy we derive known performance bounds, as well as new bounds, as simple corollaries of a single general theorem. Besides offering a new and unified view on a large family of algorithms, we establish a connection between potential-based analysis in learning and their counterparts independently developed in game theory. By exploiting this connection, we show that certain learning problems are instances of more general gametheoretic problems. In particular, we describe a notion of generalized regret and show its applications in learning theory.

J. Shawe-Taylor

[1] D. Blackwell. An analog of the minimax theorem for vector payoffs. , 1956 .

[2] James Hannan,et al. 4. APPROXIMATION TO RAYES RISK IN REPEATED PLAY , 1958 .

[3] A. A. Mullin,et al. Principles of neurodynamics , 1962 .

[4] H. D. Block. The perceptron: a model for brain functioning. I , 1962 .

[5] Albert B Novikoff,et al. ON CONVERGENCE PROOFS FOR PERCEPTRONS , 1963 .

[6] L. Bregman. The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming , 1967 .

[7] Manfred K. Warmuth,et al. The weighted majority algorithm , 1989, 30th Annual Symposium on Foundations of Computer Science.

[8] Vladimir Vovk,et al. Aggregating strategies , 1990, COLT '90.

[9] N. Littlestone. Mistake bounds and logarithmic linear-threshold learning algorithms , 1990 .

[10] Thomas M. Cover,et al. Elements of Information Theory , 2005 .

[11] David Haussler,et al. How to use expert advice , 1993, STOC.