PAC Analogues of Perceptron and Winnow Via Boosting the Margin

We describe a novel family of PAC-model algorithms for learning linear threshold functions. The new algorithms work by boosting a simple weak learner and exhibit sample complexity bounds remarkably similar to those of known online algorithms such as Perceptron and Winnow, suggesting that these well-studied online algorithms in some sense correspond to instances of boosting. We show that the new algorithms can be viewed as natural PAC analogues of the online p-norm algorithms that have recently been studied by Grove, Littlestone, and Schuurmans (1997, Proceedings of the Tenth Annual Conference on Computational Learning Theory (pp. 171–183)) and by Gentile and Littlestone (1999, Proceedings of the Twelfth Annual Conference on Computational Learning Theory (pp. 1–11)). As special cases, taking p = 2 and p = ∞ yields natural boosting-based PAC analogues of Perceptron and Winnow, respectively. The p = ∞ case of our algorithm can also be viewed as a generalization (with an improved sample complexity bound) of Jackson and Craven's PAC-model boosting-based algorithm for learning “sparse perceptrons” (Jackson & Craven, 1996, Advances in Neural Information Processing Systems 8, MIT Press). The analysis of the generalization error of the new algorithms relies on techniques from the theory of large margin classification.
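
For intuition about the p-norm family that the paper's PAC algorithms parallel, the sketch below implements the online p-norm update studied by Grove, Littlestone, and Schuurmans and by Gentile and Littlestone: an additively updated vector θ is passed through a p-norm link function to produce the prediction weights. This is a minimal illustrative sketch of the online algorithms referenced in the abstract, not the paper's boosting-based construction; the function names and training loop are our own.

```python
import numpy as np

def pnorm_link(theta, p):
    """p-norm link: w_i = sign(theta_i) * |theta_i|**(p-1) / ||theta||_p**(p-2).
    For p = 2 this is the identity map, so the loop below is exactly Perceptron."""
    norm = np.linalg.norm(theta, ord=p)
    if norm == 0.0:
        return np.zeros_like(theta)
    return np.sign(theta) * np.abs(theta) ** (p - 1) / norm ** (p - 2)

def pnorm_online(X, y, p=2.0, epochs=5):
    """Mistake-driven online p-norm algorithm: additive updates in theta-space,
    predictions made with the linked weights w = f(theta). Labels y_t in {-1, +1}."""
    theta = np.zeros(X.shape[1])
    for _ in range(epochs):
        for x_t, y_t in zip(X, y):
            w = pnorm_link(theta, p)
            if y_t * np.dot(w, x_t) <= 0:   # mistake (or zero margin)
                theta = theta + y_t * x_t   # additive update
    return pnorm_link(theta, p)
```

With p = 2 the link is the identity and the loop reduces to the classical Perceptron update; choosing p on the order of log n makes the weights depend multiplicatively on θ, which is the sense in which the p = ∞ case corresponds to Winnow.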

[1] Manfred K. Warmuth, et al. The Perceptron Algorithm Versus Winnow: Linear Versus Logarithmic Mistake Bounds when Few Input Variables are Relevant (Technical Note), 1997, Artif. Intell.

[2] Manfred K. Warmuth, et al. The perceptron algorithm vs. Winnow: linear vs. logarithmic mistake bounds when few input variables are relevant, 1995, COLT '95.

[3] Leslie G. Valiant, et al. Cryptographic Limitations on Learning Boolean Formulae and Finite Automata, 1993, Machine Learning: From Theory to Applications.

[4] Yoav Freund, et al. An improved boosting algorithm and its implications on learning complexity, 1992, COLT '92.

[5] Yoav Freund, et al. Game theory, on-line prediction and boosting, 1996, COLT '96.

[6] Tom Bylander. Worst-Case Analysis of the Perceptron and Exponentiated Update Algorithms, 1998, Artif. Intell.

[8] N. Fisher, et al. Probability Inequalities for Sums of Bounded Random Variables, 1994.

[9] Alexander A. Razborov, et al. Majority gates vs. general weighted threshold gates, 1992, Proceedings of the Seventh Annual Structure in Complexity Theory Conference.

[10] R. Schapire. The Strength of Weak Learnability, 1990, Machine Learning.

[11] Dale Schuurmans, et al. General Convergence Results for Linear Discriminant Updates, 1997, COLT '97.

[12] Rocco A. Servedio, et al. On PAC learning using Winnow, Perceptron, and a Perceptron-like algorithm, 1999, COLT '99.

[13] Alan M. Frieze, et al. A Polynomial-Time Algorithm for Learning Noisy Linear Threshold Functions, 1996, Algorithmica.

[14] Vladimir Vapnik, et al. Statistical learning theory, 1998.

[15] Yoav Freund, et al. Boosting the margin: A new explanation for the effectiveness of voting methods, 1997, ICML.

[17] Philip M. Long. Halfspace Learning, Linear Programming, and Nonmalicious Distributions, 1994, Inf. Process. Lett.

[18] W. Hoeffding. Probability Inequalities for Sums of Bounded Random Variables, 1963.

[19] David Haussler, et al. Learnability and the Vapnik-Chervonenkis dimension, 1989, JACM.

[20] N. Littlestone. Mistake bounds and logarithmic linear-threshold learning algorithms, 1990.

[21] M. Kearns, et al. Recent Results on Boolean Concept Learning, 1987.

[22] Noga Alon, et al. The Probabilistic Method, 2015, Fundamentals of Ramsey Theory.

[23] N. Littlestone. Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm, 1987, 28th Annual Symposium on Foundations of Computer Science (FOCS 1987).

[24] Yoram Singer, et al. Improved Boosting Algorithms Using Confidence-rated Predictions, 1998, COLT '98.

[25] Robert E. Schapire, et al. Drifting Games, 1999, COLT '99.

[26] Yoav Freund, et al. Large Margin Classification Using the Perceptron Algorithm, 1998, COLT '98.

[27] Claudio Gentile, et al. The Robustness of the p-Norm Algorithms, 1999, COLT '99.

[28] Yoav Freund, et al. Boosting a weak learning algorithm by majority, 1995, COLT '90.

[29] Michael Schmitt, et al. Identification Criteria and Lower Bounds for Perceptron-Like Learning Rules, 1998, Neural Computation.

[30] Peter Auer, et al. Tracking the Best Disjunction, 1998, Machine Learning.

[31] Nick Littlestone, et al. From on-line to batch learning, 1989, COLT '89.

[32] Tunc Geveci, et al. Advanced Calculus, 2014, Nature.

[33] Wolfgang Maass, et al. How fast can a threshold gate learn, 1994, COLT 1994.

[34] Yoav Freund, et al. A decision-theoretic generalization of on-line learning and an application to boosting, 1995, EuroCOLT.

[35] Dana Angluin, et al. Queries and concept learning, 1988, Machine Learning.

[37] Chuanyi Ji, et al. Combinations of Weak Classifiers, 1996, NIPS.

[38] Yishay Mansour, et al. On the boosting ability of top-down decision tree learning algorithms, 1996, STOC '96.

[39] Mark Craven, et al. Learning Sparse Perceptrons, 1995, NIPS.

[40] Edith Cohen, et al. Learning noisy perceptrons by a perceptron in polynomial time, 1997, Proceedings of the 38th Annual Symposium on Foundations of Computer Science.

[41] Nick Littlestone, et al. Redundant noisy attributes, attribute errors, and linear-threshold learning using winnow, 1991, COLT '91.

[42] Eric B. Baum, et al. The Perceptron Algorithm is Fast for Nonmalicious Distributions, 1990, Neural Computation.

[43] Alexander A. Razborov, et al. Majority gates vs. general weighted threshold gates, 2005, computational complexity.

[44] John Shawe-Taylor, et al. Generalization Performance of Support Vector Machines and Other Pattern Classifiers, 1999.