Novel Distance-Based SVM Kernels for Infinite Ensemble Learning

Ensemble learning algorithms such as boosting can achieve better performance by averaging over the predictions of base hypotheses. However, most existing algorithms are limited to combining only a finite number of hypotheses, and the generated ensemble is usually sparse. It has recently been shown that the support vector machine (SVM) with a carefully crafted kernel can be used to construct a nonsparse ensemble of infinitely many hypotheses. Such infinite ensembles may surpass finite and/or sparse ensembles in learning performance and robustness. In this paper, we derive two novel kernels, the stump kernel and the perceptron kernel, for infinite ensemble learning. The stump kernel embodies an infinite number of decision stumps, and measures the similarity between examples by the ℓ1-norm distance. The perceptron kernel embodies perceptrons, and works with the ℓ2-norm distance. Experimental results show that SVM with these kernels is superior to boosting with the same base hypothesis set. In addition, SVM with these kernels has similar performance to SVM with the Gaussian kernel, but enjoys the benefit of faster parameter selection. These properties make the kernels favorable choices in practice.
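To make the distance-based construction concrete, the sketch below (Python with scikit-learn, not taken from the paper) trains an SVM with kernels of the form K(x, x') = Δ − ‖x − x'‖_1 (stump kernel) and K(x, x') = Δ − ‖x − x'‖_2 (perceptron kernel), which matches the forms described above. The constant Δ, the data, and all variable names are illustrative assumptions, and the example is a minimal usage sketch rather than the authors' implementation.

```python
import numpy as np
from sklearn.svm import SVC

def l1_dist(A, B):
    # Pairwise l1 (Manhattan) distances between rows of A and rows of B.
    return np.abs(A[:, None, :] - B[None, :, :]).sum(axis=-1)

def l2_dist(A, B):
    # Pairwise l2 (Euclidean) distances between rows of A and rows of B.
    diff = A[:, None, :] - B[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def stump_kernel(A, B, delta):
    # Stump kernel: K(x, x') = delta - ||x - x'||_1.
    return delta - l1_dist(A, B)

def perceptron_kernel(A, B, delta):
    # Perceptron kernel: K(x, x') = delta - ||x - x'||_2.
    return delta - l2_dist(A, B)

# --- illustrative usage with a precomputed-kernel SVM ---
rng = np.random.default_rng(0)
X_tr = rng.uniform(-1, 1, size=(200, 5))
y_tr = np.sign(X_tr[:, 0] - 0.5 * X_tr[:, 1])

# delta is a constant shift; fixing it once from the training data (an
# illustrative choice) keeps the training and test Gram matrices consistent.
delta = l1_dist(X_tr, X_tr).max() + 1.0

clf = SVC(kernel="precomputed", C=1.0)
clf.fit(stump_kernel(X_tr, X_tr, delta), y_tr)

X_te = rng.uniform(-1, 1, size=(50, 5))
# Test Gram matrix: rows index test points, columns index training points.
y_pred = clf.predict(stump_kernel(X_te, X_tr, delta))
```

Swapping stump_kernel for perceptron_kernel changes only the distance used; beyond that, only the soft-margin parameter C remains to be tuned, which is the faster parameter selection referred to in the abstract.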
