Paper Information - Experiments in Value Function Approximation with Sparse Support Vector Regression

Experiments in Value Function Approximation with Sparse Support Vector Regression

We present first experiments using Support Vector Regression (SVR) as the function approximator for an on-line, SARSA-like reinforcement learner. To overcome the batch nature of SVR, two ideas are employed. The first is sparse greedy approximation: the data are projected onto the subspace spanned by only a small subset of the original data (in feature space). This subset can be built up in an on-line fashion. Second, we use the sparsified data to solve a reduced quadratic problem, in which the number of variables is independent of the total number of training samples seen. The feasibility of this approach is demonstrated on two common toy problems.
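The on-line subset construction described in the abstract can be illustrated with a small sketch. The abstract does not state the exact selection rule, so the code below assumes a common choice: a new sample joins the subset only when the squared residual of its feature-space projection onto the current subset exceeds a threshold. The Gaussian kernel, the class name `SparseDictionary`, and the parameters `gamma` and `tol` are all illustrative assumptions, not taken from the paper.

```python
import numpy as np


def rbf_kernel(x, y, gamma=1.0):
    """Gaussian kernel (an assumed choice; the abstract names no kernel)."""
    d = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return float(np.exp(-gamma * np.dot(d, d)))


class SparseDictionary:
    """Hypothetical on-line subset builder based on a projection-residual test."""

    def __init__(self, kernel=rbf_kernel, tol=1e-3):
        self.kernel = kernel
        self.tol = tol          # assumed threshold on the squared residual
        self.points = []        # the selected subset of samples
        self.K_inv = None       # inverse kernel matrix of the subset

    def project(self, x):
        """Return projection coefficients and squared residual in feature space."""
        k = np.array([self.kernel(p, x) for p in self.points])
        a = self.K_inv @ k
        residual = self.kernel(x, x) - k @ a
        return a, residual

    def update(self, x):
        """Process one sample; return True if it was added to the subset."""
        x = np.asarray(x, dtype=float)
        if not self.points:
            self.points.append(x)
            self.K_inv = np.array([[1.0 / self.kernel(x, x)]])
            return True
        a, residual = self.project(x)
        if residual <= self.tol:
            return False        # well represented by the current subset
        # Grow the inverse kernel matrix with the block-inverse formula,
        # so no full matrix inversion is needed per step.
        n = len(self.points)
        K_inv = np.empty((n + 1, n + 1))
        K_inv[:n, :n] = self.K_inv + np.outer(a, a) / residual
        K_inv[:n, n] = -a / residual
        K_inv[n, :n] = -a / residual
        K_inv[n, n] = 1.0 / residual
        self.K_inv = K_inv
        self.points.append(x)
        return True
```

Calling `update` once per incoming state keeps the subset size governed by `tol` rather than by the number of samples seen, which is the property the reduced quadratic problem relies on. Solving that reduced problem is not shown here.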

Thomas Uthmann | Tobias Jung

[1] Tom Downs, et al. Exact Simplification of Support Vector Solutions, 2002, J. Mach. Learn. Res.

[2] Alexander J. Smola, et al. Learning with Kernels, 1998.

[3] Matthias W. Seeger, et al. Using the Nyström Method to Speed Up Kernel Machines, 2000, NIPS.

[4] John N. Tsitsiklis, et al. Neuro-Dynamic Programming, 1996, Encyclopedia of Machine Learning.

[5] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.

[6] Shie Mannor, et al. Sparse Online Greedy Support Vector Regression, 2002, ECML.

[7] Katya Scheinberg, et al. Efficient SVM Training Using Low-Rank Kernel Representations, 2002, J. Mach. Learn. Res.

[8] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.

[9] Richard S. Sutton, et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse, 1996.

[10] Xin Wang, et al. Batch Value Function Approximation via Support Vectors, 2001, NIPS.

[11] Y. Engel. Kernel Recursive Least Squares, 2004.

[12] Bernhard Schölkopf, et al. Sparse Greedy Matrix Approximation for Machine Learning, 2000, International Conference on Machine Learning.