论文信息 - Local Feature Selection for the Relevance Vector Machine Using Adaptive Kernel Learning

Local Feature Selection for the Relevance Vector Machine Using Adaptive Kernel Learning

A Bayesian learning algorithm is presented that is based on a sparse Bayesian linear model (the Relevance Vector Machine (RVM)) and learns the parameters of the kernels during model training. The novel characteristic of the method is that it enables the introduction of parameters called `scaling factors' that measure the significance of each feature. Using the Bayesian framework, a sparsity promoting prior is then imposed on the scaling factors in order to eliminate irrelevant features. Feature selection is local, because different values are estimated for the scaling factors of each kernel, therefore different features are considered significant at different regions of the input space. We present experimental results on artificial data to demonstrate the advantages of the proposed model and then we evaluate our method on several commonly used regression and classification datasets.

Nikolas P. Galatsanos | Aristidis Likas | Dimitris Tzikas

[1] Bayesian wavelet analysis with a model complexity prior , 1999 .

[2] Christopher M. Bishop,et al. Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[3] Nikolas P. Galatsanos,et al. Sparse Bayesian Modeling With Adaptive Kernel Learning , 2009, IEEE Transactions on Neural Networks.

[4] Michael E. Tipping,et al. Fast Marginal Likelihood Maximisation for Sparse Bayesian Models , 2003 .

[5] George Eastman House,et al. Sparse Bayesian Learning and the Relevan e Ve tor Ma hine , 2001 .

[6] Lawrence Carin,et al. A Bayesian approach to joint feature selection and classifier design , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Christopher Bishop,et al. Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics , 2003 .

[8] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.

[9] Richard M. Everson,et al. Smooth relevance vector machine: a smoothness prior extension of the RVM , 2007, Machine Learning.