Incorporating fuzzy prior knowledge into Relevance Vector Machine regression

Although supervised learning has been widely used to tackle problems of function approximation and regression estimation, prior knowledge fails to be incorporated into the data-driven approach because the form of input-output data pairs are not applied. To overcome this limitation, focusing on the fusion between rough fuzzy system and very rare samples of input-output pairs with noise, this paper presents a simple but effective re-sampling algorithm based on piecewise differential interpolation and it is integrated with the sparse Bayesian learning framework for fuzzy model fused Relevance Vector Machine (RVM) regression. By using resampling algorithm encoded derivative regularization, the prior knowledge is translated into a pseudo training dataset, which finally is trained by the adaptive Gaussian kernel RVM to obtain more sparse solution. A preliminary empirical study shows that combining prior knowledge with training examples can dramatically improve the regression performance, particularly when the training dataset is limited.

[1]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[2]  Jude W. Shavlik,et al.  Knowledge-Based Kernel Approximation , 2004, J. Mach. Learn. Res..

[3]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[4]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[5]  Tao Yu,et al.  Integrating relevance vector machines and genetic algorithms for optimization of seed-separating process , 2007, Eng. Appl. Artif. Intell..

[6]  Bernhard Schölkopf,et al.  Prior Knowledge in Support Vector Kernels , 1997, NIPS.

[7]  David J. C. MacKay,et al.  The Evidence Framework Applied to Classification Networks , 1992, Neural Computation.

[8]  Tomaso Poggio,et al.  Incorporating prior information in machine learning by creating virtual examples , 1998, Proc. IEEE.

[9]  Rohini K. Srihari,et al.  Incorporating prior knowledge with weighted margin support vector machines , 2004, KDD.

[10]  Lotfi A. Zadeh,et al.  Outline of a New Approach to the Analysis of Complex Systems and Decision Processes , 1973, IEEE Trans. Syst. Man Cybern..

[11]  John MacIntyre,et al.  Integrating RBF networks with domain knowledge , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[12]  Tao Yu,et al.  Adaptive spherical Gaussian kernel in sparse Bayesian learning framework for nonlinear regression , 2009, Expert Syst. Appl..

[13]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[14]  Joaquin Quiñonero-Candela,et al.  Learning with Uncertainty: Gaussian Processes and Relevance Vector Machines , 2004 .

[15]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[16]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[17]  Ankur Agarwal,et al.  3D human pose from silhouettes by relevance vector regression , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[18]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[20]  Bernhard Sendhoff,et al.  Knowledge Incorporation into Neural Networks From Fuzzy Rules , 2004, Neural Processing Letters.

[21]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevance Vector Machine , 2001 .

[22]  Andrew Blake,et al.  Sparse Bayesian learning for efficient visual tracking , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  John Hallam,et al.  IEEE International Joint Conference on Neural Networks , 2005 .