Paper Information - Metric Learning for Kernel Regression

Metric Learning for Kernel Regression

Kernel regression is a well-established method for nonlinear regression in which the target value for a test point is estimated using a weighted average of the surrounding training samples. The weights are typically obtained by applying a distance-based kernel function to each of the samples, which presumes the existence of a well-defined distance metric. In this paper, we construct a novel algorithm for supervised metric learning, which learns a distance function by directly minimizing the leave-one-out regression error. We show that our algorithm makes kernel regression comparable with the state of the art on several benchmark datasets, and we provide efficient implementation details enabling application to datasets with ∼O(10k) instances. Further, we show that our algorithm can be viewed as a supervised variation of PCA and can be used for dimensionality reduction and high-dimensional data visualization.
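The objective described in the abstract can be illustrated with a small sketch. It is not the paper's implementation: the Gaussian kernel with fixed unit bandwidth, the parameterization of the distance by a linear map A, the toy data, and all function names (`transform`, `sq_dist`, `loo_predictions`, `loo_error`) are assumptions made for illustration. The sketch only evaluates the leave-one-out error for two hand-picked candidate metrics; it does not perform the minimization itself.

```python
import math


def transform(A, x):
    """Apply the linear map A (a list of rows) to the vector x."""
    return [sum(a * v for a, v in zip(row, x)) for row in A]


def sq_dist(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def loo_predictions(A, X, y):
    """Leave-one-out kernel regression estimates.

    Each target is estimated as a kernel-weighted average of all OTHER
    training targets, with distances measured after mapping inputs by A.
    Assumption: Gaussian kernel exp(-d^2) with unit bandwidth.
    """
    Z = [transform(A, x) for x in X]
    preds = []
    for i, zi in enumerate(Z):
        num = 0.0
        den = 0.0
        for j, zj in enumerate(Z):
            if i == j:
                continue  # leave the point itself out
            k = math.exp(-sq_dist(zi, zj))
            num += k * y[j]
            den += k
        # Fall back to 0.0 if every weight underflows to zero.
        preds.append(num / den if den > 0.0 else 0.0)
    return preds


def loo_error(A, X, y):
    """Sum of squared leave-one-out residuals for the metric induced by A."""
    return sum((p - t) ** 2 for p, t in zip(loo_predictions(A, X, y), y))


# Toy data (hypothetical): the target depends only on the first feature;
# the second feature is irrelevant but has a large scale.
X = [(0.0, 0.0), (0.0, 10.0), (5.0, 0.0), (5.0, 10.0)]
y = [0.0, 0.0, 1.0, 1.0]

identity = [[1.0, 0.0], [0.0, 1.0]]  # plain Euclidean distance
informed = [[1.0, 0.0], [0.0, 0.0]]  # ignores the irrelevant feature

print("LOO error, Euclidean metric:", loo_error(identity, X, y))
print("LOO error, informed metric: ", loo_error(informed, X, y))
```

Under the Euclidean metric, each point's nearest neighbor differs in the relevant feature, so the leave-one-out estimates are poor; discarding the irrelevant feature makes the nearest neighbor share the same target. A metric learning procedure of the kind the abstract describes would search over A to reduce this error rather than compare fixed candidates.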

Kilian Q. Weinberger | Gerald Tesauro

[1] J. K. Benedetti. On the Nonparametric Estimation of Regression Functions, 1977.

[2] H. Müller et al. Kernel estimation of regression functions, 1979.

[3] Steve R. Waterhouse et al. Constructive Algorithms for Hierarchical Mixtures of Experts, 1995, NIPS.

[4] Xiaofei He et al. Locality Preserving Projections, 2003, NIPS.

[5] Geoffrey E. Hinton et al. Neighbourhood Components Analysis, 2004, NIPS.

[6] Kilian Q. Weinberger et al. Unsupervised Learning of Image Manifolds by Semidefinite Programming, 2004, CVPR.

[7] Naftali Tishby et al. Nearest Neighbor Based Feature Selection for Regression and its Application to Neural Activity, 2005, NIPS.

[8] John Langford et al. Cover trees for nearest neighbor, 2006, ICML.

[9] Shie Mannor et al. Automatic basis function construction for approximate dynamic programming and reinforcement learning, 2006, ICML.

[10] Peyman Milanfar et al. Robust Kernel Regression for Restoration and Reconstruction of Images from Sparse Noisy Data, 2006, 2006 International Conference on Image Processing.

[11] Carl E. Rasmussen et al. Gaussian processes for machine learning, 2005, Adaptive computation and machine learning.

[12] Heng Tao Shen et al. Principal Component Analysis, 2009, Encyclopedia of Biometrics.