论文信息 - Distance measures as prior probabilities

Distance measures as prior probabilities

Many learning algorithms, especially nonparametric ones, use distance measures as a source of prior knowledge about the domain. This paper shows how the work of Baxter and Yianilos provides a formal equivalence between distance measures and prior probability distributions in Bayesian inference. The prior distribution applies either to how the data was generated or to the shape of the discrimination boundary. This perspective is useful for extending distance-based algorithms to new feature spaces and especially for learning distance measures on those spaces.

T. Minka

[1] Keinosuke Fukunaga,et al. The optimal distance measure for nearest neighbor classification , 1981, IEEE Trans. Inf. Theory.

[2] David L. Waltz,et al. Toward memory-based reasoning , 1986, CACM.

[3] Jonathan Baxter,et al. Learning internal representations , 1995, COLT '95.

[4] Robert Tibshirani,et al. Discriminant Adaptive Nearest Neighbor Classification , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Jonathan Baxter,et al. The Canonical Distortion Measure for Vector Quantization and Function Approximation , 1997, ICML.

[6] Ran El-Yaniv,et al. Agnostic Classification of Markovian Sequences , 1997, NIPS.