Paper Information - Distance Metric Learning with Application to Clustering with Side-Information

Distance Metric Learning with Application to Clustering with Side-Information

Many algorithms rely critically on being given a good metric over their inputs. For instance, data can often be clustered in many "plausible" ways, and if a clustering algorithm such as K-means initially fails to find one that is meaningful to a user, the only recourse may be for the user to manually tweak the metric until sufficiently good clusters are found. For these and other applications requiring good metrics, it is desirable that we provide a more systematic way for users to indicate what they consider "similar." For instance, we may ask them to provide examples. In this paper, we present an algorithm that, given examples of similar (and, if desired, dissimilar) pairs of points in ℝ^n, learns a distance metric over ℝ^n that respects these relationships. Our method is based on posing metric learning as a convex optimization problem, which allows us to give efficient, local-optima-free algorithms. We also demonstrate empirically that the learned metrics can be used to significantly improve clustering performance.
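To make the idea in the abstract concrete, the following is a minimal sketch of learning a metric from similar and dissimilar pairs. It rests on several assumptions that the abstract does not state: the metric is restricted to a diagonal weighting d_a(x, y) = sqrt(sum_k a_k (x_k - y_k)^2), the objective (squared distances over similar pairs minus the log of summed distances over dissimilar pairs) is just one convex choice of this kind, and plain projected gradient descent stands in for whatever solver the paper uses. The names `weighted_distance` and `learn_diagonal_metric`, the toy data, and the step size are all hypothetical.

```python
import numpy as np

def weighted_distance(x, y, a):
    """Distance under a diagonal metric: sqrt(sum_k a_k * (x_k - y_k)^2)."""
    diff = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return float(np.sqrt(np.sum(a * diff ** 2)))

def learn_diagonal_metric(X, similar, dissimilar, steps=500, lr=0.05):
    """Illustrative only: fit non-negative per-dimension weights `a`.

    Assumed objective (convex in `a`):
        g(a) = sum_{(i,j) in S} d_a(x_i, x_j)^2 - log( sum_{(i,j) in D} d_a(x_i, x_j) )
    minimized by gradient steps followed by projection onto a >= 0.
    """
    X = np.asarray(X, dtype=float)
    # Per-pair squared coordinate differences, shape (num_pairs, n).
    sq_s = np.array([(X[i] - X[j]) ** 2 for i, j in similar])
    sq_d = np.array([(X[i] - X[j]) ** 2 for i, j in dissimilar])
    a = np.ones(X.shape[1])   # start from the Euclidean metric
    eps = 1e-12               # guards against division by zero
    for _ in range(steps):
        dist_d = np.sqrt(sq_d @ a + eps)   # distances of the dissimilar pairs
        grad_s = sq_s.sum(axis=0)          # gradient of the similar-pair term
        grad_d = (sq_d / (2.0 * dist_d[:, None])).sum(axis=0) / dist_d.sum()
        a = np.maximum(a - lr * (grad_s - grad_d), 0.0)   # project onto a >= 0
    return a

# Toy data (hypothetical): dimension 0 separates two groups, dimension 1 is nuisance.
X = np.array([[0.0, 0.0], [0.1, 1.0], [1.0, 0.0], [1.1, 1.0]])
similar = [(0, 1), (2, 3)]
dissimilar = [(0, 2), (1, 3), (0, 3), (1, 2)]

a = learn_diagonal_metric(X, similar, dissimilar)
print("learned weights:", a)
print("similar pair distance:   ", weighted_distance(X[0], X[1], a))
print("dissimilar pair distance:", weighted_distance(X[0], X[2], a))
```

Under the starting Euclidean metric, points 0 and 1 (marked similar) are slightly farther apart than points 0 and 2 (marked dissimilar). After fitting, the nuisance dimension is down-weighted, so the similar pair ends up closer than the dissimilar pair. Rescaling the data by sqrt(a) and then running K-means would be one way to use such weights for clustering, in the spirit of the abstract.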
