论文信息 - Metric Learning Tutorial

Metric Learning Tutorial

Most popular machine learning algorithms like k-nearest neighbour, k-means, SVM uses a metric to identify the distance(or similarity) between data instances. It is clear that performances of these algorithm heavily depends on the metric being used. In absence of prior knowledge about data we can only use general purpose metrics like Euclidean distance, Cosine similarity or Manhattan distance etc, but these metric often fail to capture the correct behaviour of data which directly affects the performance of the learning algorithm. Solution to this problem is to tune the metric according to the data and the problem, manually deriving the metric for high dimensional data which is often difficult to even visualize is not only tedious but is extremely difficult. Which leads to put effort on metric learning which satisfies the data geometry. Goal of metric learning algorithm is to learn a metric which assigns small distance to similar points and relatively large distance to dissimilar points.

Parag Jain

[1] Bo Wang,et al. Unsupervised metric learning by Self-Smoothing Operator , 2011, 2011 International Conference on Computer Vision.

[2] Brian Kulis,et al. Metric Learning: A Survey , 2013, Found. Trends Mach. Learn..

[3] Dean P. Foster,et al. Unsupervised Distance Metric Learning Using Predictability , 2008 .

[4] Kilian Q. Weinberger,et al. Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[5] Yoram Singer,et al. Online and batch learning of pseudo-metrics , 2004, ICML.

[6] Bernt Schiele,et al. Active Metric Learning for Object Recognition , 2012, DAGM/OAGM Symposium.

[7] Jude W. Shavlik,et al. Mirror Descent for Metric Learning: A Unified Approach , 2012, ECML/PKDD.

[8] Fei Wang,et al. Two Heads Better Than One: Metric+Active Learning and its Applications for IT Service Classification , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[9] Inderjit S. Dhillon,et al. Information-theoretic metric learning , 2006, ICML '07.