Paper information - A Stochastic Treatment of Similarity

A Stochastic Treatment of Similarity

This study investigates a robust measure of similarity that applies across many domains and to data of many dimensions. Given a distance or discrepancy measure on a domain, the similarity of two values in that domain is defined as the probability that a randomly drawn pair of values from the domain is more different (lies at a larger distance) than these two values are. We discuss the motivation for this approach, its properties, and the issues that arise from it.
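As an illustration of the definition above, the following is a minimal sketch that estimates this similarity empirically from a finite sample. It is not the authors' implementation. The function name `empirical_similarity`, the use of unordered pairs of distinct sample elements, the strict inequality, and the default absolute-difference distance are all assumptions made for the example.

```python
import itertools


def empirical_similarity(x, y, sample, distance=lambda a, b: abs(a - b)):
    """Estimate the similarity of x and y as the fraction of pairs in
    `sample` that lie strictly farther apart than x and y do.

    Assumptions (not taken from the paper): pairs are unordered pairs of
    distinct sample positions, and ties do not count as "more different".
    """
    d_xy = distance(x, y)
    pairs = list(itertools.combinations(sample, 2))
    if not pairs:
        raise ValueError("sample must contain at least two values")
    farther = sum(1 for a, b in pairs if distance(a, b) > d_xy)
    return farther / len(pairs)


# Hypothetical one-dimensional sample standing in for the domain.
sample = [0, 1, 2, 3, 4]
close_pair = empirical_similarity(1, 2, sample)      # small distance
far_pair = empirical_similarity(0, 4, sample)        # largest distance in the sample
identical_pair = empirical_similarity(2, 2, sample)  # zero distance
```

Under these assumptions, values that are close together score near 1 and the most distant values score 0, because the score depends only on where their distance falls among the pairwise distances observed in the sample.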

Anca L. Ralescu | Sofia Visa | Stefana Popovici

[1] Ramon C. Littell, et al. Asymptotic Optimality of Fisher's Method of Combining Independent Tests, 1971.

[2] Tu Bao Ho, et al. Measuring the Similarity for Heterogenous Data: An Ordered Probability-Based Approach, 2004, Discovery Science.

[3] Michihiko Minoh, et al. Measuring Proximity between Heterogeneous Data, 2007, 2007 IEEE International Fuzzy Systems Conference.

[4] Stefana A. Popovici. On evaluating similarity between heterogeneous data, 2008.