A Stochastic Treatment of Similarity
暂无分享,去创建一个
This study investigates a robust measure of similarity applicable in many domains and across many dimensions of data. Given a distance or discrepancy measure on a domain, the similarity of two values in this domain is defined as the probability that any pair of values from that domain are more different (at a larger distance) than these two values are. We discuss the motivation for this approach, its properties, and the issues that arise from it.
[1] Ramon C. Littell,et al. Asymptotic Optimality of Fisher's Method of Combining Independent Tests , 1971 .
[2] Tu Bao Ho,et al. Measuring the Similarity for Heterogenous Data: An Ordered Probability-Based Approach , 2004, Discovery Science.
[3] Michihiko Minoh,et al. Measuring Proximity between Heterogeneous Data , 2007, 2007 IEEE International Fuzzy Systems Conference.
[4] Stefana A. Popovici. On evaluating similarity between heterogeneous data , 2008 .