Distance measures for point sets and their computation

Abstract.We consider the problem of measuring the similarity or distance between two finite sets of points in a metric space, and computing the measure. This problem has applications in, e.g., computational geometry, philosophy of science, updating or changing theories, and machine learning. We review some of the distance functions proposed in the literature, among them the minimum distance link measure, the surjection measure, and the fair surjection measure, and supply polynomial time algorithms for the computation of these measures. Furthermore, we introduce the minimum link measure, a new distance function which is more appealing than the other distance functions mentioned. We also present a polynomial time algorithm for computing this new measure. We further address the issue of defining a metric on point sets. We present the metric infimum method that constructs a metric from any distance functions on point sets. In particular, the metric infimum of the minimum link measure is a quite intuitive. The computation of this measure is shown to be in NP for a broad class of instances; it is NP-hard for a natural problem class.

[1]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[2]  Ronald L. Graham,et al.  Some NP-complete geometric problems , 1976, STOC '76.

[3]  Translator-IEEE Expert staff Machine Learning: A Theoretical Approach , 1992, IEEE Expert.

[4]  Peter Gärdenfors,et al.  Knowledge in Flux , 1988 .

[5]  Stephen Muggleton,et al.  Inductive acquisition of expert knowledge , 1986 .

[6]  Jan van Leeuwen,et al.  Graph Algorithms , 1991, Handbook of Theoretical Computer Science, Volume A: Algorithms and Complexity.

[7]  Pravin M. Vaidya,et al.  Geometry helps in matching , 1989, STOC '88.

[8]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[9]  Peter Z. Revesz On the semantics of theory change: arbitration between old and new information , 1993, PODS '93.

[10]  Heikki Mannila,et al.  Pruning and grouping of discovered association rules , 1995 .

[11]  Gabriel M. Kuper,et al.  Updating Logical Databases , 1986, Adv. Comput. Res..

[12]  Some Comments On Truth and the Growth of Knowledge , 1966 .

[13]  Kurt Mehlhorn,et al.  Congruence, similarity, and symmetries of geometric objects , 1987, SCG '87.

[14]  Kurt Mehlhorn,et al.  Graph Algorithm and NP-Completeness , 1984 .

[15]  Rae Baxter,et al.  Acknowledgments.-The authors would like to , 1982 .

[16]  Stefan Wrobel,et al.  On the Proper Definition of Minimality in Specialization and Theory Revision , 1993, ECML.

[17]  A. Bonato,et al.  Graphs and Hypergraphs , 2022 .

[18]  A. Alexandrova The British Journal for the Philosophy of Science , 1965, Nature.

[19]  Michael R. Anderberg,et al.  Cluster Analysis for Applications , 1973 .

[20]  H. Charles Romesburg,et al.  Cluster analysis for researchers , 1984 .

[21]  W. Salmon,et al.  Knowledge in Flux , 1991 .

[22]  G. Oddie Likeness to Truth , 1986 .