Practical applications of metric space magnitude and weighting vectors

Metric space magnitude, an active subject of research in algebraic topology, originally arose in biology, where it was used to represent the effective number of distinct species in an environment. More generally, the magnitude of a metric space is a real number that quantifies the effective number of distinct points in the space. The contribution of each point to the global magnitude is encoded by the weighting vector, which captures much of the underlying geometry of the original metric space. Surprisingly, when the metric space is Euclidean, the weighting vector also serves as an effective tool for boundary detection. This makes it a foundation for novel algorithms for classic machine learning tasks such as classification, outlier detection, and active learning. Experiments and comparisons on classic benchmark datasets demonstrate the promise of the proposed magnitude-based and weighting-vector-based approaches.
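To make the two central objects concrete, the following is a minimal sketch based on the standard definition for a finite metric space: the weighting vector w solves Z w = 1, where Z_ij = exp(-d(x_i, x_j)), and the magnitude is the sum of the entries of w. The function names, the `scale` parameter, and the choice of a dense linear solve are illustrative assumptions, not the implementation used in the paper.

```python
import numpy as np


def weighting_vector(points, scale=1.0):
    """Solve Z w = 1 with Z_ij = exp(-scale * ||x_i - x_j||).

    Assumes the rows of `points` are distinct points in Euclidean space,
    so that the similarity matrix Z is invertible.
    """
    points = np.asarray(points, dtype=float)
    # Pairwise Euclidean distances via broadcasting.
    diffs = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=-1))
    similarity = np.exp(-scale * dist)
    return np.linalg.solve(similarity, np.ones(len(points)))


def magnitude(points, scale=1.0):
    """Magnitude of the finite metric space: the sum of the weights."""
    return weighting_vector(points, scale).sum()


if __name__ == "__main__":
    # Eleven evenly spaced points on a line (illustrative data only).
    line = np.arange(11, dtype=float).reshape(-1, 1)
    print("weights:", weighting_vector(line))
    print("magnitude:", magnitude(line))
```

For evenly spaced points on a line, the two endpoints receive larger weights than the interior points, which is a small instance of the boundary-detection behavior described above.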
