Stabilizing the unstable output of persistent homology computations

We propose a general technique for extracting a larger set of stable information from persistent homology computations than is currently done. The persistent homology algorithm is usually viewed as a procedure which starts with a filtered complex and ends with a persistence diagram. This procedure is stable, at least with respect to certain types of perturbations of the input, which justifies the use of the diagram as a signature of the input and the use of features derived from it in statistics and machine learning. However, these computations also produce other information of great interest to practitioners that is unfortunately unstable. For example, each point in the diagram corresponds to a simplex whose addition to the filtration results in the birth of the corresponding persistent homology class, but this correspondence is unstable. In addition, the persistence diagram is not stable with respect to other procedures that are employed in practice, such as thresholding a point cloud by density. We recast these problems as real-valued functions which are discontinuous but measurable, and then observe that convolving such a function with a suitable kernel produces a Lipschitz function. The resulting stable function can be estimated by perturbing the input and averaging the output. We illustrate this approach with a number of examples, including a stable localization of a persistent homology generator from brain imaging data.
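
The perturb-and-average estimate described above can be illustrated on a toy problem. The following is a minimal sketch under stated assumptions, not the method as implemented in the paper: the discontinuous function is a stand-in (an indicator of which input entry is largest, loosely mimicking "which simplex gives birth to a class"), the smoothing kernel is assumed to be an isotropic Gaussian, and all function names and parameter values are hypothetical. For two inputs the Gaussian smoothing has a closed form, which makes it possible to check the Monte Carlo average against it.

```python
import math
import random


def unstable_indicator(values, k):
    """Discontinuous toy output: 1.0 if entry k is the strict maximum, else 0.0."""
    return 1.0 if all(values[k] > v for i, v in enumerate(values) if i != k) else 0.0


def smoothed_estimate(f, values, sigma, n_samples, rng):
    """Monte Carlo estimate of f convolved with an isotropic Gaussian (assumed kernel).

    Draws n_samples perturbed copies of the input and averages f over them.
    """
    total = 0.0
    for _ in range(n_samples):
        perturbed = [v + rng.gauss(0.0, sigma) for v in values]
        total += f(perturbed)
    return total / n_samples


def exact_two_value_smoothing(a, b, sigma):
    """Closed form of P(a + e1 > b + e2) for independent N(0, sigma^2) noise."""
    return 0.5 * (1.0 + math.erf((a - b) / (2.0 * sigma)))


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is repeatable

    def first_is_max(vals):
        return unstable_indicator(vals, 0)

    # Two nearly equal inputs: the raw indicator flips, the smoothed value barely moves.
    for a in (0.99, 1.01):
        raw = first_is_max([a, 1.0])
        est = smoothed_estimate(first_is_max, [a, 1.0], 0.1, 20000, rng)
        exact = exact_two_value_smoothing(a, 1.0, 0.1)
        print(f"a={a}: raw={raw}, averaged={est:.3f}, closed form={exact:.3f}")
```

In this sketch the raw indicator jumps from 0 to 1 as the first input crosses the second, while the averaged value changes only slightly, in line with the Lipschitz behavior claimed for the smoothed function. The noise scale `sigma` trades stability against locality: a larger value gives a smaller Lipschitz constant but blurs the answer over a wider range of inputs.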
