Learning metrics for persistence-based summaries and applications for graph classification

Recently a new feature representation and data analysis methodology based on a topological tool called persistent homology (and its corresponding persistence diagram summary) has started to attract momentum. A series of methods have been developed to map a persistence diagram to a vector representation so as to facilitate the downstream use of machine learning tools, and in these approaches, the importance (weight) of different persistence features are often preset. However often in practice, the choice of the weight function should depend on the nature of the specific type of data one considers, and it is thus highly desirable to learn a best weight function (and thus metric for persistence diagrams) from labelled data. We study this problem and develop a new weighted kernel, called WKPI, for persistence summaries, as well as an optimization framework to learn a good metric for persistence summaries. Both our kernel and optimization problem have nice properties. We further apply the learned kernel to the challenging task of graph classification, and show that our WKPI-based classification framework obtains similar or (sometimes significantly) better results than the best results from a range of previous graph classification frameworks on a collection of benchmark datasets.

[1]  James G. King,et al.  Reconstruction and Simulation of Neocortical Microcircuitry , 2015, Cell.

[2]  Xavier Bresson,et al.  CayleyNets: Graph Convolutional Neural Networks With Complex Rational Spectral Filters , 2017, IEEE Transactions on Signal Processing.

[3]  Mathias Niepert,et al.  Learning Convolutional Neural Networks for Graphs , 2016, ICML.

[4]  Peter Bubenik,et al.  Statistical topological data analysis using persistence landscapes , 2012, J. Mach. Learn. Res..

[5]  Henry Adams,et al.  Persistence Images: A Stable Vector Representation of Persistent Homology , 2015, J. Mach. Learn. Res..

[6]  Hans-Peter Kriegel,et al.  Protein function prediction via graph kernels , 2005, ISMB.

[7]  Edwin R. Hancock,et al.  A quantum Jensen-Shannon graph kernel for unattributed graphs , 2015, Pattern Recognit..

[8]  Dmitriy Morozov,et al.  Geometry Helps to Compare Persistence Diagrams , 2016, ALENEX.

[9]  Tam Le,et al.  Persistence Fisher Kernel : A Riemannian Manifold Kernel for Persistence Diagrams , 2018 .

[10]  Frédéric Chazal,et al.  A General Neural Network Architecture for Persistence Diagrams and Graph Classification , 2019, ArXiv.

[11]  Afra Zomorodian,et al.  The Theory of Multidimensional Persistence , 2007, SCG '07.

[12]  Ulrich Bauer,et al.  PHAT - Persistent Homology Algorithms Toolbox , 2014, ICMS.

[13]  S. Yau,et al.  Ricci curvature of graphs , 2011 .

[14]  Steve Oudot,et al.  Sliced Wasserstein Kernel for Persistence Diagrams , 2017, ICML.

[15]  Mathieu Carrière,et al.  PersLay: A Neural Network Layer for Persistence Diagrams and New Graph Topological Signatures , 2020, AISTATS.

[16]  Steve Oudot,et al.  The Structure and Stability of Persistence Modules , 2012, Springer Briefs in Mathematics.

[17]  Ulrich Bauer,et al.  A stable multi-scale kernel for topological machine learning , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  P. Dobson,et al.  Distinguishing enzyme structures from non-enzymes without alignments. , 2003, Journal of molecular biology.

[19]  Manohar Kaul,et al.  Understanding and Predicting Links in Graphs: A Persistent Homology Perspective , 2018, ArXiv.

[20]  Michael Kerber,et al.  Barcodes of Towers and a Streaming Algorithm for Persistent Homology , 2019, Discret. Comput. Geom..

[21]  David Cohen-Steiner,et al.  Extending Persistence Using Poincaré and Lefschetz Duality , 2009, Found. Comput. Math..

[22]  Dmitriy Morozov,et al.  Zigzag persistent homology and real-valued functions , 2009, SCG '09.

[23]  Herbert Edelsbrunner,et al.  Computational Topology - an Introduction , 2009 .

[24]  Kurt Mehlhorn,et al.  Efficient graphlet kernels for large graph comparison , 2009, AISTATS.

[25]  Roman Garnett,et al.  Efficient Graph Kernels by Randomization , 2012, ECML/PKDD.

[26]  Mariette Yvinec,et al.  The Gudhi Library: Simplicial Complexes and Persistent Homology , 2014, ICMS.

[27]  Ashwin Srinivasan,et al.  The Predictive Toxicology Challenge 2000-2001 , 2001, Bioinform..

[28]  Don Sheehy,et al.  Linear-Size Approximations to the Vietoris–Rips Filtration , 2012, Discrete & Computational Geometry.

[29]  Alexander J. Smola,et al.  Deep Sets , 2017, 1703.06114.

[30]  A. Debnath,et al.  Structure-activity relationship of mutagenic aromatic and heteroaromatic nitro compounds. Correlation with molecular orbital energies and hydrophobicity. , 1991, Journal of medicinal chemistry.

[31]  Kurt Mehlhorn,et al.  Weisfeiler-Lehman Graph Kernels , 2011, J. Mach. Learn. Res..

[32]  Yijian Xiang,et al.  RetGK: Graph Kernels based on Return Probabilities of Random Walks , 2018, NeurIPS.

[33]  Jure Leskovec,et al.  How Powerful are Graph Neural Networks? , 2018, ICLR.

[34]  Kenji Fukumizu,et al.  Kernel Method for Persistence Diagrams via Kernel Embedding and Weight Factor , 2017, J. Mach. Learn. Res..

[35]  Mikhail Belkin,et al.  Diving into the shallows: a computational perspective on large-scale shallow learning , 2017, NIPS.

[36]  Andreas Uhl,et al.  Deep Learning with Topological Signatures , 2017, NIPS.

[37]  Zhi-Li Zhang,et al.  Hunt For The Unique, Stable, Sparse And Fast Feature Learning On Graphs , 2017, NIPS.

[38]  Risi Kondor,et al.  Covariant Compositional Networks For Learning Graphs , 2018, ICLR.

[39]  Herbert Edelsbrunner,et al.  Topological persistence and simplification , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[40]  Nils M. Kriege,et al.  On Valid Optimal Assignment Kernels and Applications to Graph Classification , 2016, NIPS.

[41]  Xiaofeng Wang,et al.  A Mixed Weisfeiler-Lehman Graph Kernel , 2015, GbRPR.

[42]  David Cohen-Steiner,et al.  Stability of Persistence Diagrams , 2005, Discret. Comput. Geom..

[43]  Gunnar E. Carlsson,et al.  Zigzag Persistence , 2008, Found. Comput. Math..

[44]  Hisashi Kashima,et al.  A Linear-Time Graph Kernel , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[45]  Yanjie Li,et al.  Metrics for comparing neuronal tree shapes based on persistent homology , 2016, bioRxiv.

[46]  Karsten M. Borgwardt,et al.  A Persistent Weisfeiler-Lehman Procedure for Graph Classification , 2019, ICML.

[47]  David Cohen-Steiner,et al.  Lipschitz Functions Have Lp-Stable Persistence , 2010, Found. Comput. Math..

[48]  Leonidas J. Guibas,et al.  Proximity of persistence modules and their diagrams , 2009, SCG '09.

[49]  Pinar Yanardag,et al.  Deep Graph Kernels , 2015, KDD.

[50]  Yousef Saad,et al.  Trace optimization and eigenproblems in dimension reduction methods , 2011, Numer. Linear Algebra Appl..